Confidence interval for LOWESS in Python

python regression statsmodels smoothing loess

LOESS doesn't have an explicit concept for standard error. It just doesn't mean anything in this context. Since that's out, your stuck with the brute-force approach.

Bootstrap your data. Your going to fit a LOESS curve to the bootstrapped data. See the middle of this page to find a pretty picture of what your doing. http://statweb.stanford.edu/~susan/courses/s208/node20.html

Once you have your large number of different LOESS curves, you can find the top and bottom Xth percentile.

python regression statsmodels smoothing loess

This is a very old question but it's one of the first that pops up on google search. You can do this using the loess() function from scikit-misc. Here's an example (I tried to keep your original variable names, but I bumped up the noise a bit to make it more visible)

import numpy as npimport pylab as pltfrom skmisc.loess import loessx = np.linspace(0,2*np.pi,100)y = np.sin(x) + np.random.random(100) * 0.4l = loess(x,y)l.fit()pred = l.predict(x, stderror=True)conf = pred.confidence()lowess = pred.valuesll = conf.lowerul = conf.upperplt.plot(x, y, '+')plt.plot(x, lowess)plt.fill_between(x,ll,ul,alpha=.33)plt.show()

result:

python regression statsmodels smoothing loess

For a project of mine, I need to create intervals for time-series modeling, and to make the procedure more efficient I created tsmoothie: A python library for time-series smoothing and outlier detection in a vectorized way.

It provides different smoothing algorithms together with the possibility to computes intervals.

In the case of LowessSmoother:

import numpy as npimport matplotlib.pyplot as pltfrom tsmoothie.smoother import *from tsmoothie.utils_func import sim_randomwalk# generate 10 randomwalks of length 200np.random.seed(33)data = sim_randomwalk(n_series=10, timesteps=200,                       process_noise=10, measure_noise=30)# operate smoothingsmoother = LowessSmoother(smooth_fraction=0.1, iterations=1)smoother.smooth(data)# generate intervalslow, up = smoother.get_intervals('prediction_interval', confidence=0.05)# plot the first smoothed timeseries with intervalsplt.figure(figsize=(11,6))plt.plot(smoother.smooth_data[0], linewidth=3, color='blue')plt.plot(smoother.data[0], '.k')plt.fill_between(range(len(smoother.data[0])), low[0], up[0], alpha=0.3)

I point out also that tsmoothie can carry out the smoothing of multiple time-series in a vectorized way. Hope this can help someone

CodeHunter

Confidence interval for LOWESS in Python

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last