Rolling or sliding window iterator?

python algorithm

There's one in an old version of the Python docs with itertools examples:

    from itertools import islice

    def window(seq, n=2):
        "Returns a sliding window (of width n) over data from the iterable"
        "   s -> (s0,s1,...s[n-1]), (s1,s2,...,sn), ...                   "
        it = iter(seq)
        result = tuple(islice(it, n))
        if len(result) == n:
            yield result
        for elem in it:
            result = result[1:] + (elem,)
            yield result

The one from the docs is a little more succinct and, I imagine, uses itertools to greater effect.
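As a quick check of what the docs recipe produces, here is a small usage sketch. It is written for Python 3, and the sample inputs are only illustrative. Note that an input shorter than n yields no windows at all.

```python
from itertools import islice

def window(seq, n=2):
    """Sliding window of width n over seq (same logic as the docs recipe)."""
    it = iter(seq)
    result = tuple(islice(it, n))
    if len(result) == n:
        yield result
    for elem in it:
        # Build a new tuple each time, so earlier windows are never modified.
        result = result[1:] + (elem,)
        yield result

print(list(window("abcd", 2)))  # [('a', 'b'), ('b', 'c'), ('c', 'd')]
print(list(window([1, 2], 3)))  # [] because the input is shorter than the window
```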


This seems tailor-made for a collections.deque since you essentially have a FIFO (add to one end, remove from the other). However, even if you use a list you shouldn't be slicing twice; instead, you should probably just pop(0) from the list and append() the new item.
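For comparison, the list-based approach mentioned above (pop(0) plus append()) might look roughly like this in Python 3. The name window_list is made up for illustration, and this is only a sketch of the idea, not code from the question.

```python
def window_list(seq, n=2):
    """Sliding window that uses a plain list as the FIFO (hypothetical helper)."""
    win = []
    for elem in seq:
        win.append(elem)          # add the new item at one end
        if len(win) > n:
            win.pop(0)            # remove the oldest item; O(n) on a list
        if len(win) == n:
            yield tuple(win)      # copy, so each window is independent

print(list(window_list(range(5), 3)))  # [(0, 1, 2), (1, 2, 3), (2, 3, 4)]
```

Because list.pop(0) has to shift every remaining element, its cost grows with the window size, which is one reason a deque is the more natural fit.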

Here is an optimized deque-based implementation patterned after your original:

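    from collections import deque

    def window(seq, n=2):
        it = iter(seq)
        # Pre-fill with the first n items (None if the input runs out early).
        # maxlen=n makes every later append() drop the oldest item.
        win = deque((next(it, None) for _ in range(n)), maxlen=n)
        yield win
        append = win.append  # cache the bound method to skip a lookup per item
        for e in it:
            append(e)
            yield win        # the same deque object, updated in place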

In my tests it handily beats everything else posted here most of the time, though pillmuncher's tee version beats it for large iterables and small windows. On larger windows, the deque pulls ahead again in raw speed.
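One thing to keep in mind with the deque version: it yields the same deque object every time and pads short inputs with None. The sketch below (Python 3, with illustrative inputs) shows both effects, and how copying each window avoids the first one.

```python
from collections import deque

def window(seq, n=2):
    it = iter(seq)
    win = deque((next(it, None) for _ in range(n)), maxlen=n)
    yield win
    for e in it:
        win.append(e)
        yield win

# Storing the yielded objects keeps several references to one deque,
# so every entry ends up showing the final window.
aliased = list(window(range(5), 3))
print(aliased[0] is aliased[-1])    # True
print([list(w) for w in aliased])   # [[2, 3, 4], [2, 3, 4], [2, 3, 4]]

# Copying each window as it is produced avoids that.
copied = [tuple(w) for w in window(range(5), 3)]
print(copied)                       # [(0, 1, 2), (1, 2, 3), (2, 3, 4)]

# Inputs shorter than n are padded with None rather than skipped.
print(list(next(window([1], 3))))   # [1, None, None]
```

If you only consume each window inside the loop body, as with sum(w), no copy is needed.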

Access to individual items in the deque may be faster or slower than with lists or tuples. (Items near the beginning are faster, or items near the end if you use a negative index.) I put a sum(w) in the body of my loop; this plays to the deque's strength (iterating from one item to the next is fast), so this loop ran a full 20% faster than the next fastest method, pillmuncher's. When I changed it to individually look up and add items in a window of ten, the tables turned and the tee method was 20% faster. I was able to recover some speed by using negative indexes for the last five terms in the addition, but tee was still a little faster. Overall I would estimate that either one is plenty fast for most uses, and if you need a little more performance, profile and pick the one that works best.
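To try this comparison yourself, a timing harness along these lines may help. It is a sketch in Python 3, not the benchmark used for the numbers above: the input size, the repeat count, and the names window_deque, window_tee, total_by_iteration and total_by_index are all assumptions, and results will vary by machine and Python version.

```python
from collections import deque
from itertools import tee
from timeit import timeit

def window_deque(seq, n=2):
    it = iter(seq)
    win = deque((next(it, None) for _ in range(n)), maxlen=n)
    yield win
    for e in it:
        win.append(e)
        yield win

def window_tee(iterable, size):
    iters = tee(iterable, size)
    for i in range(1, size):
        for each in iters[i:]:
            next(each, None)
    return zip(*iters)

DATA = range(20_000)  # assumed input size
N = 10                # window of ten, as in the text

def total_by_iteration(window):
    # sum(w) walks each window from one item to the next.
    return sum(sum(w) for w in window(DATA, N))

def total_by_index(window):
    # Look up every position individually; negative indexes for the last five.
    return sum(
        w[0] + w[1] + w[2] + w[3] + w[4] + w[-5] + w[-4] + w[-3] + w[-2] + w[-1]
        for w in window(DATA, N)
    )

for name, fn in (("deque", window_deque), ("tee", window_tee)):
    t_iter = timeit(lambda: total_by_iteration(fn), number=3)
    t_index = timeit(lambda: total_by_index(fn), number=3)
    print(f"{name:5}  sum(w): {t_iter:.3f}s   indexed: {t_index:.3f}s")
```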


I like tee():

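    from itertools import tee

    def window(iterable, size):
        # One independent iterator per position in the window.
        iters = tee(iterable, size)
        # Advance the i-th iterator by i items so the copies are staggered.
        for i in range(1, size):
            for each in iters[i:]:
                next(each, None)
        # zip stops as soon as the most-advanced iterator is exhausted.
        return zip(*iters)

    for each in window(range(6), 3):
        print(list(each))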

gives:

    [0, 1, 2]
    [1, 2, 3]
    [2, 3, 4]
    [3, 4, 5]
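As a usage sketch for the tee() approach, here is a rolling sum in Python 3. The readings list is made-up sample data. Each window comes out of zip as a fresh tuple, so the windows can be stored safely, and a window wider than the input simply produces nothing.

```python
from itertools import tee

def window(iterable, size):
    iters = tee(iterable, size)
    for i in range(1, size):
        for each in iters[i:]:
            next(each, None)
    return zip(*iters)

readings = [1, 2, 3, 4, 5]  # illustrative data

print([sum(w) for w in window(readings, 3)])  # [6, 9, 12]
print(list(window(readings, 6)))              # []
```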
