pandas rolling max with groupby

python python-3.x pandas dataframe group-by

It looks like you need cummax() instead of .rolling(N).max()

In [29]: df['new'] = df.groupby('id').value.cummax()In [30]: dfOut[30]:   id  value  new0   1      3    31   1      6    62   1      3    63   2      2    24   2      1    2

Timing (using brand new Pandas version 0.20.1):

In [3]: df = pd.concat([df] * 10**4, ignore_index=True)In [4]: df.shapeOut[4]: (50000, 2)In [5]: %timeit df.groupby('id').value.apply(lambda x: x.cummax())100 loops, best of 3: 15.8 ms per loopIn [6]: %timeit df.groupby('id').value.cummax()100 loops, best of 3: 4.09 ms per loop

NOTE: from Pandas 0.20.0 what's new

Improved performance of groupby().cummin() and groupby().cummax() (GH15048, GH15109, GH15561, GH15635)

python python-3.x pandas dataframe group-by

Using apply will be a tiny bit faster:

# Using apply  df['output'] = df.groupby('id').value.apply(lambda x: x.cummax())%timeit df['output'] = df.groupby('id').value.apply(lambda x: x.cummax())1000 loops, best of 3: 1.57 ms per loop

Other method:

df['output'] = df.groupby('id').value.cummax()%timeit df['output'] = df.groupby('id').value.cummax()1000 loops, best of 3: 1.66 ms per loop

CodeHunter

pandas rolling max with groupby

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last