Renaming Column Names in Pandas Groupby function [duplicate]

python pandas group-by pandas-groupby rename

For the first question I think answer would be:

<your DataFrame>.rename(columns={'count':'Total_Numbers'})

<your DataFrame>.columns = ['ID', 'Region', 'Total_Numbers']

As for second one I'd say the answer would be no. It's possible to use it like 'df.ID' because of python datamodel:

Attribute references are translated to lookups in this dictionary, e.g., m.x is equivalent to m.dict["x"]

python pandas group-by pandas-groupby rename

The current (as of version 0.20) method for changing column names after a groupby operation is to chain the rename method. See this deprecation note in the documentation for more detail.

Deprecated Answer as of pandas version 0.20

This is the first result in google and although the top answer works it does not really answer the question. There is a better answer here and a long discussion on github about the full functionality of passing dictionaries to the agg method.

These answers unfortunately do not exist in the documentation but the general format for grouping, aggregating and then renaming columns uses a dictionary of dictionaries. The keys to the outer dictionary are column names that are to be aggregated. The inner dictionaries have keys that the new column names with values as the aggregating function.

Before we get there, let's create a four column DataFrame.

df = pd.DataFrame({'A' : list('wwwwxxxx'),                    'B':list('yyzzyyzz'),                    'C':np.random.rand(8),                    'D':np.random.rand(8)})   A  B         C         D0  w  y  0.643784  0.8284861  w  y  0.308682  0.9940782  w  z  0.518000  0.7256633  w  z  0.486656  0.2595474  x  y  0.089913  0.2384525  x  y  0.688177  0.7531076  x  z  0.955035  0.4626777  x  z  0.892066  0.368850

Let's say we want to group by columns A, B and aggregate column C with mean and median and aggregate column D with max. The following code would do this.

df.groupby(['A', 'B']).agg({'C':['mean', 'median'], 'D':'max'})            D         C                    max      mean    medianA B                              w y  0.994078  0.476233  0.476233  z  0.725663  0.502328  0.502328x y  0.753107  0.389045  0.389045  z  0.462677  0.923551  0.923551

This returns a DataFrame with a hierarchical index. The original question asked about renaming the columns in the same step. This is possible using a dictionary of dictionaries:

df.groupby(['A', 'B']).agg({'C':{'C_mean': 'mean', 'C_median': 'median'},                             'D':{'D_max': 'max'}})            D         C                  D_max    C_mean  C_medianA B                              w y  0.994078  0.476233  0.476233  z  0.725663  0.502328  0.502328x y  0.753107  0.389045  0.389045  z  0.462677  0.923551  0.923551

This renames the columns all in one go but still leaves the hierarchical index which the top level can be dropped with df.columns = df.columns.droplevel(0).

CodeHunter

Renaming Column Names in Pandas Groupby function [duplicate]

Deprecated Answer as of pandas version 0.20

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last