Pandas: Is there a way to use something like 'droplevel' and in process, rename the other level using the dropped level labels as prefix/suffix? Pandas: Is there a way to use something like 'droplevel' and in process, rename the other level using the dropped level labels as prefix/suffix? pandas pandas

Pandas: Is there a way to use something like 'droplevel' and in process, rename the other level using the dropped level labels as prefix/suffix?


Use list comprehension for set new column names:

df.columns = df.columns.map('_'.join)Or:df.columns = ['_'.join(col) for col in df.columns]

Sample:

df = pd.DataFrame({'A':[1,2,2,1],                   'B':[4,5,6,4],                   'C':[7,8,9,1],                   'D':[1,3,5,9]})print (df)   A  B  C  D0  1  4  7  11  2  5  8  32  2  6  9  53  1  4  1  9df = df.groupby('A').agg([max, min])df.columns = df.columns.map('_'.join)print (df)   B_max  B_min  C_max  C_min  D_max  D_minA                                          1      4      4      7      1      9      12      6      5      9      8      5      3

print (['_'.join(col) for col in df.columns])['B_max', 'B_min', 'C_max', 'C_min', 'D_max', 'D_min']df.columns = ['_'.join(col) for col in df.columns]print (df)   B_max  B_min  C_max  C_min  D_max  D_minA                                          1      4      4      7      1      9      12      6      5      9      8      5      3

If need prefix simple swap items of tuples:

df.columns = ['_'.join((col[1], col[0])) for col in df.columns]print (df)   max_B  min_B  max_C  min_C  max_D  min_DA                                          1      4      4      7      1      9      12      6      5      9      8      5      3

Another solution:

df.columns = ['{}_{}'.format(i[1], i[0]) for i in df.columns]print (df)   max_B  min_B  max_C  min_C  max_D  min_DA                                          1      4      4      7      1      9      12      6      5      9      8      5      3

If len of columns is big (10^6), then rather use to_series and str.join:

df.columns = df.columns.to_series().str.join('_')


Using @jezrael's setup

df = pd.DataFrame({'A':[1,2,2,1],                   'B':[4,5,6,4],                   'C':[7,8,9,1],                   'D':[1,3,5,9]})df = df.groupby('A').agg([max, min])

Assign new columns with

from itertools import starmapdef flat(midx, sep=''):    fstr = sep.join(['{}'] * midx.nlevels)    return pd.Index(starmap(fstr.format, midx))df.columns = flat(df.columns, '_')df

enter image description here