How to select columns from dataframe by regex

You can use DataFrame.filter this way:

import pandas as pddf = pd.DataFrame(np.array([[2,4,4],[4,3,3],[5,9,1]]),columns=['d','t','didi'])>>   d  t  didi0  2  4     41  4  3     32  5  9     1df.filter(regex=("d.*"))>>   d  didi0  2     41  4     32  5     1

The idea is to select columns by regex

python python-2.7 pandas

Use select:

import pandas as pddf = pd.DataFrame([[10, 14, 12, 44, 45, 78]], columns=['a', 'b', 'c', 'd1', 'd2', 'd3'])df.select(lambda col: col.startswith('d'), axis=1)

Result:

   d1  d2  d30  44  45  78

This is a nice solution if you're not comfortable with regular expressions.

python python-2.7 pandas

On a larger dataset especially, a vectorized approach is actually MUCH FASTER (by more than two orders of magnitude) and is MUCH more readable.I'm providing a screenshot as proof. (Note: Except for the last few lines I wrote at the bottom to make my point clear with a vectorized approach, the other code was derived from the answer by @Alexander.)

Here's that code for reference:

import pandas as pdimport numpy as npn = 10000cols = ['{0}_{1}'.format(letters, number)         for number in range(n) for letters in ('d', 't', 'didi')]df = pd.DataFrame(np.random.randn(30000, n * 3), columns=cols)%timeit df[[c for c in df if c[0] == 'd']]%timeit df[[c for c in df if c.startswith('d')]]%timeit df.select(lambda col: col.startswith('d'), axis=1)%timeit df.filter(regex=("d.*"))%timeit df.filter(like='d')%timeit df.filter(like='d', axis=1)%timeit df.filter(regex=("d.*"), axis=1)%timeit df.columns.map(lambda x: x.startswith("d"))columnVals = df.columns.map(lambda x: x.startswith("d"))%timeit df.filter(columnVals, axis=1)

CodeHunter

How to select columns from dataframe by regex

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last