Writing large Pandas Dataframes to CSV file in chunks
Solution:
header = True
for chunk in chunks:
    chunk.to_csv(os.path.join(folder, new_folder, "new_file_" + filename),
                 header=header, columns=['TIME', 'STUFF'], mode='a')
    header = False
Notes:
- The mode='a' argument tells pandas to append to the file rather than overwrite it.
- We only write a column header on the first chunk.
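A minimal runnable sketch of this append-in-chunks pattern, using an in-memory DataFrame sliced into chunks (the data, column names, and output path are illustrative, not from the question):

```python
import os
import tempfile

import pandas as pd

# Toy frame standing in for the large DataFrame; EXTRA is a column
# we deliberately leave out of the output.
df = pd.DataFrame({
    "TIME": range(10),
    "STUFF": list("abcdefghij"),
    "EXTRA": range(10),
})

out_path = os.path.join(tempfile.mkdtemp(), "new_file_data.csv")

header = True
for start in range(0, len(df), 4):          # chunks of 4 rows
    chunk = df.iloc[start:start + 4]
    # mode='a' appends; only the first chunk writes the header row
    chunk.to_csv(out_path, columns=["TIME", "STUFF"],
                 header=header, mode="a", index=False)
    header = False

round_trip = pd.read_csv(out_path)
print(round_trip.shape)  # (10, 2): all rows, only the two selected columns
```

Because the header is written exactly once, the appended file reads back as a single clean table.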
Check out the chunksize argument of the to_csv method (see the pandas docs for details).
Writing to file would look like:
df.to_csv("path/to/save/file.csv", chunksize=1000, columns=['TIME', 'STUFF'])
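With chunksize, pandas batches the rows internally while writing, so you don't need an explicit loop; the resulting file is identical to a one-shot write. A small self-contained sketch (toy data, temp path; note that the old cols keyword is now columns):

```python
import os
import tempfile

import pandas as pd

df = pd.DataFrame({"TIME": range(5), "STUFF": range(5), "OTHER": range(5)})
path = os.path.join(tempfile.mkdtemp(), "file.csv")

# chunksize=2 writes the frame two rows at a time under the hood
df.to_csv(path, chunksize=2, columns=["TIME", "STUFF"], index=False)

result = pd.read_csv(path)
print(result.shape)  # (5, 2)
```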
Why don't you only read the columns of interest and then save it?
file_in = os.path.join(folder, filename)
file_out = os.path.join(folder, new_folder, 'new_file' + filename)
df = pd.read_csv(file_in, sep='\t', skiprows=(0, 1, 2), header=0, names=['TIME', 'STUFF'])
df.to_csv(file_out)
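A runnable sketch of this read-only-what-you-need approach, built around a small synthetic tab-separated file (paths and data are made up). It uses usecols, which is the explicit way in read_csv to keep only the columns of interest:

```python
import os
import tempfile

import pandas as pd

folder = tempfile.mkdtemp()  # stand-in for the real input/output folders

# Build a tab-separated input with three junk lines before the header,
# mirroring the skiprows=(0, 1, 2) situation in the answer above.
file_in = os.path.join(folder, "data.tsv")
with open(file_in, "w") as f:
    f.write("junk line 1\njunk line 2\njunk line 3\n")
    f.write("TIME\tSTUFF\tEXTRA\n")
    f.write("1\ta\tx\n")
    f.write("2\tb\ty\n")

# Skip the junk lines, then read only the TIME and STUFF columns
df = pd.read_csv(file_in, sep="\t", skiprows=3, usecols=["TIME", "STUFF"])

file_out = os.path.join(folder, "new_file_data.csv")
df.to_csv(file_out, index=False)
print(df.shape)  # (2, 2)
```

Selecting columns at read time keeps memory usage down, so the subsequent to_csv can often be a single call instead of a chunked loop.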