Python Pandas - Using to_sql to write large data frames in chunks
Update: this functionality has been merged into pandas master and will be released in 0.15 (probably end of September), thanks to @artemyk! See https://github.com/pydata/pandas/pull/8062
So starting from 0.15, you can specify the chunksize argument and e.g. simply do:
df.to_sql('table', engine, chunksize=20000)
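For a self-contained sketch of the same call, assuming an SQLAlchemy engine (the SQLite URL, table name, and sample data here are placeholders, not from the original answer):

import pandas as pd
from sqlalchemy import create_engine

# Hypothetical engine; any SQLAlchemy engine (Postgres, MySQL, ...) works.
engine = create_engine('sqlite:///example.db')

# A sample frame large enough to be split into several batches.
df = pd.DataFrame({'a': range(100000), 'b': range(100000)})

# Rows are written in batches of 20000 per database round trip.
df.to_sql('table', engine, chunksize=20000)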
There is a beautiful, idiomatic function chunks provided in an answer to this question. In your case, you can use this function like this:
def chunks(l, n):
    """Yield successive n-sized chunks from l."""
    for i in range(0, len(l), n):
        yield l.iloc[i:i + n]

def write_to_db(engine, frame, table_name, chunk_size):
    for idx, chunk in enumerate(chunks(frame, chunk_size)):
        # Replace the table on the first chunk, append on the rest.
        if idx == 0:
            if_exists_param = 'replace'
        else:
            if_exists_param = 'append'
        chunk.to_sql(con=engine, name=table_name, if_exists=if_exists_param)
The only drawback is that it doesn't support slicing a second index inside the iloc call.
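For example, with the engine and df from the sketch above, this writes the frame in 20000-row batches:

write_to_db(engine, df, 'table', 20000)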
Reading from one table and writing to another in chunks. Here myconn1 is the connection to the source table, myconn2 is the connection to the target table, and ch = 10000 is the chunk size:
import logging
import pandas as pd

LOGGER = logging.getLogger(__name__)

for idx, chunk in enumerate(pd.read_sql_table(table_name=source, con=myconn1, chunksize=ch)):
    # Replace the target table on the first chunk only; using
    # if_exists="replace" on every chunk would wipe the rows
    # written by the previous iterations.
    chunk.to_sql(name=target, con=myconn2,
                 if_exists="replace" if idx == 0 else "append",
                 index=False, chunksize=ch)
    LOGGER.info("Done 1 chunk")
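For completeness, a minimal sketch of how the connections and names above might be wired up; the database URLs and table names are hypothetical placeholders, not part of the original answer:

from sqlalchemy import create_engine

# Hypothetical connection URLs; substitute your own databases.
myconn1 = create_engine('postgresql://user:pass@host1/source_db')
myconn2 = create_engine('postgresql://user:pass@host2/target_db')

source = 'source_table'   # hypothetical source table name
target = 'target_table'   # hypothetical target table name
ch = 10000                # rows per chunk

Because read_sql_table returns an iterator when chunksize is set, only one chunk is materialized as a DataFrame at a time (whether the rows are also streamed from the server depends on the database driver), so the copy can handle tables larger than available memory.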