Dask: How would I parallelize my code with dask delayed?

multithreading python-3.x parallel-processing python-multiprocessing dask

You need to call dask.compute to eventually compute the result. See dask.delayed documentation.

Sequential code

import pandas as pdfrom sklearn.metrics import mean_squared_error as msefilenames = [...]results = []for count, name in enumerate(filenames):    file1 = pd.read_csv(name)    df = pd.DataFrame(file1)  # isn't this already a dataframe?    prediction = df['Close'][:-1]    observed = df['Close'][1:]    mean_squared_error = mse(observed, prediction)      results.append(mean_squared_error)

Parallel code

import daskimport pandas as pdfrom sklearn.metrics import mean_squared_error as msefilenames = [...]delayed_results = []for count, name in enumerate(filenames):    df = dask.delayed(pd.read_csv)(name)    prediction = df['Close'][:-1]    observed = df['Close'][1:]    mean_squared_error = dask.delayed(mse)(observed, prediction)    delayed_results.append(mean_squared_error)results = dask.compute(*delayed_results)

multithreading python-3.x parallel-processing python-multiprocessing dask

A much clearer solution, IMO, than the accepted answer is this snippet.

from dask import compute, delayedimport pandas as pdfrom sklearn.metrics import mean_squared_error as msefilenames = [...]def compute_mse(file_name):    df = pd.read_csv(file_name)    prediction = df['Close'][:-1]    observed = df['Close'][1:]    return mse(observed, prediction)delayed_results = [delayed(compute_mse)(file_name) for file_name in filenames]mean_squared_errors = compute(*delayed_results, scheduler="processes")

CodeHunter

Dask: How would I parallelize my code with dask delayed?

Sequential code

Parallel code

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last