Scikit-learn cross validation scoring for regression

python scikit-learn regression

I dont have the reputation to comment but I want to provide this link for you and/or a passersby where the negative output of the MSE in scikit learn is discussed - https://github.com/scikit-learn/scikit-learn/issues/2439

In addition (to make this a real answer) your first option is correct in that not only is MSE the metric you want to use to compare models but R^2 cannot be calculated depending (I think) on the type of cross-val you are using.

If you choose MSE as a scorer, it outputs a list of errors which you can then take the mean of, like so:

# Doing linear regression with leave one out cross valfrom sklearn import cross_validation, linear_modelimport numpy as np# Including this to remind you that it is necessary to use numpy arrays rather # than lists otherwise you will get an errorX_digits = np.array(x)Y_digits = np.array(y)loo = cross_validation.LeaveOneOut(len(Y_digits))regr = linear_model.LinearRegression()scores = cross_validation.cross_val_score(regr, X_digits, Y_digits, scoring='mean_squared_error', cv=loo,)# This will print the mean of the list of errors that were output and # provide your metric for evaluationprint scores.mean()

python scikit-learn regression

The first one is correct. It outputs the negative of the MSE, as it always tries to maximize the score. Please help us by suggesting an improvement to the documentation.

CodeHunter

Scikit-learn cross validation scoring for regression

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last