Error in Python script "Expected 2D array, got 1D array instead:"? Error in Python script "Expected 2D array, got 1D array instead:"? python-3.x python-3.x

Error in Python script "Expected 2D array, got 1D array instead:"?


You are just supposed to provide the predict method with the same 2D array, but with one value that you want to process (or more). In short, you can just replace

[0.58,0.76]

With

[[0.58,0.76]]

And it should work.

EDIT: This answer became popular so I thought I'd add a little more explanation about ML. The short version: we can only use predict on data that is of the same dimensionality as the training data (X) was.

In the example in question, we give the computer a bunch of rows in X (with 2 values each) and we show it the correct responses in y. When we want to predict using new values, our program expects the same - a bunch of rows. Even if we want to do it to just one row (with two values), that row has to be part of another array.


The problem is occurring when you run prediction on the array [0.58,0.76]. Fix the problem by reshaping it before you call predict():

import numpy as npimport matplotlib.pyplot as pltfrom matplotlib import stylestyle.use("ggplot")from sklearn import svmx = [1, 5, 1.5, 8, 1, 9]y = [2, 8, 1.8, 8, 0.6, 11]plt.scatter(x,y)plt.show()X = np.array([[1,2],             [5,8],             [1.5,1.8],             [8,8],             [1,0.6],             [9,11]])y = [0,1,0,1,0,1]clf = svm.SVC(kernel='linear', C = 1.0)clf.fit(X,y)test = np.array([0.58, 0.76])print test       # Produces: [ 0.58  0.76]print test.shape # Produces: (2,) meaning 2 rows, 1 coltest = test.reshape(1, -1)print test       # Produces: [[ 0.58  0.76]]print test.shape # Produces (1, 2) meaning 1 row, 2 colsprint(clf.predict(test)) # Produces [0], as expected


I use the below approach.

reg = linear_model.LinearRegression()reg.fit(df[['year']],df.income)reg.predict([[2136]])