Библиотека GridSearchCV - значение параметра должно быть ошибкой последовательности - PullRequest
0 голосов
/ 23 апреля 2019

У меня есть следующий фрагмент кода:

# TODO: Import 'make_scorer', 'DecisionTreeRegressor', and 'GridSearchCV'
from sklearn.tree import DecisionTreeRegressor
from sklearn.metrics import make_scorer
from sklearn.grid_search import GridSearchCV

def fit_model(X, y):
    """ Performs grid search over the 'max_depth' parameter for a 
        decision tree regressor trained on the input data [X, y]. """

    # Create cross-validation sets from the training data
    # sklearn version 0.18: ShuffleSplit(n_splits=10, test_size=0.1, train_size=None, random_state=None)
    # sklearn versiin 0.17: ShuffleSplit(n, n_iter=10, test_size=0.1, train_size=None, random_state=None)
    cv_sets = ShuffleSplit(X.shape[0], n_iter = 10, test_size = 0.20, random_state = 0)

    # TODO: Create a decision tree regressor object
    regressor = DecisionTreeRegressor()

    # TODO: Create a dictionary for the parameter 'max_depth' with a range from 1 to 10
    params = {'max_depth' : range(1, 10)}

    # TODO: Transform 'performance_metric' into a scoring function using 'make_scorer' 
    scoring_fnc = make_scorer(performance_metric)

    # TODO: Create the grid search cv object --> GridSearchCV()
    # Make sure to include the right parameters in the object:
    # (estimator, param_grid, scoring, cv) which have values 'regressor', 'params', 'scoring_fnc', and 'cv_sets' respectively.
    scoring = scoring_fnc
    cv = cv_sets
    grid = GridSearchCV(regressor, params, scoring, cv)

    # Fit the grid search object to the data to compute the optimal model
    grid = grid.fit(X, y)

    # Return the optimal model after fitting the data
    return grid.best_estimator_

Когда я вызываю следующие несколько строк, я получаю сообщение об ошибке:

# Fit the training data to the model using grid search
reg = fit_model(X_train, y_train)

# Produce the value for 'max_depth'
print("Parameter 'max_depth' is {} for the optimal model.".format(reg.get_params()['max_depth']))

Ошибка:

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-48-ede909fc46d6> in <module>()
      1 # Fit the training data to the model using grid search
----> 2 reg = fit_model(X_train, y_train)
      3 
      4 # Produce the value for 'max_depth'
      5 print("Parameter 'max_depth' is {} for the optimal model.".format(reg.get_params()['max_depth']))

<ipython-input-47-8bc5d7617f09> in fit_model(X, y)
     27     scoring = scoring_fnc
     28     cv = cv_sets
---> 29     grid = GridSearchCV(regressor, params, scoring, cv)
     30 
     31     # Fit the grid search object to the data to compute the optimal model

/opt/conda/lib/python3.6/site-packages/sklearn/grid_search.py in __init__(self, estimator, param_grid, scoring, fit_params, n_jobs, iid, refit, cv, verbose, pre_dispatch, error_score)
    819             refit, cv, verbose, pre_dispatch, error_score)
    820         self.param_grid = param_grid
--> 821         _check_param_grid(param_grid)
    822 
    823     def fit(self, X, y=None):

/opt/conda/lib/python3.6/site-packages/sklearn/grid_search.py in _check_param_grid(param_grid)
    349             if True not in check:
    350                 raise ValueError("Parameter values for parameter ({0}) need "
--> 351                                  "to be a sequence.".format(name))
    352 
    353             if len(v) == 0:

ValueError: Parameter values for parameter (max_depth) need to be a sequence.

Я новичок в Python и не уверен, что говорит мне эта ошибка. Что ожидает метод max_depth? Он говорит, что ищет последовательность, но последовательность чего? символы? Интс

...