您可以将两者完美结合,VotingClassifier
with RandomizedSearchCV
. 无需单独运行它们。请参阅文档:http://scikit-learn.org/stable/modules/ensemble.html#using-the-votingclassifier-with-gridsearch http://scikit-learn.org/stable/modules/ensemble.html#using-the-votingclassifier-with-gridsearch
技巧是在你的前面加上前缀params
列出您的估算员姓名。例如,如果您创建了一个RandomForest
估计器,你将其创建为('rf',clf2)
然后你可以在表单中设置它的参数<name__param>
。具体例子:rf__n_estimators: [20,200]
,因此您可以引用特定的估计器并设置值来测试特定参数。
准备测试可执行代码示例;)
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.ensemble import VotingClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.grid_search import RandomizedSearchCV
X = np.array([[-1, -1], [-2, -1], [-3, -2], [1, 1], [2, 1], [3, 2]])
y = np.array([1, 1, 1, 2, 2, 2])
clf1 = DecisionTreeClassifier()
clf2 = RandomForestClassifier(random_state=1)
params = {'dt__max_depth': [5, 10], 'rf__n_estimators': [20, 200],}
eclf = VotingClassifier(estimators=[('dt', clf1), ('rf', clf2)], voting='hard')
random_search = RandomizedSearchCV(eclf, param_distributions=params,n_iter=4)
random_search.fit(X, y)
print(random_search.grid_scores_)