文章/答案/技术大牛

发布

社区首页 >问答首页 >计算sklearn.metrics.ndcg_score时的误差

问计算sklearn.metrics.ndcg_score时的误差
EN

Stack Overflow用户

提问于 2021-03-12 13:00:47

回答 1查看 849关注 0票数 1

我试图计算分类器的ndcg分数，但我得到了以下错误：

ValueError:只支持(“多标签-指示器”、“连续-多输出”、“多类-多输出”)格式。得到多类

这是我的密码：

# Declare classifier, fit on data and make predictions
from sklearn.ensemble import RandomForestClassifier
rnd_forest = RandomForestClassifier()
rnd_forest.fit(X_train_tr, y_train)
y_pred_prob = rnd_forest.predict_proba(X_train_tr)

# Calculate ndcg score
from sklearn.metrics import ndcg_score
# This is where I get an error
ndcg_score(y_train, y_pred_prob, k=5)

这就是我的目标和预测的概率：

# True labels of the first two samples
y_train[:2]
> array([7, 7])
    
# Predicted probabilities for first two observation
y_pred_prob[:2]
> array([[0., 0., 0., 0., 0., 0., 0., 1., 0., 0., 0., 0.],
         [0., 0., 0., 0., 0., 0., 0., 1., 0., 0., 0., 0.]])

我试图将y_train重塑成一个二维数组，但它不起作用。有人能告诉我如何纠正这个错误吗？

python

numpy

scikit-learn

reshape

回答 1

Stack Overflow用户

回答已采纳

发布于 2021-03-12 19:00:38

假设在N中有y_train观测。您必须将y_train转换为由N行和12列组成的矩阵。

# Create an ndarray of size (N, 12) filled with zeros
y_train_matrix = np.zeros(shape=(y_pred_prob.shape[0], y_pred_prob.shape[1]))
# Write a 1 on each row's corresponding category
y_train_matrix[np.arange(y_pred_prob.shape[0]), y_train] = 1
# You now have this ndarray
y_train_matrix

array([[0., 0., 0., 0., 0., 0., 0., 1., 0., 0., 0., 0.],
       [0., 0., 0., 0., 0., 0., 0., 1., 0., 0., 0., 0.]])

现在你可以计算分数了：

ndcg_score(y_train_matrix, y_pred_prob)

1.0

票数 1

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/66600401

复制

相似问题

问计算sklearn.metrics.ndcg_score时的误差
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问计算sklearn.metrics.ndcg_score时的误差EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问计算sklearn.metrics.ndcg_score时的误差
EN