发布于 2018-12-14 07:26:27
在我的例子中,基于10个直方图箱计算单个样本权重是有帮助的。见下面的代码:
import pandas as pd
import numpy as np
from sklearn.utils.class_weight import compute_sample_weight
hist, bin_edges = np.histogram(training_targets, bins = 10)
classes = training_targets.apply(lambda x: pd.cut(x, bin_edges, labels = False,
include_lowest = True)).values
sample_weights = compute_sample_weight('balanced', classes)https://stackoverflow.com/questions/53739726
复制相似问题