我正在建立一个逻辑回归模型,其中包含两个类别的结果变量: a_category / z_category,我有以下问题:
分裂:
splits<- initial_split(df1, strata = c(outcome), prop = 3/4)
training_set <- training(splits)
test_set <- testing(splits)配方:
glm_rec <-
recipe(outcome~., data=training_set) %>%
step_zv(all_predictors()) %>%
step_normalize(all_predictors()) %>%
step_dummy(all_nominal(), -all_outcomes())型号规格:
glm_spec <-
logistic_reg() %>%
set_engine("glm") 工作流程:
glm_final_wf <-
workflow() %>%
add_model(glm_spec) %>%
add_recipe(glm_rec)我说的对吗?
# metrics
glm_internalval_res <- glm_final_wf %>%
fit_resamples(
resamples = vfold_cv(training_set,
v= 10,
repeats = 2,
strata = outcome),
control = control_resamples(save_pred = TRUE, event_level = "second"),
metrics = metric_set(
yardstick::roc_auc,
yardstick::accuracy,
yardstick::sens,
yardstick::spec,
yardstick::precision,
yardstick::ppv,
yardstick::npv)
)
# ROC curve
glm_internalval_res %>%
collect_predictions()%>%
group_by(id, id2) %>%
roc_curve(truth=outcome,
.pred_z_category,
event_level = "second"
) %>%
autoplot()我说的对吗?
glm_externalval_res <-
last_fit(glm_final_wf,
splits,
metrics = metric_set(yardstick::roc_auc,
yardstick::accuracy,
yardstick::sens,
yardstick::spec,
yardstick::precision,
yardstick::ppv,
yardstick::npv)
)使用这个块,度量被引用到第一类"a_category",我认为这是不正确的。
我想知道如何向last_fit表明我感兴趣的类别是"z_category“。我在包裹里找不到答案。
谢谢。
拉斐尔。
发布于 2022-09-20 18:45:26
一个选项是为第二个事件设置全局选项:
前0.0.7
options(yardstick.event_first = FALSE)
0.0.7员额:
options(yardstick.event_level = 'second')
https://stackoverflow.com/questions/68561374
复制相似问题