Consider the following example data frame:
df <- data.frame(A = seq(1,10,1), B = c("Type A", "9383", "Type B", "Duplicate", "No",
                                        "Type B", "No", "others", "Type A", "Duplicate"))
Suppose I have already made some modifications to the data frame, as follows:
library(dplyr)
df <- df %>% mutate(A = paste(.$A, "hours"))
I want to add another mutate line that changes the elements of column B that do not match the vector plan_types to "TBD".
plan_types <- c("Duplicate", "Type A", "Type B", "No")
The expected output would be:
> df
          A         B
1   1 hours    Type A
2   2 hours       TBD
3   3 hours    Type B
4   4 hours Duplicate
5   5 hours        No
6   6 hours    Type B
7   7 hours        No
8   8 hours       TBD
9   9 hours    Type A
10 10 hours Duplicate
Posted on 2022-04-12 16:38:29
There may be a better way, but I'll post this anyway.
df <- df %>% mutate(B = if_else(B %in% plan_types, B, "TBD"))
Posted on 2022-04-12 16:37:12
We can use replace:
library(dplyr)
df <- df %>%
  mutate(B = replace(B, !B %in% plan_types, "TBD"))
Or in base R:
df$B[!df$B %in% plan_types] <- "TBD"
Posted on 2022-04-12 17:15:34
Another strategy is to use str_detect from the stringr package. First we have to create the pattern:
library(dplyr)
library(stringr)
pattern <- paste(plan_types, collapse = '|')
df %>%
  mutate(A = paste(.$A, "hours")) %>%
  mutate(B = ifelse(str_detect(B, pattern), B, "TBD"))
          A         B
1   1 hours    Type A
2   2 hours       TBD
3   3 hours    Type B
4   4 hours Duplicate
5   5 hours        No
6   6 hours    Type B
7   7 hours        No
8   8 hours       TBD
9   9 hours    Type A
10 10 hours Duplicate
https://stackoverflow.com/questions/71846131
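One caveat with the regex approach is worth sketching. A pattern built with paste(plan_types, collapse = "|") matches substrings, so a value that merely contains a plan type is kept, whereas %in% requires an exact match. The example below is a rough illustration under stated assumptions: it uses base R's grepl as a stand-in for str_detect (for this simple alternation the matching behaves the same way), it assumes plan_types holds the four labels that survive in the expected output, and the value "Not sure" and the variable name anchored are hypothetical, added only to show the difference.

```r
# Assumed values, inferred from the expected output in the question.
plan_types <- c("Duplicate", "Type A", "Type B", "No")

# Hypothetical test values: "Not sure" does not appear in the original data.
B <- c("Type A", "9383", "Not sure", "No")

# Unanchored alternation: matches any string that contains a plan type.
pattern <- paste(plan_types, collapse = "|")
loose <- ifelse(grepl(pattern, B), B, "TBD")

# Anchored alternation: the whole string must equal one plan type.
anchored <- paste0("^(", pattern, ")$")
strict <- ifelse(grepl(anchored, B), B, "TBD")

# Exact matching with %in% agrees with the anchored regex.
exact <- ifelse(B %in% plan_types, B, "TBD")

print(loose)   # keeps "Not sure" because it contains "No"
print(strict)  # replaces "Not sure" with "TBD"
```

With the data from the question, all three variants give the same result, because no value merely contains a plan type. The difference only appears on inputs like the hypothetical one here. Plan types that contain regex metacharacters would also need escaping before being pasted into a pattern.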