我想探索一种用于data.table分组的优雅的(一行)解决方案。
我有一个data.table如下所示:
library(data.table)
library(lubridate)
dt.master <- data.table(user = c(1000, 1002, 2008, 3005, 1000, 1002, 1002),
target = c(50000, 50004, 50501, 50001, 50000, 50000, 50004),
channel = c("A", "B", "C", "A", "B", "A", "C"),
date = c(dmy("10/02/2018"), dmy("11/04/2018"), dmy("14/03/2018"), dmy("02/03/2018"), dmy("05/01/2018"), dmy("08/05/2018"), dmy("05/03/2018")))这就是:
user target channel date
1: 1000 50000 A 2018-02-10
2: 1002 50004 B 2018-04-11
3: 2008 50501 C 2018-03-14
4: 3005 50001 A 2018-03-02
5: 1000 50000 B 2018-01-05
6: 1002 50000 A 2018-05-08
7: 1002 50004 C 2018-03-05我想知道,对于每一组(user, target),第一频道的频道,并添加到dt.master。这是:
user target channel date first_channel
1: 1000 50000 A 2018-02-10 B
2: 1000 50000 B 2018-01-05 B
3: 1002 50000 A 2018-05-08 A
4: 1002 50004 B 2018-04-11 C
5: 1002 50004 C 2018-03-05 C
6: 2008 50501 C 2018-03-14 C
7: 3005 50001 A 2018-03-02 A目前,我正在分两步进行:
dt.master合并
集合名称( dt.result,“信道”,"first_channel") dt.master <- merge(dt.master,dt.result,.(用户,目标,first_channel),by.y .x= c("user"," target "),by.y.y= c("user",“first_channel”),all.x = T,all.y = F)有没有合并的方法吗?,我相信必须有一个解决方案来修改第一行,但我找不到。
非常感谢!
发布于 2018-05-30 09:51:27
您可以按组引用更新如下:
dt.master[, first_channel := channel[which.min(date)], keyby=.(user, target)]发布于 2018-05-30 09:50:38
dt.master[,date:=as.character(date)]
dt.master[,date:=as.numeric(gsub("-","",date))]
dt.master<-dt.master[order(user,date)]
dt.master[,firt_occ:=channel[1],by=c("user")]https://stackoverflow.com/questions/50601262
复制相似问题