我有一张像这样的长桌:
gene tissue tpm
A liver 5
A brain 2
B ovary 10
B brain 1
C brain 15
C liver 6我想把它转换成更广泛的格式:
gene tissue1 tissue2 tpm1 tpm2
A liver brain 5 2
B ovary brain 10 1
C brain liver 15 6我尝试过dcast和spread,但是我得到了这样的结果:
gene liver brain ovary
A 5 2 NA
B NA 1 10
C 6 15 NA这不是我想要的。
谢谢!
发布于 2022-03-30 22:49:08
我不知道有一个函数可以同时用R语言解决这个难题,但是您可以使用for循环来重新排列数据帧。
守则列示如下:
data <- data.frame(gene=c("A","A","B","B","C","C"),
tissue=c("liver", "brain", "ovary", "brain", "brain", "liver"),
tpm=c(5,2,10,1,15,6))
gene.unique <- unique(data$gene)
i <- 1
for (dummy in gene.unique) {
genes.idx <- which(data$gene == dummy)
tissue1[i] <- data$tissue[genes.idx[1]]
tissue2[i] <- data$tissue[genes.idx[2]]
tpm1[i] <- data$tpm[genes.idx[1]]
tpm2[i] <- data$tpm[genes.idx[2]]
i <- i+1
}
data.final <- data.frame(gene=gene.unique, tissue1, tissue2, tpm1, tpm2)
gene tissue1 tissue2 tpm1 tpm2
1 A liver brain 5 2
2 B ovary brain 10 1
3 C brain liver 15 6希望它能帮到你。
https://stackoverflow.com/questions/71663176
复制相似问题