我有以下数据框架:
tdf <- structure(list(GO = c("Cytokine-cytokine receptor interaction",
"Cytokine-cytokine receptor interaction|Endocytosis", "I-kappaB kinase/NF-kappaB signaling",
"NF-kappa B signaling pathway", "NF-kappaB import into nucleus",
"T cell chemotaxis"), PosCount = c(17, 18, 4, 5, 1, 2), shortgo = structure(c(1L,
1L, 2L, 2L, 2L, 3L), .Label = c("z", "X", "y"), class = "factor")), .Names = c("GO",
"PosCount", "shortgo"), row.names = c(NA, 6L), class = "data.frame")
desired_order <- c("y", "X", "z")看起来是这样的:
GO PosCount shortgo
1 Cytokine-cytokine receptor interaction 17 z
2 Cytokine-cytokine receptor interaction|Endocytosis 18 z
3 I-kappaB kinase/NF-kappaB signaling 4 X
4 NF-kappa B signaling pathway 5 X
5 NF-kappaB import into nucleus 1 X
6 T cell chemotaxis 2 y那么,我要做的就是用一个预定义的列表对shortgo进行排序。
desired_order <- c("y", "X", "z")然后对每个shortgo组按PosCount进行内部排序。产生这种情况:
GO PosCount shortgo
T cell chemotaxis 2 y
NF-kappa B signaling pathway 5 X
I-kappaB kinase/NF-kappaB signaling 4 X
NF-kappaB import into nucleus 1 X
Cytokine-cytokine receptor interaction|Endocytosis 18 z
Cytokine-cytokine receptor interaction 17 z我试过但失败了:
library(dplyr)
#tdf %>% arrange(as.character(shortgo), desc(PosCount))
tdf %>% arrange(desired_order, desc(PosCount))正确的方法是什么?
发布于 2015-05-05 04:06:15
使用变量的factor表示来强制执行所需的order
在dplyr中,只需:
tdf %>% arrange(factor(shortgo,levels=desired_order), desc(PosCount) )在基数R中,只需使用:
tdf[order(factor(tdf$shortgo,levels=desired_order), -tdf$PosCount),]https://stackoverflow.com/questions/30044020
复制相似问题