我有一个由多列组成的数据框架。我希望将数据框子集设置为仅包括来自一个因子的级别与另一个因子中的多个级别共同出现的行。在下面的简化数据示例中,我只剩下前两行,即GeneA、GeneA和TissueA TissueB。
A <- c("GeneA","GeneA","GeneB","GeneB","GeneC","GeneC")
B <- c("TissueA","TissueB","TissueA","TissueA","TissueA","TissueA")
df <- data.frame(Gene = A, Tissue = B)提前谢谢。
发布于 2020-07-23 23:12:45
这里有一个想法。您可以使用Gene定义组。在每个组中,您希望检查是否存在多个唯一值。
group_by(df, Gene) %>%
filter(n_distinct(Tissue) >= 2)
Gene Tissue
<fct> <fct>
1 GeneA TissueA
2 GeneA TissueBhttps://stackoverflow.com/questions/63056913
复制相似问题