首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >在R中执行group_by函数时使用“shapiro_test”函数

在R中执行group_by函数时使用“shapiro_test”函数
EN

Stack Overflow用户
提问于 2022-05-06 18:23:23
回答 1查看 122关注 0票数 1

我以前问过这个问题,但没有运气,所以我再问一遍:

我的数据:

代码语言:javascript
复制
data.type <- c("DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","DNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA","RNA")
hour <- c(1,1,1,2,2,2,24,24,24,48,48,48,96,96,96,168,168,168,672,672,672,1,1,1,2,2,2,24,24,24,48,48,48,96,96,96,168,168,168,672,672,672)
zotu.count <- c(11,14,16,7,16,15,5,14,13,6,5,17,7,7,12,3,4,5,3,5,4,2,3,2,1,6,2,1,1,1,1,0,0,1,1,4,1,1,1,6,7,6)
id <- c(1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42)

我试图使用以下代码进行shapiro测试,以测试数据是否正常,并被告知以下错误:

代码语言:javascript
复制
dataset %>% group_by(data.type, hour) %>% shapiro_test(zotu.count)

Error: Problem with `mutate()` column `data`.
ℹ `data = map(.data$data, .f, ...)`.
x Problem with `mutate()` column `data`.
ℹ `data = map(.data$data, .f, ...)`.
x all 'x' values are identical

这是非常奇怪的,因为它以前工作在另一个数据集,具有相同的数据结构,但我不知道为什么我现在得到这个错误。我非常沮丧,因为我已经搜索了互联网上的答案,但什么也没有。任何有能力帮忙的人都是天赐之物!

谢谢!

EN

回答 1

Stack Overflow用户

回答已采纳

发布于 2022-05-06 18:31:18

我们可以使用一个if/else条件来检查'zotu.count‘中是否有多个唯一值,并应用shapiro_test

代码语言:javascript
复制
library(rstatix)
library(dplyr)
library(tidyr)
dataset %>% 
  group_by(data.type, hour) %>%
  summarise(out = if(n_distinct(zotu.count) == 1) list(NA) 
    else list(shapiro_test(zotu.count)), .groups = 'drop') %>% 
  unnest(out)

-output

代码语言:javascript
复制
# A tibble: 14 × 5
   data.type  hour variable   statistic p.value
   <chr>     <dbl> <chr>          <dbl>   <dbl>
 1 DNA           1 zotu.count     0.987   0.780
 2 DNA           2 zotu.count     0.832   0.194
 3 DNA          24 zotu.count     0.832   0.194
 4 DNA          48 zotu.count     0.812   0.144
 5 DNA          96 zotu.count     0.75    0    
 6 DNA         168 zotu.count     1       1.00 
 7 DNA         672 zotu.count     1       1.00 
 8 RNA           1 zotu.count     0.75    0    
 9 RNA           2 zotu.count     0.893   0.363
10 RNA          24 <NA>          NA      NA    
11 RNA          48 zotu.count     0.75    0    
12 RNA          96 zotu.count     0.75    0    
13 RNA         168 <NA>          NA      NA    
14 RNA         672 zotu.count     0.75    0    

我们也可以filter出那些只有一个唯一值的组。

代码语言:javascript
复制
dataset %>% 
   group_by(data.type, hour) %>% 
   filter(n_distinct(zotu.count) > 1) %>% 
   shapiro_test(zotu.count)
# A tibble: 12 × 5
   data.type  hour variable   statistic     p
   <chr>     <dbl> <chr>          <dbl> <dbl>
 1 DNA           1 zotu.count     0.987 0.780
 2 DNA           2 zotu.count     0.832 0.194
 3 DNA          24 zotu.count     0.832 0.194
 4 DNA          48 zotu.count     0.812 0.144
 5 DNA          96 zotu.count     0.75  0    
 6 DNA         168 zotu.count     1     1.00 
 7 DNA         672 zotu.count     1     1.00 
 8 RNA           1 zotu.count     0.75  0    
 9 RNA           2 zotu.count     0.893 0.363
10 RNA          48 zotu.count     0.75  0    
11 RNA          96 zotu.count     0.75  0    
12 RNA         672 zotu.count     0.75  0    
票数 2
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/72145881

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档