文章/答案/技术大牛

发布

社区首页 >问答首页 >如何通过复制值来pivot_wider和填充缺少的值

问如何通过复制值来pivot_wider和填充缺少的值
EN

Stack Overflow用户

提问于 2020-09-03 07:43:49

回答 1查看 122关注 0票数 0

我试图获得一个从长到宽的数据帧，但是由于它的结构，每当我使用pivot_wider()时，我都会得到两列包含数据向量的数据。

以下是原始数据：

structure(list(type = c("radio", "radio", "radio", "television", 
"television", "television", "television", "television", "television", 
"television", "television", "television", "television", "television", 
"television", "television", "television", "television", "television"
), Resource = c("samsung", "samsung", "samsung", "samsung", "samsung", 
"samsung", "samsung", "samsung", "samsung", "samsung", "samsung", 
"sony", "sony", "sony", "sony", "sony", "sony", "sony", "sony"
), Property = c("lot_number", "lot_number", "manufacturer", "lot_number", 
"lot_number", "lot_number", "lot_number", "lot_number", "manufacturer", 
"other_PN", "part_number", "lot_number", "lot_number", "lot_number", 
"lot_number", "lot_number", "manufacturer", "other_PN", "part_number"
), value = c("12345", "54321", "John", "9876", "12345", "54321", 
"56789", "67890", "Walt", "5g6h3f", "6789", "9876", "12345", 
"54321", "56789", "67890", "John", "2a3b4c", "3461")), class = c("spec_tbl_df", 
"tbl_df", "tbl", "data.frame"), row.names = c(NA, -19L), spec = structure(list(
    cols = list(type = structure(list(), class = c("collector_character", 
    "collector")), Resource = structure(list(), class = c("collector_character", 
    "collector")), Property = structure(list(), class = c("collector_character", 
    "collector")), value = structure(list(), class = c("collector_character", 
    "collector"))), default = structure(list(), class = c("collector_guess", 
    "collector")), skip = 1), class = "col_spec"))

我使用的命令：

df_wide <- df %>% pivot_wider(names_from = Property, values_from = value)

但是多个值被保存为"lot_number“列中的一个向量：

structure(list(type = c("radio", "television", "television"), 
    Resource = c("samsung", "samsung", "sony"), lot_number = list(
        c("12345", "54321"), c("9876", "12345", "54321", "56789", 
        "67890"), c("9876", "12345", "54321", "56789", "67890"
        )), manufacturer = list("John", "Walt", "John"), other_PN = list(
        NULL, "5g6h3f", "2a3b4c"), part_number = list(NULL, "6789", 
        "3461")), row.names = c(NA, -3L), class = c("tbl_df", 
"tbl", "data.frame"))

这是我想在最后得到的数据帧。请注意，在所需的输出中，"radio“有一些缺失值。此外，列"manufacturer“、"other_PN”和"part_number“中的值必须在多个行中重复。

structure(list(type = c("radio", "radio", "television", "television", 
"television", "television", "television", "television", "television", 
"television", "television", "television"), Resource = c("samsung", 
"samsung", "samsung", "samsung", "samsung", "samsung", "samsung", 
"sony", "sony", "sony", "sony", "sony"), manufacturer = c("John", 
"John", "Walt", "Walt", "Walt", "Walt", "Walt", "John", "John", 
"John", "John", "John"), other_PN = c(NA, NA, "5g6h3f", "5g6h3f", 
"5g6h3f", "5g6h3f", "5g6h3f", "2a3b4c", "2a3b4c", "2a3b4c", "2a3b4c", 
"2a3b4c"), part_number = c(NA, NA, 6789, 6789, 6789, 6789, 6789, 
3461, 3461, 3461, 3461, 3461), lot_number = c(12345, 54321, 9876, 
12345, 54321, 56789, 67890, 9876, 12345, 54321, 56789, 67890)), class = c("spec_tbl_df", 
"tbl_df", "tbl", "data.frame"), row.names = c(NA, -12L), spec = structure(list(
    cols = list(type = structure(list(), class = c("collector_character", 
    "collector")), Resource = structure(list(), class = c("collector_character", 
    "collector")), manufacturer = structure(list(), class = c("collector_character", 
    "collector")), other_PN = structure(list(), class = c("collector_character", 
    "collector")), part_number = structure(list(), class = c("collector_double", 
    "collector")), lot_number = structure(list(), class = c("collector_double", 
    "collector"))), default = structure(list(), class = c("collector_guess", 
    "collector")), skip = 1), class = "col_spec"))

感谢您的帮助!！

tidyverse

回答 1

Stack Overflow用户

回答已采纳

发布于 2020-09-03 07:54:49

我建议使用dput()数据作为df来使用这种方法。您可以使用group_by()创建一个id变量来标识行，然后可以使用pivot_wider()进行整形。由于需要填充一些值，您可以使用tidyr中的fill()，这是一个tidyverse包：

library(tidyverse)
#Data
df %>% group_by(type,Resource,Property) %>% mutate(id=1:n()) %>%
  pivot_wider(names_from = Property,values_from=value) %>%
  fill(everything()) %>% select(-id)

输出：

# A tibble: 12 x 6
# Groups:   type, Resource [3]
   type       Resource lot_number manufacturer other_PN part_number
   <chr>      <chr>    <chr>      <chr>        <chr>    <chr>      
 1 radio      samsung  12345      John         NA       NA         
 2 radio      samsung  54321      John         NA       NA         
 3 television samsung  9876       Walt         5g6h3f   6789       
 4 television samsung  12345      Walt         5g6h3f   6789       
 5 television samsung  54321      Walt         5g6h3f   6789       
 6 television samsung  56789      Walt         5g6h3f   6789       
 7 television samsung  67890      Walt         5g6h3f   6789       
 8 television sony     9876       John         2a3b4c   3461       
 9 television sony     12345      John         2a3b4c   3461       
10 television sony     54321      John         2a3b4c   3461       
11 television sony     56789      John         2a3b4c   3461       
12 television sony     67890      John         2a3b4c   3461

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/63714874

复制

相似问题

问如何通过复制值来pivot_wider和填充缺少的值
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何通过复制值来pivot_wider和填充缺少的值EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何通过复制值来pivot_wider和填充缺少的值
EN