我有一个如下所示的df:
Visitor_ID Form Name Page Views Downloads (event9) Video Start (event1) Form Open (event10) Form Success (event11)
0 1000012332_3700058682 NaN 1 0 0 0 0
1 1000012332_3700058682 NaN 0 0 0 0 0
2 1000025219_4231004519 NaN 1 0 0 0 0
3 1000025219_4231004519 NaN 1 0 0 0 0
4 1000036902_602553643 NaN 1 0 0 0 0所以我试着在Visitor_ID上做一个这样的群:
df = df.groupby(['Visitor_ID'])预期:
Visitor_ID Form Name Page Views Downloads (event9) Video Start (event1) Form Open (event10) Form Success (event11)
0 1000012332_3700058682 NaN 1 0 0 0 0
1 1000025219_4231004519 NaN 2 0 0 0 0
2 1000036902_602553643 NaN 1 0 0 0 0 但我有
Visitor_ID Form Name Page Views Downloads (event9) Video Start (event1) Form Open (event10) Form Success (event11)
0 1000012332_3700058682 NaN 1 0 0 0 0
1 1000012332_3700058682 NaN 0 0 0 0 0
2 1000025219_4231004519 NaN 1 0 0 0 0
3 1000025219_4231004519 NaN 1 0 0 0 0
4 1000036902_602553643 NaN 1 0 0 0 0有人能解释一下为什么“Visitor_ID”一栏不会聚在一起吗?
发布于 2020-08-03 11:44:43
我认为你应该用一些东西,它应该根据
您可以使用
df.groupby(['Visitor_ID']).sum()
OR
df.groupby(['Visitor_ID']).mean()像这样,您必须指定要在组上执行的操作。
https://stackoverflow.com/questions/63228687
复制相似问题