文章/答案/技术大牛

发布

社区首页 >问答首页 >按其他列的值创建新列组

问按其他列的值创建新列组
EN

Stack Overflow用户

提问于 2022-11-22 12:18:18

回答 1查看 27关注 0票数 1

我有以下数据

df1 = pd.DataFrame({'sentence': ['A', "A", "A", "A", 'A', 'B', "B", 'B'], 'entity': ['Stay home', "Stay home", "WAY", "WAY", "Stay home", 'Go outside', "Go outside", "purpose"], 'token' : ['Severe weather', "raining", "smt", "SMT0", "Windy", 'Sunny', "Good weather", "smt"]
})


    sentence        entity      token
0   A               Stay home   Severe weather
1   A               Stay home   raining
2   A               Way         smt
3   A               Way         SMT0
4   A               Stay home   Windy
5   B               Go outside  Sunny
6   B               Go outside  Good weather
7   B               Purpose     smt

我想group by sentences的值，并在Way和Purpose存在于entity列时创建新的columns。

预期成果：

   sentence entity      token                          Way       Purpose
0   A        Stay home  Severe weather, raining, Windy smt, SMTO Nan
1   B        Go outside Sunny, Good weather            Nan       smt

python

python-3.x

dataframe

group-by

回答 1

Stack Overflow用户

回答已采纳

发布于 2022-11-22 12:27:20

用Series.isin在boolean indexing中为非匹配行筛选行，~用于反向掩码，聚合join，并使用DataFrame.join进行与DataFrame.pivot_table匹配的筛选行列表

vals = ['WAY','purpose']

m = df1['entity'].isin(vals)

df2 = df1[m].pivot_table(index='sentence',columns='entity',values='token', aggfunc=','.join)
df3 = df1[~m].groupby(['sentence','entity'])['token'].agg(', '.join).reset_index()

df = df3.join(df2, on='sentence')
print (df)
  sentence      entity                           token       WAY purpose
0        A   Stay home  Severe weather, raining, Windy  smt,SMT0     NaN
1        B  Go outside             Sunny, Good weather       NaN     smt

票数 1

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/74532513

复制

相似问题

问按其他列的值创建新列组
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问按其他列的值创建新列组EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问按其他列的值创建新列组
EN