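I used a named entity recognizer to extract skills from job vacancies. This works, but the entities are now listed in a pandas column. I want to create a new column that contains only the skills, and neither the replace function nor astype(str) works.
Here you can see my data: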
Skills
0 [(Responsibility), (communication)]
1 [(responsibility), (responsibility), (Leadersh...
2 [(Flexibility)]
3 []
4 [(communicate)]
Name: skills, dtype: object
This is what I would like my column to look like in the end:
Skills_clean
0 Responsibility, communication
1 responsibility, responsibility, Leadership
2 Flexibility
3 NaN
4 communicate
Name: skills, dtype: object
So I want to get rid of all the square brackets. Or does anyone have a better idea of how to do this?
Posted on 2022-06-07 15:31:05
This works for me, using replace instead:
import pandas as pd
# Sample data: each row is a string that still contains the brackets
ds = pd.Series(['[(Responsibility), (communication)]', '[(responsibility), (responsibility), (Leadership)]',
                '[(Flexibility)]', '[(communicate)]'])
# Replace every parenthesis and square bracket with a space
ds.str.replace(r'\(|\)|\[|\]', ' ', regex=True)

0 Responsibility , communication
1 responsibility , responsibility , Leadership
2 Flexibility
3 communicate
https://stackoverflow.com/questions/72533381
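The replace call above operates on strings. If the column instead holds Python lists of entity objects (which the printed data in the question suggests but does not confirm), the .str accessor has no string to match against, and that may be why replace did not help. Below is a minimal sketch under that assumption. The helper name join_skills and the sample DataFrame are hypothetical stand-ins, and each item is assumed to have a text representation such as "(Responsibility)". Empty lists become NaN, as in the desired output.

```python
import numpy as np
import pandas as pd


def join_skills(entities):
    """Join a list of entity-like items into 'a, b, c'; an empty list gives NaN."""
    # str() is applied to every item, so plain strings and objects with a
    # text representation are handled the same way (assumption, see above).
    names = [str(item).strip("()[] ,") for item in entities]
    # Drop items that are empty after stripping brackets and parentheses.
    names = [name for name in names if name]
    return ", ".join(names) if names else np.nan


# Hypothetical stand-in for the question's data: real lists, not strings.
df = pd.DataFrame({
    "Skills": [
        ["(Responsibility)", "(communication)"],
        ["(responsibility)", "(responsibility)", "(Leadership)"],
        ["(Flexibility)"],
        [],
        ["(communicate)"],
    ]
})

# Build the cleaned column row by row.
df["Skills_clean"] = df["Skills"].apply(join_skills)
print(df["Skills_clean"])
```

Using apply with a small function keeps the empty-list case explicit, which a plain regex replacement on strings would turn into an empty string rather than NaN.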