This is what my text file looks like:
000000005|19670905|M|20060201|20070131|6709055223085|01|PRINCIPLE|000021629633|ONYX
000000005|19740423|F|20060201|20070131|7404230424084|01|WIFE|000021629633|ONYX
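000000005|19991028|F|20060201|20070131|9910280147084|01|DAUGHTER|000021629633|ONYX

Whenever I encounter the word PRINCIPLE, I need to report the counts of WIFE and DAUGHTER for that entry. Here the count of WIFE is 1 and the count of DAUGHTER is also 1. The columns and rows have headers. There are multiple entries, such as 000004, 000008, and so on.

counts = data['gender'].value_counts().to_dict()

I did this to count the number of males and females. I was just experimenting. I need some help on how to solve this problem with Python.

I would like something like this: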
PRINCIPLE WIFE DAUGHTER
with the counts below.

Posted on 2019-01-22 17:01:35
You can try this:

import pandas as pd
# select the gender col
gender = data[['gender']]
# groupby to a new dataframe
counts = pd.DataFrame({'count' : gender.groupby(['gender']).size()}).reset_index()

If you also want to include the first column, 'entries':

gender = data[['gender','entries']]
# groupby to a new dataframe
counts = pd.DataFrame({'count' : gender.groupby(['entries','gender']).size()}).reset_index()

Example:

>>> print(d)
   entries     gender
0        5  PRINCIPLE
1        5       WIFE
2        5   DAUGHTER
3        6  PRINCIPLE
4        6  PRINCIPLE
5        6   DAUGHTER
6        7       WIFE
7        7   DAUGHTER
8        7       WIFE
>>> count = pd.DataFrame({'count' : d.groupby(['entries','gender']).size()}).reset_index()
>>> print(count)
   entries     gender  count
0        5   DAUGHTER      1
1        5  PRINCIPLE      1
2        5       WIFE      1
3        6   DAUGHTER      1
4        6  PRINCIPLE      2
5        7   DAUGHTER      1
6        7       WIFE      2

https://stackoverflow.com/questions/54304387
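
The grouped counts above come out in long format, while the question asks for one column each for PRINCIPLE, WIFE and DAUGHTER. A minimal sketch of one way to get that layout follows. It assumes the file is pipe-delimited as in the sample rows and uses pd.crosstab instead of the groupby from the answer. The column names in cols are hypothetical apart from 'entries' and 'gender', which follow the answer's naming. The sample has no header line, so names are supplied here; if the real file does have a header row, drop header=None and names.

```python
import io
import pandas as pd

# The three sample rows from the question, read from a string instead of a file
raw = """000000005|19670905|M|20060201|20070131|6709055223085|01|PRINCIPLE|000021629633|ONYX
000000005|19740423|F|20060201|20070131|7404230424084|01|WIFE|000021629633|ONYX
000000005|19991028|F|20060201|20070131|9910280147084|01|DAUGHTER|000021629633|ONYX
"""

# Hypothetical column names; only 'entries' and 'gender' match the answer's naming
cols = ["entries", "birth_date", "sex", "start", "end", "id_number",
        "code", "gender", "account", "scheme"]

# dtype=str keeps leading zeros such as 000000005
data = pd.read_csv(io.StringIO(raw), sep="|", header=None, names=cols, dtype=str)

# One row per entry, one column per relationship, cells hold the counts
wide = pd.crosstab(data["entries"], data["gender"])

# Order the columns as requested in the question; absent ones are filled with 0
wide = wide.reindex(columns=["PRINCIPLE", "WIFE", "DAUGHTER"], fill_value=0)
print(wide)
```

To read from disk, pass the file path to pd.read_csv in place of io.StringIO(raw). Rows where the PRINCIPLE column is 0 would be entries without a principal member, which may be worth checking separately.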