我有个数据:

我想计算销售人员卖给医院的设备数量。
有几种情况:
出售了一台设备。
我需要的最后一个输出表如下所示:

我怎么能用熊猫做这件事?
发布于 2021-12-31 08:39:04
IIUC用途:
#test all rows with at least one non NaN, Nat, None values
m = df[['Signing time', 'Arrival time']].notna().any(axis=1)
#get sums per Hospitals of Trues
df['Number of equipment'] = df.assign(m = m).groupby(['Hospital'])['m'].transform('sum')
#remove duplicates per both columns
df = df.drop_duplicates(['Sales','Hospital'])
#divide by number of Hospitals
df['Number of equipment'] /= df['Hospital'].map(df['Hospital'].value_counts())
#aggreagte sum if Sales are duplicates
df1 = df.groupby('Sales', sort=False, as_index=False)['Number of equipment'].sum()
#filter only columns if Sales are not duplicated
#df1 = df[['Sales','Number of equipment']]https://stackoverflow.com/questions/70540134
复制相似问题