嗨,我有两个数据格式,在其中我必须根据另一个数据更新值-
示例: df1:
Region Sub_Region Status Reason
LATAM CRM Success
LATAM Genesys Failed
ASPAC CRM Success
ASPAC Genesys Successdf2:
Region Sub_Region Max_Load_Date
LATAM CRM 2021-08-15
LATAM Genesys 2021-09-10
ASPAC CRM 2021-10-11
ASPAC Genesys 2021-10-15在最后产出中:
Region Sub_Region Status Reason Max_Load_Date
LATAM CRM Success 2021-08-15
LATAM Genesys Failed
ASPAC CRM Success 2021-10-11
ASPAC Genesys Success 2021-10-15只有那些人才会更新状态=“成功”
发布于 2022-01-11 12:32:41
你可以merge和mask
(df1.merge(df2, on=['Region', 'Sub_Region'])
.assign(Max_Load_Date=lambda d: d['Max_Load_Date'].mask(d['Status']=='Failed', ''))
)合并将确保将正确的数据映射到正确的行,以防两个数据都没有按照完全相同的顺序排序或缺少行。
产出:
Region Sub_Region Status Max_Load_Date
0 LATAM CRM Success 2021-08-15
1 LATAM Genesys Failed
2 ASPAC CRM Success 2021-10-11
3 ASPAC Genesys Success 2021-10-15https://stackoverflow.com/questions/70666759
复制相似问题