我有一个Dataframe,它有票的清单,以及他们的冲刺和地位如下:
ticket,sprint,status
101,sprint_1,Closed
102,sprint_1,Open
103,sprint_2,Working
103,sprint_3,Fixed
103,sprint_4,Fixed
103,sprint_5,Open
103,sprint_6,Closed如果票是另一个冲刺的一部分的话,我正在试图找到在特定的冲刺中不是Closed的票。
在给定的示例集中,我们看到票证102没有在给定的sprint中完成,但是没有未来的sprint,而这是we票据103从sprint_2迁移到sprint_3并在sprint_3中关闭的一部分。
我试图添加在给定的sprint中不是Closed的票证,如果它们有未来sprint的另一个条目的话
预期输出:
ticket,sprint,status,part_of_future_sprint_if_not_closed,no_future_sprint_planned_open_tickets
101,sprint_1,Closed,0,0
102,sprint_1,Open,0,1
103,sprint_2,Working,1,0
103,sprint_3,Fixed,1,0
103,sprint_4,Fixed,1,0
103,sprint_5,Open,1,0
103,sprint_6,Closed,0,0发布于 2019-12-08 06:46:15
使用:
#test equal
m1 = df['status'].eq('Open')
#test all duplicated tickets
m2 = df['ticket'].duplicated(keep=False)
#test all duplicated sprints
m3 = df['sprint'].duplicated(keep=False)
#test equal
m4 = df['status'].eq('Closed')
#test if at least one Open per group
m5 = m1.groupby(df['ticket']).transform('any')
df['part_of_future_sprint_if_not_closed'] = (m2 & ~m4 & m5).astype(int)
df['no_future_sprint_planned_open_tickets'] = (m1 & ~m2 & m3).astype(int)
print (df)
ticket sprint status part_of_future_sprint_if_not_closed \
0 101 sprint_1 Closed 0
1 102 sprint_1 Open 0
2 103 sprint_2 Working 1
3 103 sprint_3 Fixed 1
4 103 sprint_4 Fixed 1
5 103 sprint_5 Open 1
6 103 sprint_6 Closed 0
no_future_sprint_planned_open_tickets
0 0
1 1
2 0
3 0
4 0
5 0
6 0 https://stackoverflow.com/questions/59233038
复制相似问题