文章/答案/技术大牛

发布

社区首页 >问答首页 >在pandas中有没有更优雅的方法来计算条件累积和？

问在pandas中有没有更优雅的方法来计算条件累积和？
EN

Stack Overflow用户

提问于 2020-05-19 15:18:51

回答 1查看 41关注 0票数 0

我正在尝试构建一个小的投资组合应用程序，并计算我的平均入门价和从中获得的收益。以下是我到目前为止的工作，但很想知道是否有更优雅的方法来获得条件累积和，而不需要创建额外的列。要在excel中有效地使用sumifs语句，似乎有很多步骤。

输入数据帧：

    hist_pos = pd.DataFrame(data=[
        [datetime(2020, 5, 1), 'PPT.AX', 30, 20.00, 15.00, 'Buy'],
        [datetime(2020, 5, 2), 'RIO.AX', 25, 25.00, 15.00, 'Buy'],
        [datetime(2018, 5, 3), 'BHP.AX', 100, 4.00, 15.00, 'Buy'],
        [datetime(2019, 5, 3), 'BHP.AX', 50, 4.00, 15.00, 'Sell'],
        [datetime(2019, 12, 3), 'PPT.AX', 80, 4.00, 15.00, 'Buy'],
        [datetime(2020, 5, 3), 'RIO.AX', 100, 4.00, 15.00, 'Buy'],
        [datetime(2020, 5, 5), 'PPT.AX', 50, 40.00, 15.00, 'Sell'],
        [datetime(2020, 5, 10), 'PPT.AX', 15, 45.00, 15.00, 'Sell'],
        [datetime(2020, 5, 18), 'PPT.AX', 30, 20.00, 15.00, 'Sell']],
        columns=['Date', 'Ticker', 'Quantity', 'Price', 'Fees', 'Direction'])

代码库：

    hist_pos.sort_values(['Ticker', 'Date'], inplace=True) 
    hist_pos.Quantity = pd.to_numeric(hist_pos.Quantity) #convert to number

    # where direction is sale, make quantity negative
    hist_pos['AdjQ'] = np.where(
                hist_pos.Direction == 'Buy', 1, -1)*hist_pos.Quantity
    #Sum quantity to get closing quantity for each ticker using the AdjQ column
    hist_pos['CumQuan'] = hist_pos.groupby('Ticker')['AdjQ'].cumsum()

预期输出：

  Date        Ticker  Quantity  Price  Fees Direction  AdjQ  CumQuan
2 2018-05-03  BHP.AX       100    4.0  15.0       Buy   100      100
3 2019-05-03  BHP.AX        50    4.0  15.0      Sell   -50       50
4 2019-12-03  PPT.AX        80    4.0  15.0       Buy    80       80
0 2020-05-01  PPT.AX        30   20.0  15.0       Buy    30      110
6 2020-05-05  PPT.AX        50   40.0  15.0      Sell   -50       60
7 2020-05-10  PPT.AX        15   45.0  15.0      Sell   -15       45
8 2020-05-18  PPT.AX        30   20.0  15.0      Sell   -30       15
1 2020-05-02  RIO.AX        25   25.0  15.0       Buy    25       25
5 2020-05-03  RIO.AX       100    4.0  15.0       Buy   100      125

上面的代码运行良好，并为列CumQuan生成预期的输出。然而，我有更广泛的代码(这里是Repl)，我需要对不同的列多次执行这个过程。所以想知道是否有一种更简单的方法来处理数据，以使用group by、累积和和条件。

python

pandas

回答 1

Stack Overflow用户

回答已采纳

发布于 2020-05-19 16:51:43

分组是我唯一能想到的事情。

hist_pos2 = hist_pos.groupby('Ticker').agg(CumQuan=('AdjQ','cumsum'), CumCost=('CFBuy','cumsum'))

CumQuan CumCost
2   100 -415.0
3   50  -415.0
4   80  -335.0
0   110 -950.0
6   60  -950.0
7   45  -950.0
8   15  -950.0
1   25  -640.0
5   125 -1055.0

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/61885303

复制

相似问题

问在pandas中有没有更优雅的方法来计算条件累积和？
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问在pandas中有没有更优雅的方法来计算条件累积和？EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问在pandas中有没有更优雅的方法来计算条件累积和？
EN