我有一个DataFrame
customer_number purchase_time quantity
14 2007-03-01 07:06:00 10
20 2007-03-12 13:05:00 13我试着找出上午和下午的总采购量。我将purchase_time转换为datetime
df['purchase_time'] = pd.to_datetime(df['purchase_time'])
# Baskets bought in morning.
df[df['purchase_time'] < '12:00:00']但是,结果是原始数据集。
发布于 2018-03-17 22:54:52
在groupby中使用布尔数组
df.groupby(df.purchase_time.dt.hour < 12).sum().rename(
{True: 'Morning', False: 'Afternoon'})
customer_number quantity
purchase_time
Afternoon 20 13
Morning 14 10发布于 2018-03-17 22:26:26
你可以的
df[df['purchase_time'].dt.time < pd.to_datetime('12:00:00').time()]
Out[152]:
customer_number purchase_time quantity
0 14 2007-03-01 07:06:00 10发布于 2018-03-17 22:24:04
你可能不需要在这里转换,只要按词汇学来比较一下时间-
df[df['purchase_time'].str.split().str[1] < '12:00:00']
customer_number purchase_time quantity
0 14 2007-03-01 07:06:00 10虽然,为了增加一层安全性,我建议转换为timedelta并进行比较--这些比较仍然适用于字符串(熊猫奇迹般地)--
df[pd.to_timedelta(
df['purchase_time'].str.split().str[1], errors='coerce'
) < '12:00:00']
customer_number purchase_time quantity
0 14 2007-03-01 07:06:00 10https://stackoverflow.com/questions/49342465
复制相似问题