我想使用s3桶中的数据交换csv文件中的列。Dataframe有标题行,其中有列名(sa_instance_state, data_interval_start,NAME)
我的代码
def modify_csv(**kwargs):
s3=boto3.client('s3')
read_file = s3.get_object(Bucket=Bucket, Key=Key)
df = pd.read_csv(read_file['Body'],sep=',')
print(df)输入数据格式 (print(df))
sa_instance_state data_interval_start NAME
<sqlalchemy.orm.state.InstanceState1> 2022-07-17 12:55:00+00:00 XYZ
<sqlalchemy.orm.state.InstanceState2> 2022-07-17 12:58:00+00:00 abx预期结果:
NAME sa_instance_state data_interval_start
XYZ <sqlalchemy.orm.state.InstanceState1> 2022-07-17 12:55:00+00:00
abx <sqlalchemy.orm.state.InstanceState2> 2022-07-17 12:58:00+00:00发布于 2022-07-19 22:15:06
你可以试试这个:
df = df[['NAME', 'sa_instance_state', 'data_interval_start']]发布于 2022-07-19 22:24:43
您可以使用set_index和reset_index
>>> df.set_index('NAME').reset_index()
NAME sa_instance_state data_interval_start
0 XYZ <sqlalchemy.orm.state.InstanceState1> 2022-07-17 12:55:00+00:00
1 abx <sqlalchemy.orm.state.InstanceState2> 2022-07-17 12:58:00+00:00https://stackoverflow.com/questions/73043928
复制相似问题