因此,我有一个表,如下所示:
开始结束5 5 4 8 10 19 4 4
现在我想做的是,如果特定行中两列的值相等,那么在第三列中只打印其中一列。如果它们不同,则按print - start + "-“+ end。数据在数据帧中。示例如下:
起始结束范围5 5 5 4 8 4-8 10 10 10-19 4 4 4
这是我正在尝试的代码:
if df['start'] - df['end'] != 0:
df['range'] = df['start'] + "-" + df['end']
else:
df['range'] = df['start']但这是行不通的。我该怎么做呢?
发布于 2019-10-15 14:38:09
df['range'] = np.where(df['start'] != df['end'], df['start'] + "-" + df['end'], df['start'])类似的另一种解决方案:
df['range'] = df['start'] + np.where(df['start'] != df['end'], "-" + df['end'], '')
print (df)
start end range
0 5 5 5
1 4 8 4-8
2 10 19 10-19
3 4 4 4值不是字符串时的解决方案:
s = df['start'].astype(str)
e = df['end'].astype(str)
df['range'] = np.where(df['start'] != df['end'], s + "-" + e, s)类似的另一种解决方案:
df['range'] = s + np.where(df['start'] != df['end'], "-" + e, '')https://stackoverflow.com/questions/58388559
复制相似问题