我正试图准确地理解quoting和doublequote在pandas.read_csv模块中的含义。假设我有以下数据:
['name' ,'age' ,'position']
['tom', 14, 'vp']
['jared', 100, 'head, sales']熊猫有三个quoting选项:
如果quotechar是",那么上述四个参数将如何解释上述数据?
发布于 2018-12-19 23:12:35
您可以尝试这样做来对csv作者进行测试:
import csv
DATA = [
['name' ,'age' ,'position'],
['tom', 14, 'vp'],
['jared', 100, 'head, sales'],
]
with open('test_min.csv', 'w') as csvfile:
writer = csv.writer(csvfile, delimiter=',',quotechar='"', quoting=csv.QUOTE_MINIMAL)
for row in DATA:
writer.writerow(row)
with open('test_all.csv', 'w') as csvfile:
writer = csv.writer(csvfile, delimiter=',',quotechar='"', quoting=csv.QUOTE_ALL)
for row in DATA:
writer.writerow(row)
with open('test_nonnumeric.csv', 'w') as csvfile:
writer = csv.writer(csvfile, delimiter=',',quotechar='"', quoting=csv.QUOTE_NONNUMERIC)
for row in DATA:
writer.writerow(row)
with open('test_quotenone.csv', 'w') as csvfile:
writer = csv.writer(csvfile, delimiter=',',quotechar='"', quoting=csv.QUOTE_NONE)
for row in DATA:
writer.writerow(row)下面是你会看到的:
name,age,position tom,14,vp
Notes:无效输出,将需要转义字符集。"name","age","position" "tom",14,"vp" "jared",100,"head, sales"
Notes:注意14和100没有转义。name,age,position tom,14,vp jared,100,"head, sales"
Notes:将只引用head, sales,因为它是违规字段。"name","age","position" "tom","14","vp" "jared","100","head, sales"
Notes:所有内容都引用,不管类型如何。https://stackoverflow.com/questions/53859162
复制相似问题