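I have an input file containing a list of inputs, one per line. Each line of input is enclosed in double quotes. Inputs sometimes contain backslashes or a few double quotes inside the enclosing double quotes (see the examples below).
Sample input:
"each line is enclosed in double-quotes"
"Double quotes inside a \"double-quoted\" string!"
"This line contains backslashes \\not so cool\\"
"too many double-quotes in a line \"\"\"too much\"\"\""
"too many backslashes \\\\\\\"horrible\"\\\\\\"
I want to take the above input and simply convert every escaped double quote in a line to a backtick (`).
I assume there is a simple one-line solution. I tried the following, but it doesn't work. Any other one-liner, or a fix to the code below, would be greatly appreciated.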
import re

def fix(line):
    return re.sub(r'\\"', '`', line)
It fails for input lines 3 and 5:
"each line is enclosed in double-quotes"
"Double quotes inside a `double-quoted` string!"
"This line contains backslashes \\not so cool\`
"too many double-quotes in a line ```too much```"
"too many backslashes \\\\\\`horrible`\\\\\`
Any fix I can think of breaks other lines. Please help!
Posted on 2016-06-08 03:26:14
This isn't exactly what you asked for, since it replaces with " rather than `, but I'll mention it ... you could always leverage off of csv to do the \" conversion:
>>> import csv
>>> for line in csv.reader(["each line is enclosed in double-quotes",
... "Double quotes inside a \"double-quoted\" string!",
... "This line contains backslashes \\not so cool\\",
... "too many double-quotes in a line \"\"\"too much\"\"\"",
... "too many backslashes \\\\\\\"horrible\"\\\\\\",
... ]):
... print(line)
...
['each line is enclosed in double-quotes']
['Double quotes inside a "double-quoted" string!']
['This line contains backslashes \\not so cool\\']
['too many double-quotes in a line """too much"""']
['too many backslashes \\\\\\"horrible"\\\\\\']
If it is important that they are actual `'s, you could simply do a replace on the text returned by the csv module.
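A minimal sketch of that follow-up replace, assuming the lines are read raw from the file (outer quotes and backslashes intact) and that the reader is configured with `escapechar='\\'` and `doublequote=False`; neither setting appears in the answer above, and `to_backticks` is a hypothetical helper name. With this configuration csv also collapses `\\` into a single backslash and drops the enclosing quotes, which may or may not be what you want.

```python
import csv

def to_backticks(lines):
    """Parse each quoted line with csv, then swap inner quotes for backticks."""
    # Assumption: escapechar='\\' makes the reader treat \" as a literal quote
    # and \\ as a single backslash; doublequote=False turns off the "" escape.
    reader = csv.reader(lines, escapechar='\\', doublequote=False)
    # Each sample line is one quoted field, so row[0] holds the whole text.
    return [row[0].replace('"', '`') for row in reader]

sample = [
    r'"each line is enclosed in double-quotes"',
    r'"Double quotes inside a \"double-quoted\" string!"',
    r'"This line contains backslashes \\not so cool\\"',
]
for text in to_backticks(sample):
    print(text)
```

Blank lines in the input would produce empty rows, so a real script would probably need to skip those before indexing `row[0]`.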
Posted on 2016-06-08 03:20:40
Add a + after the backslash.
    return re.sub(r'\\+"', '`', line)
https://stackoverflow.com/questions/37692629
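Note that `\\+"` consumes the entire run of backslashes, so an even run sitting directly before the closing quote (as in sample lines 3 and 5) still swallows that quote. One possible alternative, sketched here under the assumption that a quote counts as escaped only when an odd number of backslashes precedes it, and that the preceding backslash pairs should be kept as they are; `ESCAPED_QUOTE` and `fix_odd` are hypothetical names.

```python
import re

# Assumption: a quote is "escaped" only after an odd number of backslashes.
# (?<!\\)      start matching at the beginning of a backslash run
# ((?:\\\\)*)  group 1: zero or more backslash pairs (literal backslashes)
# \\"          one more backslash plus the quote it escapes
ESCAPED_QUOTE = re.compile(r'(?<!\\)((?:\\\\)*)\\"')

def fix_odd(line):
    """Replace each escaped double quote with a backtick, keeping backslash pairs."""
    return ESCAPED_QUOTE.sub(r'\1`', line)

print(fix_odd(r'"This line contains backslashes \\not so cool\\"'))
print(fix_odd(r'"too many backslashes \\\\\\\"horrible\"\\\\\\"'))
```

The lookbehind is what stops the pattern from starting in the middle of a backslash run, so an even run followed by a quote never matches and the closing quote on each line is left alone.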