如何从列表中删除'\xa0·\xa0‘和'\xa0·\xa022h’或类似形式的元素?
我已经尝试过以下几种方法:
from itertools import groupby
# initializing list
test_list = ['\xa0·\xa0', '\xa0·\xa022h', 'moon', 'mars', 'earth']
# initializing char list
char_list = ['\xa']
# printing original list
print ("The original list is : " + str(test_list))
# printing character list
print ("The character list is : " + str(char_list))
# Remove words containing list characters
# using loop
res = []
flag = 1
for ele in test_list:
for idx in char_list:
if idx not in ele:
flag = 1
else:
flag = 0
break
if(flag == 1):
res.append(ele)
# printing result
print ("The filtered strings are : " + str(res)) 图片来源:geeksforgeeks
发布于 2021-03-07 02:53:47
如果您只是希望标识其中包含非ASCII码字符的元素,则可以使用字符串的.isascii()方法:
In [12]: test_list = ['\xa0\xa0', '\xa0\xa022h', 'moon', 'mars', 'earth']
In [13]: [s for s in test_list if s.isascii()]
Out[13]: ['moon', 'mars', 'earth']https://stackoverflow.com/questions/66509339
复制相似问题