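I have two files: f1, the user input, and f2, the database. I want to check whether each string from f1 is present in the database (f2) and print the ones that are not in f2. My code does not work correctly. Here is f1: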
rbs003491
rbs003499
rbs003531
rbs003539
rbs111111
Here is f2:
AHPTUR13,rbs003411
AHPTUR13,rbs003419
AHPTUR13,rbs003451
AHPTUR13,rbs003459
AHPTUR13,rbs003469
AHPTUR13,rbs003471
AHPTUR13,rbs003479
AHPTUR13,rbs003491
AHPTUR13,rbs003499
AHPTUR13,rbs003531
AHPTUR13,rbs003539
AHPTUR13,rbs003541
AHPTUR13,rbs003549
AHPTUR13,rbs003581
In this case it should return rbs111111, because it is not in f2. The code is:
with open(c,'r') as f1:
    s1 = set(x.strip() for x in f1)
    print(s1)
with open("/tmp/ARNE/blt",'r') as f2:
    for line in f2:
        if line not in s1:
            print(line)
Posted on 2015-02-16 15:40:07
If you only care about the second part of each line (rbs003411 from AHPTUR13,rbs003411):
with open(user_input_path) as f1, open('/tmp/ARNE/blt') as f2:
    not_found = set(f1.read().split())
    for line in f2:
        _, found = line.strip().split(',')
        not_found.discard(found)  # remove found word
    print(not_found)
    # for x in not_found:
    #     print(x)
Posted on 2015-02-16 15:38:41
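The line variable in the for loop will contain something like "AHPTUR13,rbs003411", but you are only interested in the second part. You should do something like this: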
for line in f2:
    line = line.strip().split(",")[1]
    if line not in s1:
        print(line)
Posted on 2015-02-16 15:38:51
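You need to check the last part of each line, not the whole line. You can split each line from f2 on the comma and take the last part (x.strip().split(',')[-1]). Also, if you want to check whether the strings from f1 are in the database (f2), the logic here is wrong: you need to build your set from f2: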
with open(c,'r') as f1,open("/tmp/ARNE/blt",'r') as f2:
    s1 = set(x.strip().split(',')[-1] for x in f2)
    print(s1)
    for line in f1:
        if line.strip() not in s1:
            print(line)
https://stackoverflow.com/questions/28544921
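Pulling the answers together, here is a minimal sketch of the "build the set from f2, then test each f1 entry" approach. It assumes Python 3 and that the ID is always the last comma-separated field. The helper name `missing_ids` and the in-memory `io.StringIO` stand-ins for the two files are hypothetical, used only for illustration and not taken from the original posts.

```python
import io


def missing_ids(user_lines, db_lines):
    """Return entries from user_lines that are not the last
    comma-separated field of any line in db_lines (input order kept)."""
    # Build the lookup set from the database side (f2 in the question).
    known = {line.strip().split(",")[-1] for line in db_lines if line.strip()}
    # Keep only non-empty user entries that are absent from the database.
    stripped = (line.strip() for line in user_lines)
    return [s for s in stripped if s and s not in known]


# In-memory stand-ins for the two files, using a few rows from the question.
f1 = io.StringIO("rbs003491\nrbs003499\nrbs111111\n")
f2 = io.StringIO("AHPTUR13,rbs003491\nAHPTUR13,rbs003499\nAHPTUR13,rbs003531\n")

for item in missing_ids(f1, f2):
    print(item)  # prints: rbs111111
```

With real files, the same function could be called with the two open file objects, since iterating over a file yields its lines. A list is returned rather than a set so the missing entries come back in the order the user supplied them.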