如何在下面的元组列表中找到重复的值?
[(1622, 4081), (1622, 4082), (1624, 4083), (1626, 4085), (1650, 4086), (1650, 4090)]我想得到一个列表,如下所示:
[4081, 4082, 4086, 4090]我尝试过使用itemgetter,然后使用group by option,但不起作用。
怎么才能做到这一点呢?
发布于 2016-07-28 01:55:16
使用有序字典,第一项作为其键,第二项的列表作为值(对于使用dict.setdefalt()创建的重复项),然后选取长度大于1的项:
>>> from itertools import chain
>>> from collections import OrderedDict
>>> d = OrderedDict()
>>> for i, j in lst:
... d.setdefault(i,[]).append(j)
...
>>>
>>> list(chain.from_iterable([j for i, j in d.items() if len(j)>1]))
[4081, 4082, 4086, 4090]发布于 2016-07-28 02:07:18
作为另一种选择,如果你想使用groupby,这里有一种方法:
In [1]: from itertools import groupby
In [2]: ts = [(1622, 4081), (1622, 4082), (1624, 4083), (1626, 4085), (1650, 4086), (1650, 4090)]
In [3]: dups = []
In [4]: for _, g in groupby(ts, lambda x: x[0]):
...: grouped = list(g)
...: if len(grouped) > 1:
...: dups.extend([dup[1] for dup in grouped])
...:
In [5]: print(dups)
[4081, 4082, 4086, 4090]您可以使用groupby从元组的第一个元素开始分组,并将重复的值从元组添加到列表中。
发布于 2016-07-28 03:22:34
还有另一种方法(没有任何导入):
In [896]: lot = [(1622, 4081), (1622, 4082), (1624, 4083), (1626, 4085), (1650, 4086), (1650, 4090)]
In [897]: d = dict()
In [898]: for key, value in lot:
...: d[key] = d.get(key, []) + [value]
...:
...:
In [899]: d
Out[899]: {1622: [4081, 4082], 1624: [4083], 1626: [4085], 1650: [4086, 4090]}
In [900]: [d[key] for key in d if len(d[key]) > 1]
Out[900]: [[4086, 4090], [4081, 4082]]
In [901]: sorted([num for num in lst for lst in [d[key] for key in d if len(d[key]) > 1]])
Out[901]: [4081, 4081, 4082, 4082]https://stackoverflow.com/questions/38619907
复制相似问题