我有一个元组列表,如下所示:
[
(1, "red")
(1, "red,green")
(1, "green,blue")
(2, "green")
(2, "yellow,blue")
]我正在尝试汇总数据,以便可以获得以下dict输出:
{
1: ["red", "green", "blue"]
2: ["green", "yellow", "blue"]
}注意:将颜色字符串组合成主键(数字),然后拆分成一个列表,并对其进行去重(例如使用set)。
我也想反其道而行之,并按颜色分组:
{
"red": [1],
"green": [1, 2]
"yellow": [2]
"blue": [1, 2]
}显然,我可以通过循环遍历所有元组来做到这一点,但如果可能的话,我想尝试使用list / dict理解。
发布于 2013-06-23 03:42:27
您可以使用collections.defaultdict
>>> from collections import defaultdict
>>> lis = [
(1, "red"),
(1, "red,green"),
(1, "green,blue"),
(2, "green"),
(2, "yellow,blue"),
]
>>> dic = defaultdict(set) #sets only contain unique items
for k, v in lis:
dic[k].update(v.split(','))
>>> dic
defaultdict(<type 'set'>,
{1: set(['blue', 'green', 'red']),
2: set(['blue', 'green', 'yellow'])})现在遍历dic
>>> dic2 = defaultdict(list)
for k,v in dic.iteritems():
for val in v:
dic2[val].append(k)
...
>>> dic2
defaultdict(<type 'list'>,
{'blue': [1, 2],
'green': [1, 2],
'yellow': [2],
'red': [1]})发布于 2021-10-02 18:16:00
另一种无缺省的解决方案。
>>> input = [
... (1, "red"),
... (1, "red,green"),
... (1, "green,blue"),
... (2, "green"),
... (2, "yellow,blue")
... ]
>>> result1 = {s[0]: set(s[1].split(',')) for s in input}
>>> for num, cols in input:
... result1[num].update(cols.split(','))
...
>>> print(result1)
{1: {'red', 'green', 'blue'}, 2: {'green', 'yellow', 'blue'}}
>>>
>>> result2 = dict((k, []) for k in set.union(*result1.values()))
>>> for k,v in result1.items():
... for val in v:
... result2[val].append(k)
...
>>> print(result2)
{'red': [1], 'green': [1, 2], 'yellow': [2], 'blue': [1, 2]}
>>> 这不一定比使用defaultdict的解决方案更好。此外,这不是纯粹的理解,而是将理解作为解决方案的一部分。
发布于 2015-08-28 06:07:57
使用dict理解生成唯一元素计数的一行解决方案:
X = [1,2,1,3,1,4,1,5,2,3,2,4,2,5,3,4,3,5,4,5,5]
Xagg = {xx: sum([int(y==xx) for y in X]) for xx in set(X)}
Xagg
https://stackoverflow.com/questions/17254599
复制相似问题