首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >在嵌套字典中将多个相同的键合并为一个键

在嵌套字典中将多个相同的键合并为一个键
EN

Stack Overflow用户
提问于 2019-09-13 04:03:38
回答 4查看 101关注 0票数 1

我在列表中有一个嵌套的字典,如下所示:

代码语言:javascript
复制
      my_list =

      [{'id': '166073',
        'ref': [{'MeSH': 'C548074'},
       {'UMLS': 'C1969084'},
       {'OMIM': '611523'},
       {'ICD-10': 'Q04.3'}]},

       {'id': '213',
       'ref': [{'MeSH': 'D003554'},
       {'UMLS': 'C0010690'},
       {'MedDRA': '10011777'},
       {'ICD-10': 'E72.0'},
       {'OMIM': '219750'},
       {'OMIM': '219800'},
       {'OMIM': '219900'}]},

       {'id': '333',
        'ref': [{'UMLS': 'C2936785'},
       {'ICD-10': 'E75.2'},
       {'MeSH': 'C537075'},
       {'MeSH': 'D055577'},
       {'UMLS': 'C0268255'},
       {'OMIM': '228000'}]}
         .
         .
         .               
                         ]

我想将嵌套字典中具有相同键的字典合并为键中的列表,如下所示:

代码语言:javascript
复制
    my_list =

    [{'id': '166073',
      'ref': [{'MeSH': 'C548074'},
       {'UMLS': 'C1969084'},
       {'OMIM': '611523'},
       {'ICD-10': 'Q04.3'}]},

     {'id': '213',
      'ref': [{'MeSH': 'D003554'},
       {'UMLS': 'C0010690'},
       {'MedDRA': '10011777'},
       {'ICD-10': 'E72.0'},
       {'OMIM': ['219750', '219800', '219900']}]},

     {'id': '333',
      'ref': [{'UMLS': 'C2936785'},
       {'ICD-10': 'E75.2'},
       {'MeSH': ['C537075', 'D055577']},
       {'UMLS': 'C0268255'},
       {'OMIM': '228000'}]}
         .
         .
         .               
                         ]

我试图通过使用双for循环读取字典并将信息存储到另一个新字典中来进行合并,但我发现这种方法不是最优的,有没有其他推荐的方法来完成这种合并?谢谢!

EN

回答 4

Stack Overflow用户

发布于 2019-09-13 04:19:22

为什么它不是最优的?我认为这种合并应该是很好的。我假设您的合并将如下所示:

代码语言:javascript
复制
my_list = [{'id': '166073',
        'ref': [{'MeSH': 'C548074'},
       {'UMLS': 'C1969084'},
       {'OMIM': '611523'},
       {'ICD-10': 'Q04.3'}]},

       {'id': '213',
       'ref': [{'MeSH': 'D003554'},
       {'UMLS': 'C0010690'},
       {'MedDRA': '10011777'},
       {'ICD-10': 'E72.0'},
       {'OMIM': '219750'},
       {'OMIM': '219800'},
       {'OMIM': '219900'}]},

       {'id': '333',
        'ref': [{'UMLS': 'C2936785'},
       {'ICD-10': 'E75.2'},
       {'MeSH': 'C537075'},
       {'MeSH': 'D055577'},
       {'UMLS': 'C0268255'},
       {'OMIM': '228000'}]}]

def merge(item):
  from collections import defaultdict
  merged = defaultdict(list)
  for ref in item.get('ref', []):
    for key, val in ref.items():
      merged[key].append(val)
  return {**item, 'ref': dict(merged)}

print(list(map(merge, my_list)))
票数 3
EN

Stack Overflow用户

发布于 2019-09-13 04:30:53

代码语言:javascript
复制
#!/usr/bin/env python                                                                                                                                                                                                                                                       

o = {'id': '213',
     'ref': [{'MeSH': 'D003554'},
             {'UMLS': 'C0010690'},
             {'MedDRA': '10011777'},
             {'ICD-10': 'E72.0'},
             {'OMIM': '219750'},
             {'OMIM': '219800'},
             {'OMIM': '219900'}]}

n = {'id': o['id'],
     'ref': {x:[] for x in set([item for sublist in o['ref'] for item in sublist])}}

for p in o['ref']:
    for k, v in p.items():
        n['ref'][k].append(v)

n['ref'] = [n['ref']]

print(n)
票数 1
EN

Stack Overflow用户

发布于 2019-09-13 04:31:38

我发现创建一个字典来收集这些值比将其解包为所需的格式更容易:

代码语言:javascript
复制
new_list = []

for item in my_list:
    d = {'id': item['id'], 'ref': {}}
    for r in item['ref']:
        only_key = list(r.keys())[0]
        d['ref'][only_key] = d['ref'].get(only_key, []) + [r[only_key]]
    new_list.append(d)

    new_ref = []
    for k, v in d['ref'].items():
        new_ref.append({k: v if len(v) > 1 else v[0]})
    d['ref'] = new_ref



[{'id': '166073', 'ref': [{'OMIM': '611523'}, {'MeSH': 'C548074'}, {'ICD-10': 'Q04.3'}, {'UMLS': 'C1969084'}]},
 {'id': '213', 'ref': [{'MeSH': 'D003554'}, {'UMLS': 'C0010690'}, {'MedDRA': '10011777'}, {'ICD-10': 'E72.0'}, {'OMIM': ['219750', '219800', '219900']}]},
 {'id': '333', 'ref': [{'ICD-10': 'E75.2'}, {'OMIM': '228000'}, {'MeSH': ['C537075', 'D055577']}, {'UMLS': ['C2936785', 'C0268255']}]}]
票数 1
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/57913797

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档