我的问题本质上与这个问题相反:
Create a Pandas DataFrame from deeply nested JSON
我想知道是否有可能做相反的事情。给出一张桌子,如:
Library Level School Major 2013 Total
200 MS_AVERY UGRAD GENERAL STUDIES GEST 5079
201 MS_AVERY UGRAD GENERAL STUDIES HIST 5
202 MS_AVERY UGRAD GENERAL STUDIES MELC 2
203 MS_AVERY UGRAD GENERAL STUDIES PHIL 10
204 MS_AVERY UGRAD GENERAL STUDIES PHYS 1
205 MS_AVERY UGRAD GENERAL STUDIES POLS 53是否可以生成嵌套的dict (或JSON),如:
迪特:
{'MS_AVERY':
{ 'UGRAD' :
{'GENERAL STUDIES' : {'GEST' : 5}
{'MELC' : 2}
...发布于 2014-05-10 06:17:16
考虑到您的DataFrame对象,创建一个函数构建递归字典似乎并不难:
def fdrec(df):
drec = dict()
ncols = df.values.shape[1]
for line in df.values:
d = drec
for j, col in enumerate(line[:-1]):
if not col in d.keys():
if j != ncols-2:
d[col] = {}
d = d[col]
else:
d[col] = line[-1]
else:
if j!= ncols-2:
d = d[col]
return drec它将产生:
{'MS_AVERY':
{'UGRAD':
{'GENERAL STUDIES': {'PHYS': 1L,
'POLS': 53L,
'PHIL': 10L,
'HIST': 5L,
'MELC': 2L,
'GEST': 5079L}}}}发布于 2016-12-23 09:20:46
下面是我在this question工作时提出的一个解决方案
def rollup_to_dict_core(x, values, columns, d_columns=None):
if d_columns is None:
d_columns = []
if len(columns) == 1:
if len(values) == 1:
return x.set_index(columns)[values[0]].to_dict()
else:
return x.set_index(columns)[values].to_dict(orient='index')
else:
res = x.groupby([columns[0]] + d_columns).apply(lambda y: rollup_to_dict_core(y, values, columns[1:]))
if len(d_columns) == 0:
return res.to_dict()
else:
res.name = columns[1]
res = res.reset_index(level=range(1, len(d_columns) + 1))
return res.to_dict(orient='index')
def rollup_to_dict(x, values, d_columns=None):
if d_columns is None:
d_columns = []
columns = [c for c in x.columns if c not in values and c not in d_columns]
return rollup_to_dict_core(x, values, columns, d_columns)
>>> pprint(rollup_to_dict(df, ['2013 Total']))
{'MS_AVERY': {'UGRAD': {'GENERAL STUDIES': {'GEST': 5079,
'HIST': 5,
'MELC': 2,
'PHIL': 10,
'PHYS': 1,
'POLS': 53}}}}发布于 2020-11-23 03:41:15
key = ['Library', 'Level', 'School']
series = (df.groupby(key, sort=False)[df.columns.difference(key)]
.apply(lambda x: x[['Major', '2013 Total']].to_dict('records'))
)
# build: {Major: Total}
major = {}
values = series.values[0]
for i in range(len(values)):
major.update({values[i]['Major']: values[i]['2013 Total']})
# build the recursive dictionary
index = series.index[0]
d = {}
for i in reversed(range(len(index))):
if not bool(d):
d = {index[i]: major}
else:
d = {index[i]: d}
print(json.dumps(d, indent=2))它将产生:
{
"MS_AVERY": {
"UGRAD": {
"GENERAL STUDIES": {
"GEST": 5079,
"HIST": 5,
"MELC": 2,
"PHIL": 10,
"PHYS": 1,
"POLS": 53
}
}
}
}https://stackoverflow.com/questions/23576869
复制相似问题