这就是我的问题,我正在尝试用hmmlearn教授隐马尔可夫模型。我刚接触这门语言,在理解列表和数组之间的区别时遇到了一些困难。下面是我的代码:
from hmmlearn import hmm
from babel import lists
import numpy as np
import unidecode as u
from numpy import char
l = []
data = []
gods_egypt = ["Amon","Anat","Anouket","Anubis","Apis","Atoum","Bastet","Bès","Gheb","Hâpy","Harmachis","Hathor","Heh","Héket","Horus","Isis","Ka","Khepri","Khonsou","Khnoum","Maât","Meresger","Mout","Nefertoum","Neith","Nekhbet","Nephtys","Nout","Onouris","Osiris","Ouadjet","Oupaout","Ptah","Rê","Rechef","Renenoutet","Satet","Sebek","Sekhmet","Selkis","Seth","Shou","Sokaris","Tatenen","Tefnout","Thot","Thouéris"]
for i in range(0, len(gods_egypt)):
data.append([])
for j in range(0, len(gods_egypt[i])):
data[i].append([u.unidecode(gods_egypt[i][j].lower())])
l.append(len(data[i]))
data = np.asarray(data).reshape(-1,1)
model = hmm.MultinomialHMM(20, verbose=True)
model = model.fit(data, l)以及由此产生的输出
Traceback (most recent call last):
File "~~~\HMM_test.py", line 17, in <module>
model = model.fit(data, l)
File "~~~\Python\Python36\site-packages\hmmlearn\base.py", line 420, in fit
X = check_array(X)
File "~~~\Python36-32\lib\site-packages\sklearn\utils\validation.py", line 402, in check_array
array = np.array(array, dtype=dtype, order=order, copy=copy)
ValueError: setting an array element with a sequence.我在ValueError: setting an array element with a sequence看到,这可能是一个不同数组长度的问题,但我不知道如何解决它。
有什么建议吗?
发布于 2017-08-18 05:43:37
错误本身是因为model.fit()需要一个数值数组的数组。现在,您的输入data是一个字符串列表列表数组的数组。这就是导致错误的原因,因为函数发现它期望的array element是is a sequence,即(字符串列表的)列表。
然而,即使你解决了列表问题,也会出现另一个问题:学习HMM意味着通过一些等式来计算数值。学习HMM的输入数据应该是数字,而不是一组字母。(除非hmmlearn对我不知道的字符有一个非常特殊的选项。)
如果您想使用HMM,首先需要将字母转换为数字。
我不知道你的最终目标是什么。HMM旨在为生成或分类目的而对数据进行建模(如果训练了几个HMM)。一旦你从组成单词的字母中得到一个经过训练的模型,你打算做什么?
至于数据应该以什么格式提供给不同的函数,我建议您查看一下documentation。它包括使用该库的教程。
https://stackoverflow.com/questions/45680809
复制相似问题