我使用的是Stanford dependency parser,我得到以下句子的输出
我在睡梦中射杀了一只大象
python dep_parsing.py
[((u'shot', u'VBD'), u'nsubj', (u'I', u'PRP')), ((u'shot', u'VBD'), u'dobj', (u'elephant', u'NN')), ((u'elephant', u'NN'), u'det', (u'an', u'DT')), ((u'shot', u'VBD'), u'nmod', (u'sleep', u'NN')), ((u'sleep', u'NN'), u'case', (u'in', u'IN')), ((u'sleep', u'NN'), u'nmod:poss', (u'my', u'PRP$'))]--我想把它转换成一个图,节点是每个令牌,边是它们之间的关系。
我需要图形结构来进行进一步的处理,因此,如果修改它很容易,并且必须易于表示,它将有所帮助。
到现在为止这是我的密码。
from nltk.parse.stanford import StanfordDependencyParser
stanford_parser_dir = 'stanford-parser/'
eng_model_path = stanford_parser_dir + "stanford-parser-models/edu/stanford/nlp/models/lexparser/englishRNN.ser.gz"
my_path_to_models_jar = stanford_parser_dir + "stanford-parser-3.5.2-models.jar"
my_path_to_jar = stanford_parser_dir + "stanford-parser.jar"
dependency_parser = StanfordDependencyParser(path_to_jar=my_path_to_jar, path_to_models_jar=my_path_to_models_jar)
result = dependency_parser.raw_parse('I shot an elephant in my sleep')
dep = result.next()
a = list(dep.triples())
print a怎样才能形成这样的图形结构呢?
发布于 2016-09-06 06:33:38
您可以遍历dep.triples()并获得所需的输出。
代码:
for triple in dep.triples():
print triple[1],"(",triple[0][0],", ",triple[2][0],")"输出:
nsubj ( shot , I )
dobj ( shot , elephant )
det ( elephant , an )
nmod ( shot , sleep )
case ( sleep , in )
nmod:poss ( sleep , my )有关更多信息,您可以检查:NLTK测向仪方法triples()、to_dot()和dep.tree().draw()
编辑-
dep.to_dot()的输出是
digraph G{
edge [dir=forward]
node [shape=plaintext]
0 [label="0 (None)"]
0 -> 2 [label="root"]
1 [label="1 (I)"]
2 [label="2 (shot)"]
2 -> 4 [label="dobj"]
2 -> 7 [label="nmod"]
2 -> 1 [label="nsubj"]
3 [label="3 (an)"]
4 [label="4 (elephant)"]
4 -> 3 [label="det"]
5 [label="5 (in)"]
6 [label="6 (my)"]
7 [label="7 (sleep)"]
7 -> 5 [label="case"]
7 -> 6 [label="nmod:poss"]
}https://stackoverflow.com/questions/39340907
复制相似问题