首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >如何在字符串列表中仅获取一个字符串的同义词集?

如何在字符串列表中仅获取一个字符串的同义词集?
EN

Stack Overflow用户
提问于 2017-07-28 15:16:28
回答 1查看 199关注 0票数 0

我试图在列表中查找字符串的同义词集。下面是我的代码:

代码语言:javascript
复制
from nltk.corpus import wordnet as wn
from nltk import pos_tag

word1 =  ['orange','man','bench']

def getSynonyms(word1):
    synonymList1 = []
    for data1 in word1:
        wordnetSynset1 = wn.synsets(data1)
        tempList1=[]
        for synset1 in wordnetSynset1:
            synLemmas = synset1.lemma_names()
            for i in xrange(len(synLemmas)):
                word = synLemmas[i].replace('_',' ')
                if pos_tag(word.split()) not in tempList1:
                    tempList1.append(pos_tag(word.split()))
        synonymList1.append(tempList1)
    return synonymList1

print getSynonyms(word1)
print

结果如下:

代码语言:javascript
复制
[[[(u'orange', 'NN')], [(u'orangeness', 'NN')], [(u'orange', 'NN'), 
(u'tree', 'NN')], [(u'Orange', 'NN')], [(u'Orange', 'NNP'), (u'River', 
'NNP')], [(u'orangish', 'JJ')]], [[(u'man', 'NN')], [(u'adult', 'NN'), 
(u'male', 'NN')], [(u'serviceman', 'NN')], [(u'military', 'JJ'), (u'man', 
'NN')], [(u'military', 'JJ'), (u'personnel', 'NNS')], [(u'homo', 'NN')], 
[(u'human', 'JJ'), (u'being', 'VBG')], [(u'human', 'NN')], [(u'valet', 
'NN')], [(u'valet', 'NN'), (u'de', 'IN'), (u'chambre', 'NN')], 
[(u'gentleman', 'NN')], [(u"gentleman's", 'NN'), (u'gentleman', 'NN')], 
[(u'Man', 'NN')], [(u'Isle', 'NNP'), (u'of', 'IN'), (u'Man', 'NNP')], 
[(u'piece', 'NN')], [(u'world', 'NN')], [(u'human', 'JJ'), (u'race', 'NN')], 
[(u'humanity', 'NN')], [(u'humankind', 'NN')], [(u'human', 'JJ'), 
(u'beings', 'NNS')], [(u'humans', 'NNS')], [(u'mankind', 'NN')]], 
[[(u'bench', 'NN')], [(u'terrace', 'NN')], [(u'judiciary', 'NN')], 
[(u'workbench', 'NN')], [(u'work', 'NN'), (u'bench', 'NN')], [(u'Bench', 
'NN')]]]

但是,如果我只想获得一个字符串的同义词集呢?

例如,如果我得到'orange'的同义词集,我只会得到并打印:

代码语言:javascript
复制
[(u'orange', 'NN')], [(u'orangeness', 'NN')], [(u'orange', 'NN'), 
(u'tree', 'NN')], [(u'Orange', 'NN')], [(u'Orange', 'NNP'), (u'River', 
'NNP')], [(u'orangish', 'JJ')]

如果我得到'man'的同义词集,我只会得到并打印:

代码语言:javascript
复制
[(u'man', 'NN')], [(u'adult', 'NN'), 
(u'male', 'NN')], [(u'serviceman', 'NN')], [(u'military', 'JJ'), (u'man', 
'NN')], [(u'military', 'JJ'), (u'personnel', 'NNS')], [(u'homo', 'NN')], 
[(u'human', 'JJ'), (u'being', 'VBG')], [(u'human', 'NN')], [(u'valet', 
'NN')], [(u'valet', 'NN'), (u'de', 'IN'), (u'chambre', 'NN')], 
[(u'gentleman', 'NN')], [(u"gentleman's", 'NN'), (u'gentleman', 'NN')], 
[(u'Man', 'NN')], [(u'Isle', 'NNP'), (u'of', 'IN'), (u'Man', 'NNP')], 
[(u'piece', 'NN')], [(u'world', 'NN')], [(u'human', 'JJ'), (u'race', 'NN')], 
[(u'humanity', 'NN')], [(u'humankind', 'NN')], [(u'human', 'JJ'), 
(u'beings', 'NNS')], [(u'humans', 'NNS')], [(u'mankind', 'NN')]

对于'bench'也是如此。

我试过print getSynonyms(word1[0]),但结果真的很奇怪。

有人能帮上忙吗?谢谢

EN

回答 1

Stack Overflow用户

回答已采纳

发布于 2017-07-28 15:33:08

该函数需要一个数组,这就是为什么只传递一个字符串就会得到奇怪的结果。如果你向它传递一个单词,它将查找该单词的每个字母的同义词集。不过,您可以使用一个值来传递数组:

代码语言:javascript
复制
print getSynonyms([word1[0]])

您还可以重写该函数以删除out循环,从而允许您使用以下内容传入单个单词:

代码语言:javascript
复制
def getSynonyms(word):
    synonymList1 = []
    wordnetSynset1 = wn.synsets(word)
    tempList1=[]
    for synset1 in wordnetSynset1:
        synLemmas = synset1.lemma_names()
        for i in xrange(len(synLemmas)):
            word = synLemmas[i].replace('_',' ')
            if pos_tag(word.split()) not in tempList1:
                tempList1.append(pos_tag(word.split()))
    synonymList1.append(tempList1)
    return synonymList1
票数 2
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/45367158

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档