Text =“权力的游戏”是一部由大卫·贝尼奥夫( David Benioff )和D·B·魏斯(D.B. Weiss )为HBO制作的美国奇幻剧。这部电视剧是在英国贝尔法斯特制作和拍摄的。
import re
new_text = ''
punctuations = '''!.,'''
for character in Text:
if character not in punctuations:
new_text = new_text + character
from collections import Counter
split_text = Text.split(' ')
count = Counter(split_text)
most_freq_word_new = [key for key,valu in count.items() if valu == max(count.values())]
for index, word in enumerate(split_text):
for i in most_freq_word_new:
if word == i:
print (word)
#print (index)
suffix_word = split_text[index + 1]
prefix_word = split_text[index - 1]
print (suffix_word)
print (prefix_word)我的输出
and
D
Benioff
and
filmed
produced
in
Belfast
filmed
in
the
elsewhere我想要的输出
{and:['D','Benioff','filmed','produced'],
in:['Belfast','filmed','the','elsewhere']}发布于 2019-07-24 13:45:26
这是一种使用dict的方法。
演示:
Text = '''Game of Thrones is an American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. The show was both produced and filmed in Belfast elsewhere in the United Kingdom.'''
import re
new_text = ''
punctuations = '''!.,'''
for character in Text:
if character not in punctuations:
new_text = new_text + character
from collections import Counter, defaultdict
split_text = Text.split(' ')
count = Counter(split_text)
most_freq_word_new = [key for key,valu in count.items() if valu == max(count.values())]
result = {i: [] for i in most_freq_word_new} #Create Dict with word as key and list as value
for index, word in enumerate(split_text):
for i in most_freq_word_new:
if word == i:
#print (index)
suffix_word = split_text[index + 1]
prefix_word = split_text[index - 1]
result[word].extend([suffix_word, prefix_word]) #Use list.extend to add to result.
print(result)输出:
{'and': ['D.', 'Benioff', 'filmed', 'produced'],
'in': ['Belfast', 'filmed', 'the', 'elsewhere']}https://stackoverflow.com/questions/57184244
复制相似问题