简短的背景:我试着用Peter Norvig在python中增强拼写校正器。在这个意义上,我需要出现一个句子(最多3-4个单词)…Ngram viewer from Google会对我有很大帮助,但我不知道如何通过API或其他东西获得值。
伪代码:
# Sentence without meaning but word for word correct.
>> occurrence("were are you")
0.0000000978
# Sentence that makes sense
>> occurrence("where are you")
0.000148
# Then my method should return the sentence with the highest value. (But thats not the problem)对不起,我的英语:-D谢谢!
发布于 2021-05-30 17:41:06
他们实际上有一个未记录的api。
import requests
import json
term = "where are you"
url =f"https://books.google.com/ngrams/json?content={term}&year_start=1800&year_end=2000&corpus=26&smoothing=3"
resp = requests.get(url)
if resp.ok:
results = json.loads(resp.content)results[0]['timeseries']拥有您需要的频率:
[2.854326695000964e-07,
3.4926038665616944e-07,
3.3916604043800663e-07,
...]https://stackoverflow.com/questions/67753096
复制相似问题