我想从{用户}时间线中检索至少1000条推文回复,在1000条推文中至少有100条与新冠肺炎关键词相关,如"covid19“、”武汉“、”口罩“、”封锁“、”隔离“、"sars-cov-2”等。
我编写了检索tweet的函数:
def get_tweets_by_user(self, screen_name):
'''
Use user_timeline api to fetch POI related tweets, some postprocessing may be required.
:return: List
'''
result = []
tweets = api.user_timeline(screen_name=screen_name,
# 200 is the maximum allowed count
count=200,
include_rts = True,
# Necessary to keep full_text
# otherwise only the first 140 words are extracted
tweet_mode = 'extended'
)
for tw in tweets:
result.append(tw)
return result现在如何从用户时间轴中检索与新冠肺炎关键字相关的100条推文?
发布于 2021-09-18 17:15:05
注册Twitter developer API。您需要几个消费者密钥。告诉他们你是个学生。
import requests as re
import json
import twitter # install this library to work with twitter dev.
consumer_key = "your key"
consumer_secret = "your key"
access_token = "your key"
access_token_secret = "your key"
api = twitter.Api(consumer_key=yourkey,
consumer_secret=yoursecret,
access_token_key=youraccesstoken,
access_token_secret=yourtokensecret)
FILTER = ["covid-19 string here"] # PUT YOUR COVID 19 STRING HERE
LANGUAGES = ['en']
store_file = "outputfileforcovidtweets.txt"
_location = ["put coordinates here"]
def main():
with open(store_file, 'a') as z:
for line in api.GetStreamFilter(track=FILTER, languages=LANGUAGES, locations=_location):
z.write(json.dumps(line))
z.write('\n')
main()这将收集实时tweet到您的输出文件。:)
https://stackoverflow.com/questions/69235540
复制相似问题