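I'm trying to scrape a JSON page from the SEC with Python 3, but I can't get the JSON itself: I always end up with a JSON decode error, and the response body turns out to be the page's HTML (I'm new to Python). Here is my code: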
import requests
base_url = r"https://data.sec.gov/api/xbrl/companyfacts/CIK"
CIK = "0000320193"
json_index = ".json"
url = base_url + CIK + json_index
content = requests.get(url)
decoded_content = content.json()

Thank you very much for your help!
Posted on 2022-02-01 23:50:47
You just need to disguise the request a little: add a header so that it looks like it comes from a real browser. I copied my Chrome user agent and used:
import requests
headers = {'user-agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/96.0.4664.110 Safari/537.36'}
url = 'https://data.sec.gov/api/xbrl/companyfacts/CIK0000320193.json'
resp = requests.get(url,headers=headers).json()
print(resp)

https://stackoverflow.com/questions/70948679
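If it helps, here is a slightly more defensive sketch of the same idea. It checks the HTTP status and the Content-Type before calling `.json()`, so a blocked request fails with a readable message instead of a JSON decode error. The helper names (`build_url`, `parse_json_response`, `fetch_company_facts`) and the `HEADERS` value are my own placeholders, not part of any SEC or `requests` API. As far as I know, the SEC asks automated clients to send a User-Agent that identifies the requester, so a contact-style string may be a safer choice than a copied browser string; treat that as an assumption and check the SEC's own guidance.

```python
import requests

# Assumption: placeholder contact string; replace it with your own details.
HEADERS = {"User-Agent": "Sample Company Name admin@example.com"}


def build_url(cik):
    """Build the companyfacts URL, zero-padding the CIK to 10 digits
    (matching the CIK0000320193 form used in the question)."""
    return f"https://data.sec.gov/api/xbrl/companyfacts/CIK{int(cik):010d}.json"


def parse_json_response(resp):
    """Return the decoded JSON, or raise a readable error if the server
    answered with something else (for example an HTML block page)."""
    resp.raise_for_status()  # requests raises HTTPError on 4xx/5xx
    content_type = resp.headers.get("Content-Type", "")
    if "json" not in content_type.lower():
        raise ValueError(
            f"Expected JSON but got {content_type!r}: {resp.text[:200]!r}"
        )
    return resp.json()


def fetch_company_facts(cik, headers=HEADERS, timeout=10):
    """Download and decode the company facts for one CIK."""
    resp = requests.get(build_url(cik), headers=headers, timeout=timeout)
    return parse_json_response(resp)


if __name__ == "__main__":
    facts = fetch_company_facts("0000320193")
    # Print only the top-level keys; the full document is very large.
    print(list(facts.keys()))
```

With this in place, a request sent without a suitable header should surface as an `HTTPError` or a `ValueError` showing the start of the HTML body, which makes the cause much easier to spot than a bare decode error.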