我在PythonAnywhere上有一个免费帐户,在那里我试图运行以下脚本,这个脚本在本地工作得很好。
我想知道,我得到的错误是由于技术原因,还是仅仅是PythonAnywhere禁止人们只为某些网站从他们的平台上报废?
你知道有其他免费网站允许我放弃任何东西吗?
import requests
from bs4 import BeautifulSoup as bs
def scrapMarketwatch(address):
#creating formatting data from scrapdata
r = requests.get(address)
c = r.content
sup = bs(c,"html.parser")
print(sup)
scrapMarketwatch('http://www.marketwatch.com/investing/future/sp%20500%20futures')
print('\n\n\n PARAGRAPH \n SPACE \n\n\n')
scrapMarketwatch('https://www.bloomberg.com/quote/USDJPY:CUR')我得到以下错误:
文件"/usr/local/lib/python3.6/dist-packages/requests/packages/urllib3/util/retry.py",第376行,在处理上述异常时,在递增的MaxRetryError(_pool,url,error或ResponseError(原因))MaxRetryError HTTPSConnectionPool(host='www.bloomberg.com',port=443)中:在处理上述异常时,最大重试超过url: //USDJPY:CUR(由ProxyError(‘无法连接代理“)引起。另一个例外情况发生:追溯(最近一次调用):scrapMarketwatch (‘https://www.bloomberg.com/quote/USDJPY:CUR’文件“/home/https://www.bloomberg.com/quote/USDJPY:CUR83/scrapit/requests.get文件第70行)中的第10行:scrapMarketwatch r=requests.get(地址)文件第70行,第20行,在get返回请求(‘get’,url,params=params,**kwargs)文件"/usr/local/lib/python3.6/dist-packages/requests/api.py",行56,在请求返回session.request(method=method,url=url,**kwargs)文件"/usr/local/lib/python3.6/dist-packages/requests/sessions.py",行488,在request resp = self.send(prep,**send_kwargs)文件"/usr/local/lib/python3.6/dist-packages/requests/sessions.py",行609,在send = adapter.send(request,**kwargs)文件ProxyError(e,request=request) requests.exceptions.ProxyError: HTTPSConnectionPool(host='www.bloomberg.com',port=443)中,最大重试超过url: /ProxyError/USDJPY:CUR(由ProxyError(‘无法连接到代理“)引起(’隧道连接失败: 403禁止‘))
发布于 2017-10-04 12:31:22
PythonAnywhere免费帐户只允许访问其白名单上的外部站点。那些允许的站点提供了一个机器API。你可以要求其他网站被添加,但如果你要刮他们不。
https://stackoverflow.com/questions/46374988
复制相似问题