我希望我的程序能在亚马逊上获得产品的价格。我正在尝试用soup.select来做这件事,但是失败了( soup.find也还没成功)。在商品价格上单击鼠标右键,将显示以下html代码:
<span id="price_inside_buybox" class="a-size-medium a-color-price">
ILS 53.35
</span>这是我的代码,我做错了什么?我应该如何正确使用soup.select?
import bs4, requests,
amazonlink = "https://www.amazon.com/UNSCENTED-PURITO-Ingredients-sunscreen-Lightweight/dp/B07WQXP3D8/ref=as_li_ss_tl?crid=V29993T4TI1W&dchild=1&keywords=purito+comfy+water+sunblock&qid=1585239706&sprefix=purito+comfy,aps,299&sr=8-2&linkCode=sl1&tag=drdrayzday-20&linkId=7231993512a225ded0b474b852d105bc&language=en_US"
res = requests.get(amazonlink)
soup = bs4.BeautifulSoup(res.text,'html.parser') #creates soup object
print(soup.select("price_inside_buybox"))
print(soup.find("span",id="price_inside_buybox")) 编辑:我的输出为空列表或无
发布于 2020-05-01 23:42:40
这将是一个更好和更干净的解决方案,因为如果亚马逊改变了它的html,不管是什么原因,你的脚本不再工作,使用亚马逊api。
https://docs.aws.amazon.com/AWSECommerceService/latest/DG/Welcome.html
发布于 2020-05-01 22:11:26
您需要添加user_agent并将解析器更改为lxml。如果没有lxml解析器,则需要使用以下命令进行安装
pip install lxml
代码
import bs4, requests
amazonlink = "https://www.amazon.com/UNSCENTED-PURITO-Ingredients-sunscreen-Lightweight/dp/B07WQXP3D8/ref=as_li_ss_tl?crid=V29993T4TI1W&dchild=1&keywords=purito+comfy+water+sunblock&qid=1585239706&sprefix=purito+comfy,aps,299&sr=8-2&linkCode=sl1&tag=drdrayzday-20&linkId=7231993512a225ded0b474b852d105bc&language=en_US"
headers = {'User-Agent':'Mozilla/5.0'}
res = requests.get(amazonlink,headers=headers)
soup = bs4.BeautifulSoup(res.text,'lxml')
print(soup.select_one("#price_inside_buybox").text.strip())https://stackoverflow.com/questions/61543789
复制相似问题