我要把亚马逊的所有链接都从这个开始-
https://www.amazon.com/s?k=guess+case&crid=2Q25FH0FOTCA4&sprefix=guess+case%2Caps%2C215&ref=nb_sb_noss但我只需要猜一个案子。这些链接必须包含2个值-“猜测”和“电话”。例如:
https://www.amazon.com/Guess-Scarlett-Collection-Hard-iPhone/dp/B00QTEP0B0/ref=sr_1_2?crid=2Q25FH0FOTCA4&keywords=guess+case&qid=1650550474&sprefix=guess+case%2Caps%2C215&sr=8-2
https://www.amazon.com/Guess-GUHCP13SPCUMABK-Marble-Collection-iPhone/dp/B09J94ZMZ3/ref=sr_1_3?crid=2Q25FH0FOTCA4&keywords=guess+case&qid=1650550474&sprefix=guess+case%2Caps%2C215&sr=8-3如何使用帮助库重新获取这些链接?
start_urls = ['https://www.amazon.com/s?k=guess+case&crid=2Q25FH0FOTCA4&sprefix=guess+case%2Caps%2C215&ref=nb_sb_noss/']
rules = [Rule(LinkExtractor(allow=r'???' , ))...发布于 2022-04-21 15:55:59
只要用if语句..。
如果“猜测”和“电话”不在网址中:
https://stackoverflow.com/questions/71956203
复制相似问题