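I'm new to Scrapy (and to web scraping in general). For a school project, I'm trying to collect job titles from a website. I'm using the scrapy shell, and this is my request: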
In [19]: job = response.xpath("//article/div/a/text()")
In [20]: job.getall()

This is the result I get:
['\r\n ',
'\r\n ',
'\r\n ',
'\r\n ']

As for the HTML:
<article id="644613" class="media well listing-item listing-item__jobs ">
</div>
<div class="media-body">
<div class="media-heading listing-item__title">
<a href="https://www.tanitjobs.com/job/644613/ingénieur-net/?backPage=&searchID=1585105963.7756" class="link">
Ingénieur .NET
</a>
</div>
</article>

Posted on 2020-03-25 20:42:23
Try this:
jobs = response.css("article.listing-item div.listing-item__title a::text").getall()

You can read more about selectors here: https://docs.scrapy.org/en/latest/topics/selectors.html
https://stackoverflow.com/questions/60842573