文章/答案/技术大牛

发布

社区首页 >问答首页 >BeautifulSoup -获取用逗号分隔的所有<a>标记

问BeautifulSoup -获取用逗号分隔的所有<a>标记
EN

Stack Overflow用户

提问于 2017-04-20 04:07:59

回答 1查看 929关注 0票数 2

为了获取网站中的所有标签，我有一段代码：

results=[]

all_links = soup.find_all('article')
        for link in all_links:
            print link.find('div', class_="cb-category cb-byline-element")

通过这种方式，我可以得到以以下方式显示的数据(使用','，分离<a>标记)：

<div class="cb-category cb-byline-element"><i class="fa fa-folder-o"></i> <a href="http://ridethetempo.com/category/canadian/" title="View all posts in Canadian">Canadian</a>,  <a href="http://ridethetempo.com/category/music/garage-rock/" title="View all posts in Garage">Garage</a>,  <a href="http://ridethetempo.com/category/listen-2/" title="View all posts in Listen">Listen</a>,  <a href="http://ridethetempo.com/category/music/" title="View all posts in Music">Music</a>,  <a href="http://ridethetempo.com/category/music/psychedelic/" title="View all posts in Psychedelic">Psychedelic</a>,  <a href="http://ridethetempo.com/category/under-2000/" title="View all posts in Under 2000">Under 2000</a></div>

但是，如果我这样做的话：

 results.append(link.find('div', class_="cb-category cb-byline-element"))
 for link in results:
     link.find('a', href=True)['href']

我只为每个<div>块获得第一个<div>，如下所示：

http://ridethetempo.com/category/canadian/

如何递归地检索所有<a>标记，最后得到这个结果？

http://ridethetempo.com/category/canadian/ 
http://ridethetempo.com/category/music/garage-rock/
http://ridethetempo.com/category/listen-2/
http://ridethetempo.com/category/music/ 
http://ridethetempo.com/category/music/psychedelic/
http://ridethetempo.com/category/under-2000/

python

beautifulsoup

回答 1

Stack Overflow用户

回答已采纳

发布于 2017-04-20 04:37:50

for link in soup.find_all('a'):
    print(link.get('href'))

将打印所有“a”标记元素。

票数 1

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/43510031

复制

相似问题

问BeautifulSoup -获取用逗号分隔的所有<a>标记
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问BeautifulSoup -获取用逗号分隔的所有<a>标记EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问BeautifulSoup -获取用逗号分隔的所有<a>标记
EN