首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >用汤获得多种元素

用汤获得多种元素
EN

Stack Overflow用户
提问于 2019-10-10 22:29:27
回答 1查看 46关注 0票数 1

我有以下HTML代码,我试图为每个特定的“日期”获取“客户端”,但我只得到第一个下一个元素:

代码语言:javascript
复制
<div class="info">
    <div class="left-wrap"><span class="date">DATE-1</span></div>
</div>

<div class="clients-list">
        <div>
            <span class="client" >client1</span>
            <span class="client" >client2</span>
            <span class="client" >client3</span>
        </div>
    </div>

<div class="clients-list">
        <div>
            <span class="client" >client4</span>
            <span class="client" >client5</span>
            <span class="client" >client6</span>
        </div>
    </div>

<div class="info">
    <div class="left-wrap"><span class="date" >DATE-2</span></div>
</div>
<div class="clients-list">
        <div>
            <span class="client" >client7</span>
            <span class="client" >client8</span>
        </div>
</div>
<div class="clients-list">
        <div>
            <span class="client" >client9</span>
            <span class="client" >client10</span>
        </div>
</div>
<div class="clients-list">
        <div>
            <span class="client" >client11</span>
            <span class="client" >client12</span>
        </div>
</div>

我使用以下代码:

代码语言:javascript
复制
soup=BeautifulSoup(html,'html.parser')
dates=soup.find_all(class_='date')
for date in dates:
  print(date.text)
  for item in date.find_next(class_='clients-list').find_all(class_='client'):
       print(item.text)

输出是get:

代码语言:javascript
复制
DATE-1
client1
client2
client3
DATE-2
client7
client8

我尝试过使用find_next_all,但是得到了相同的输出。

EN

回答 1

Stack Overflow用户

回答已采纳

发布于 2019-10-10 23:08:20

有点棘手,但您将得到output.Use find_next_siblings()

代码语言:javascript
复制
from bs4 import BeautifulSoup
html='''<div class="info">
    <div class="left-wrap"><span class="date">DATE-1</span></div>
</div>

<div class="clients-list">
        <div>
            <span class="client" >client1</span>
            <span class="client" >client2</span>
            <span class="client" >client3</span>
        </div>
    </div>

<div class="clients-list">
        <div>
            <span class="client" >client4</span>
            <span class="client" >client5</span>
            <span class="client" >client6</span>
        </div>
    </div>

<div class="info">
    <div class="left-wrap"><span class="date" >DATE-2</span></div>
</div>
<div class="clients-list">
        <div>
            <span class="client" >client7</span>
            <span class="client" >client8</span>
        </div>
</div>
<div class="clients-list">
        <div>
            <span class="client" >client9</span>
            <span class="client" >client10</span>
        </div>
</div>
<div class="clients-list">
        <div>
            <span class="client" >client11</span>
            <span class="client" >client12</span>
        </div>
</div>'''

soup=BeautifulSoup(html,'html.parser')
dates=soup.find_all(class_='date')
for date in dates:
  print(date.text)
  for item in date.parent.parent.find_next_siblings(class_='clients-list'):

      if item.find_previous_sibling(class_='info').find_next(class_='date').text==date.text:
          for client in item.find_all(class_='client'):
              print(client.text)

输出

代码语言:javascript
复制
DATE-1
client1
client2
client3
client4
client5
client6
DATE-2
client7
client8
client9
client10
client11
client12
票数 1
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/58331914

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档