首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >PyQuery找到子元素节点文本

PyQuery找到子元素节点文本
EN

Stack Overflow用户
提问于 2015-10-16 03:18:50
回答 1查看 587关注 0票数 0

以下是代码:

代码语言:javascript
复制
from pyquery import PyQuery

content = '''<td field="exceptions"><div style="white-space:normal;height:auto;" \
class="datagrid-cell datagrid-cell-c2-exceptions">Traceback (most recent call last):<br>\
  File "./crawler.py", line 381, in &lt;module&gt;<br>   \
   crawler.start()<br>  File "./crawler.py", line 153, in start<br> \
      raise RemoteTransportException(e)<br>RemoteTransportException: \
      This socket is already used by another greenlet: &lt;bound method Waiter.\
      switch of &lt;gevent.hub.Waiter object at 0x7f64d499d6e0&gt;&gt;<br></div></td>'''
pq = PyQuery(content)

for content in pq('td div'):
    print content.text # get Traceback (most recent call last):


for content in pq('td div'):
    for sub in content.getchildren():
        print sub.text


# Traceback (most recent call last):
# None
# None
# None
# None
# None
# None

正如您所得到的,我希望在td div元素中获取内容,它应该是

代码语言:javascript
复制
Traceback (most recent call last):
File "./crawler.py", line 381, in <module>
crawler.start()
File "./crawler.py", line 153, in start
raise RemoteTransportException(e)
RemoteTransportException: This socket is already used by another greenlet: <bound method Waiter.switch of <gevent.hub.Waiter object at 0x7f64d499d6e0>>

但我刚得到Traceback (most recent call last):。那么,如何找出td div中所有带有子标签的文本呢?

EN

回答 1

Stack Overflow用户

回答已采纳

发布于 2015-10-16 03:30:02

您可以使用BeautifulSoup代替:

代码语言:javascript
复制
import bs4
soup = bs4.BeautifulSoup(content)
soup.find('td').find('div').text
u'Traceback (most recent call last):  File "./crawler.py", line 381, in <module>      crawler.start()  File "./crawler.py", line 153, in start       raise RemoteTransportException(e)RemoteTransportException:       This socket is already used by another greenlet: <bound method Waiter.      switch of <gevent.hub.Waiter object at 0x7f64d499d6e0>>'
票数 1
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/33162006

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档