我使用的是HTML4,我有以下BeautifulSoup:
<tr>
<td>London <a href="/company/mcrt/5" target="_blank">10 vol</a> 54 page</td>
</tr>我尝试只删除"a“标记,并保留文本,如下所示:
<tr>
<td>London 10 vol 54 page</td>
</tr>有没有办法用bs4做到这一点呢?
发布于 2020-01-05 02:45:22
您正在搜索.unwrap()方法:
txt = '''<tr>
<td>London <a href="/company/mcrt/5" target="_blank">10 vol</a> 54 page</td>
</tr>'''
from bs4 import BeautifulSoup
soup = BeautifulSoup(txt, 'html.parser')
soup.a.unwrap()
print(soup)打印:
<tr>
<td>London 10 vol 54 page</td>
</tr>https://stackoverflow.com/questions/59593617
复制相似问题