下面是出现在我的站点上的HTML:
<meta content="auth" name="param" />
<meta content="I_WANT_THIS" name="token" />我怎样才能使用lxml.html来获取它呢?
发布于 2014-03-12 21:47:32
使用xpath按name属性查找meta标记,并获取content属性值:
from lxml.html import fromstring
html_data = """ <meta content="auth" name="param" />
<meta content="I_WANT_THIS" name="token" />"""
tree = fromstring(html_data)
print tree.xpath('//meta[@name="token"]/@content')指纹:
['I_WANT_THIS']https://stackoverflow.com/questions/22364438
复制相似问题