我有一个XML文件形式的数据。我还得到了一个将XML转换为HTML的XSLT。我可以使用lxml来执行转换,但是,我希望在转换之后更改一些HTML标记。如何将这个新etree转换成HtmlElements,这样我就可以专门使用.cssselect()等特定方法了。
发布于 2013-06-22 01:09:38
>>> import lxml.etree
>>> import lxml.html
>>>
>>> xmlstring = '''\
... <?xml version='1.0' encoding='ASCII'?>
... <root><a class="here">link1</a><a class="there">link2</a></root>
... '''
>>> root = lxml.etree.fromstring(xmlstring)
>>> root.cssselect('a.here')
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
AttributeError: 'lxml.etree._Element' object has no attribute 'cssselect'lxml.etree.tostring(root) -> lxml.html.fromstring(..)
>>> root = lxml.html.fromstring(lxml.etree.tostring(root))
>>> root.cssselect('a.here')
[<Element a at 0x2989308>]获取XML输出:
>>> print lxml.etree.tostring(root, xml_declaration=True)
<?xml version='1.0' encoding='ASCII'?>
<root><a class="here">link1</a><a class="there">link2</a></root>https://stackoverflow.com/questions/17239607
复制相似问题