我有一些python的经验,但不是很多。我还没有用python处理过XML,但现在我必须这样做了。我在尝试用Python解析的字符串中有一个XML。我想将此XML存储在dataframe中,但无法将其解析为python。
import lxml.etree as ET
lz4UC = rs['trade']['uc']
UC = lz4ToString(base64.b64decode(lz4UC))
parser = ET.XMLParser(recover=True)
tree = ET.parse(UC,parser = parser) # option 1
#tree2 = ET.fromstring(UC,parser = parser) # option 2选项1的错误消息:选项2的OSError: Error reading file '<?xml version="1.0" encoding="UTF-8" standalone="yes"?>错误消息:ValueError: Unicode strings with encoding declaration are not supported. Please use bytes input or XML fragments without declaration.
UC看起来是这样的:
'<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<positionEventMessage xmlns="urn:XXXX:uc" xmlns:td="urn:XXXX:uc:trade-id" xmlns:dt="http://www.dtcc.com/ext" xmlns:ip="urn:XXXX:ipt" xmlns:fpml="http://www.fpml.org/FpML-5/recordkeeping" xmlns:dtx="urn:XXXX:dtcc-5-ext" xmlns:w3="http://www.w3.org/2000/09/xmldsig#" xmlns:XXXX="urn:XXXX:fpml-5-ext" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<header>
<sourceSystem>RODS</sourceSystem>
<originatingSystem>MXG2000</originatingSystem>
<timestamp>2020-07-04T16:23:46Z</timestamp>
</header>
<positionEvent>
<eventType>Position:Update</eventType>
<businessDate>2020-07-04</businessDate>
<businessTime>16:23:46.046Z</businessTime>
<position>
<primaryAssetClass>Cash</primaryAssetClass>
<productType productTypeScheme="urn:XXXX:product-type:RODS">ACFACFACF</productType>
<productType productTypeScheme="urn:XXXX:product-type:RODS:qlDesc">ACF-FXD</productType>
<owner>
<partyReference href="Party1"/>
<accountReference href="Account1"/>
</owner>
<aggregationCategory aggregationCategoryScheme="urn:XXXX:aggregation-category:MUREX:instrument">ACF-FXD</aggregationCategory>
<currencyPair>
<fpml:currency1>USD</fpml:currency1>
<fpml:currency2>SAR</fpml:currency2>
</currencyPair>
<positionId positionIdScheme="urn:XXXX:position-id:HTI">0000002442892000207911</positionId>
<positionId positionIdScheme="urn:XXXX:position-id:RODS:regulatory-key">999999999894891</positionId>
<positionId positionIdScheme="urn:XXXX:position-id:RODS:valuation-id">USDSAR209</positionId>
<positionId positionIdScheme="urn:XXXX:position-id:RODS:GlobalId">2000207911</positionId>
<version>20151207000000000</version>
<fpml:cash>
<fpml:currency>SAR</fpml:currency>
</fpml:cash>
<positionType>Long</positionType>
<quantity>7426113.8099999996</quantity>
<internalProductType>
<ip:productType productName="FX - SIMPLE CASH FLOW"/>
</internalProductType>
</position>
</positionEvent>
<party id="Party1">
<fpml:partyId partyIdScheme="urn:XXXX:party-id:PO_ID">PO7</fpml:partyId>
<fpml:partyId partyIdScheme="urn:XXXX:party-id:PO_GROUP">LOH</fpml:partyId>
<fpml:partyId partyIdScheme="urn:XXXX:party-id:GROUP_ID">MDBK</fpml:partyId>
<fpml:partyId partyIdScheme="urn:XXXX:party-id:BRANCH_ID">610</fpml:partyId>
<fpml:partyId partyIdScheme="urn:XXXX:party-id:GRID_ID">43146</fpml:partyId>
</party>
<account id="Account1">
<fpml:accountId accountIdScheme="urn:XXXX:book-id:RODS">209</fpml:accountId>
<fpml:accountId accountIdScheme="urn:XXXX:book-id:HMS">FO0025489</fpml:accountId>
<fpml:accountBeneficiary href="Party1"/>
</account>
</positionEventMessage>'发布于 2020-08-21 00:32:18
这样试试:
uc = """[your xml above"""]
tree = ET.XML(uc.encode())看看能不能成功。
https://stackoverflow.com/questions/63508211
复制相似问题