I'm having trouble using Apache POI to read the contents of some .docx files and display the result as an unformatted preview. I'm using POI version 3.11.
Code:
private static String POI2Text(File file) {
    POITextExtractor extractor = null;
    try {
        extractor = ExtractorFactory.createExtractor(file);
        return extractor.getText();
    } catch (Exception ex) {
        logger.warn("Error:", ex);
    } finally {
        if (extractor != null) {
            try {
                extractor.close();
            } catch (Exception ex) {
                logger.warn("Error:", ex);
            }
        }
    }
    return "";
}
The following exception is thrown in the finally block (extractor.close()):
org.apache.poi.openxml4j.exceptions.OpenXML4JRuntimeException: Fail to save: an error occurs while saving the package : part
at org.apache.poi.openxml4j.opc.ZipPackage.saveImpl(ZipPackage.java:503) ~[agent.jar:na]
at org.apache.poi.openxml4j.opc.OPCPackage.save(OPCPackage.java:1425) ~[agent.jar:na]
at org.apache.poi.openxml4j.opc.OPCPackage.save(OPCPackage.java:1412) ~[agent.jar:na]
at org.apache.poi.openxml4j.opc.ZipPackage.closeImpl(ZipPackage.java:353) ~[agent.jar:na]
at org.apache.poi.openxml4j.opc.OPCPackage.close(OPCPackage.java:425) ~[agent.jar:na]
at org.apache.poi.POIXMLTextExtractor.close(POIXMLTextExtractor.java:87) ~[agent.jar:na]
....
Caused by: java.lang.IllegalArgumentException: part
at org.apache.poi.openxml4j.opc.OPCPackage.addPackagePart(OPCPackage.java:873) ~[agent.jar:na]
at org.apache.poi.openxml4j.opc.ZipPackage.saveImpl(ZipPackage.java:448) ~[agent.jar:na]
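... 15 common frames omitted
Is there any way to prevent this exception? The biggest problem is that after the exception is thrown, POI does not release the file handle. I need to be able to move or edit the file outside my application.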
Posted on 2015-02-12 17:33:51
Quick feedback: I was able to solve this by opening an input stream in read-only mode and then using that stream to extract the data via POITextExtractor.
try (InputStream is = Files.newInputStream(path, StandardOpenOption.READ);
     POITextExtractor extractor = ExtractorFactory.createExtractor(is)) {
    return extractor.getText();
} catch (Exception ex) {
    logger.warn("Error in file {}", path, ex);
}
https://stackoverflow.com/questions/27968461
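The stack trace in the question shows the failure coming from inside close(), so it may be worth checking what happens to the stream when an extractor's close() throws. Below is a minimal sketch that uses only the JDK and no POI at all. FailingExtractor, TrackingStream and readPreview are hypothetical names made up for this illustration; FailingExtractor merely simulates an extractor whose close() fails. It suggests that try-with-resources still closes the underlying stream, and that assigning the text to a local variable before the resources are closed lets the preview survive the failure.

```java
import java.io.ByteArrayOutputStream;
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

public class CloseOrderDemo {

    /** Hypothetical wrapper that records whether close() was called. */
    static final class TrackingStream extends FilterInputStream {
        boolean closed = false;

        TrackingStream(InputStream in) {
            super(in);
        }

        @Override
        public void close() throws IOException {
            closed = true;
            super.close();
        }
    }

    /** Hypothetical stand-in for an extractor whose close() fails (not a POI class). */
    static final class FailingExtractor implements AutoCloseable {
        private final InputStream in;

        FailingExtractor(InputStream in) {
            this.in = in;
        }

        String getText() throws IOException {
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buf = new byte[4096];
            int n;
            while ((n = in.read(buf)) != -1) {
                out.write(buf, 0, n);
            }
            return new String(out.toByteArray(), StandardCharsets.UTF_8);
        }

        @Override
        public void close() {
            throw new IllegalStateException("simulated failure in close()");
        }
    }

    static String readPreview(TrackingStream stream) {
        String text = "";
        // Resources are closed in reverse order: the extractor first, then the stream.
        try (TrackingStream is = stream;
             FailingExtractor extractor = new FailingExtractor(is)) {
            text = extractor.getText();
        } catch (Exception ex) {
            // The close() failure arrives here after the stream has been closed.
            System.err.println("Error: " + ex.getMessage());
        }
        return text;
    }

    public static void main(String[] args) throws IOException {
        Path path = Files.createTempFile("preview", ".txt");
        Files.write(path, "hello".getBytes(StandardCharsets.UTF_8));
        TrackingStream stream =
                new TrackingStream(Files.newInputStream(path, StandardOpenOption.READ));
        String text = readPreview(stream);
        System.out.println(text + " / stream closed: " + stream.closed);
        Files.delete(path);
    }
}
```

Run as written, this prints `hello / stream closed: true` on standard output. Whether a real POI extractor behaves the same way on a given file is an assumption that would need to be verified against the POI version in use.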