我正在尝试为java action运行一个oozie工作流。我的Java代码可以从HDFS读取Word文件,并在HDFS上写回CSV文件。我的workflow.xml包含-
<?xml version="1.0" encoding="UTF-8"?>
<workflow-app xmlns="uri:oozie:workflow:0.4" name="Word-Processing">
<start to="PathologyReport-Processing"/>
<action name="PathologyReport-Processing">
<java>
<job-tracker>${jobTracker}</job-tracker>
<name-node>${nameNode}</name-node>
<configuration>
<property>
<name>mapred.job.queue.name</name>
<value>${queueName}</value>
</property>
<property>
<name>oozie.libpath</name>
<value>${JarPath}</value>
</property>
</configuration>
<main-class>${MainClass}</main-class>
<arg>-libjars</arg>
<arg>${JarPath}</arg>
<arg>${in}</arg>
<arg>${out}</arg>
</java>
<ok to="end"/>
<error to="fail"/>
</action>
<kill name="fail">
<message>Java Action failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message>
</kill>
<end name="end"/>
</workflow-app>我的job.properties包含以下代码
nameNode=hdfs://CTSC00385700501:8020
jobTracker=CTSC00385700501:8032
workflowRoot=PathologyReport
queueName=default
MainClass=SampleUnstructured
JarPath=hdfs://localhost:8020/user/oozie/${workflowRoot}/lib/poi-3.9.jar
in=hdfs://localhost:8020/user/oozie/${workflowRoot}/SampleWord.docx
out=hdfs://localhost:8020/user/oozie/${workflowRoot}/output
oozie.use.system.libpath=true
oozie.libpath=hdfs://localhost:8020/user/oozie/share/lib/lib_20150513153121/
oozie.wf.application.path=hdfs://localhost:8020/user/oozie/${workflowRoot}我已经为Apache POI jar文件指定了路径,但它仍然无法找到它。请帮我解决这个问题。提前谢谢。
发布于 2015-08-04 17:55:09
你能检查一下你提到的job.property配置吗?据我所知,workflow.xml中的"${workflowRoot}“参数应该是mentnion,而job.xml应该定义为
<property>
<name>workflowRoot</name>
<value>${workflowRoot}</value>
</property>然后试着运行oozie作业,相信它们会顺利工作。
发布于 2015-08-06 06:35:26
...Please注意到,Oozie不支持Hadoop命令行支持的-libjars选项...
对于Oozie,您应该在操作中添加一个引用JAR-to-be-downloaded-automagically-in-the-working-dir-of-the-YARN-container-at-run-time.的元素
https://stackoverflow.com/questions/31554881
复制相似问题