我正在学习关于流立方体构建的教程。
来自流媒体的Kylin Cube (Kafka)
所有的属性都是按照它在上述页面中所说的设置的。
但是在触发构建立方体的时候。它在步骤1中失败,从Kafka保存数据
说:
org.apache.kylin.engine.mr.exception.MapReduceException: no counters for job job_1547096967734_0086
at org.apache.kylin.engine.mr.common.MapReduceExecutable.doWork(MapReduceExecutable.java:173)
at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:164)
at org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:70)
at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:164)
at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:114)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)我见过Apache kylin多维数据集失败“没有作业计数器”
但是这里的用例是用于正常的多维数据集构建,而不是通过kafka多维数据集构建流。
在mapred-root-historyserver.log中,下面的条目似乎没有帮助。
2019-01-22 11:33:15,557 INFO org.apache.hadoop.mapreduce.v2.hs.CompletedJob:
Loading job: job_1547096967734_0087 from file:
hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done_intermediate/root/job_1547096967734_0087-
1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-
1548149585065-0-0-FAILED-default-1548149566816.jhist
2019-01-22 11:33:15,557 INFO org.apache.hadoop.mapreduce.v2.hs.CompletedJob:
Loading history file: [hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done_intermediate/root/job_1547096967734_0087-
1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-
1548149585065-0-0-FAILED-default-1548149566816.jhist]
2019-01-22 11:33:15,572 INFOorg.apache.hadoop.mapreduce.jobhistory.
JobSummary:jobId=job_1547096967734_0087,submitTime=1548149562328
,launchTime=1548149566816,firstMapTaskLaunchTime=1548149570064,
firstReduceTaskLaunchTime=0,finishTime=1548149585065,resourcesPerMap
=1024,resourcesPerReduce=0,numMaps=1,numReduces=0,user=root,queue=
default,status=FAILED,mapSlotSeconds=8,reduceSlotSeconds=0,jobName=
Kylin_Save_Kafka_Data_kylin_streaming_cube_Step
2019-01-22 11:33:15,572 INFO org.apache.hadoop.mapreduce.v2.hs.
HistoryFileManager: Deleting JobSummary file: [hdfs://localhost:9000/
tmp/hadoop-yarn/staging/history/done_intermediate/
root/job_1547096967734_0087.summary]
2019-01-22 11:33:15,574 INFO
org.apache.hadoop.mapreduce.v2.hs.HistoryFileManager: Moving
hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done_intermediate/root/job_1547096967734_0087-
1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-
1548149585065-0-0-FAILED-default-1548149566816.jhist to
hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done/2019/01/22/000000/job_1547096967734_0087-
1548149562328-root-Kylin_Save_Kafka_Data_kylin_streaming_cube_Step-
1548149585065-0-0-FAILED-default-1548149566816.jhist
2019-01-22 11:33:15,574 INFO
org.apache.hadoop.mapreduce.v2.hs.HistoryFileManager: Moving
hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done_intermediate/root/job_1547096967734_0087_conf.xml
to hdfs://localhost:9000/tmp/hadoop-
yarn/staging/history/done/2019/01/22/000000/job_1547096967734_0087_conf.xml
2019-01-22 11:35:30,160 INFO org.apache.hadoop.mapreduce.v2.hs.JobHistory:
Starting scan to move intermediate done files这是一个完全手动安装的kylin环境,下面是版本规范:
apache-hive-2.3.4-bin
apache-kylin-2.5.2-bin-hbase1x
hadoop-2.9.1
hbase-1.4.9
kafka_2.11-2.0.0
spark-2.3.2-bin-hadoop2.7
zookeeper-3.4.13任何帮助都将不胜感激。
发布于 2019-01-23 13:45:29
看来你的副官有问题了。您可以检查错误消息的更多日志。你最好参考最新的streaming.html医生。如果你想快速启动Kylin。建议您试用Kylin或使用集成沙箱(如HDP沙箱)进行开发,并确保它至少有10 GB的内存。
发布于 2019-01-24 07:08:22
请检查MR job在纱线上的第一步。在这项工作中,您可以深入研究每个映射器的日志,然后您应该能够看到一些异常。通常情况下,可能的原因包括“无法与Kafka连接”、“无法加载Kafka客户端jar”等。
发布于 2019-01-24 08:15:10
我们能够通过给卡夫卡客户端2.0.0.jar的纱线共享库来修复它。正如mapreduce所说的,没有为kafka找到类def。
https://stackoverflow.com/questions/54308128
复制相似问题