我正在尝试使用Apache构建我的第一个多维数据集,直到我得到错误的最后一步,一切都很顺利:
java.lang.IllegalStateException: Can't get cube source record count.
at com.google.common.base.Preconditions.checkState(Preconditions.java:149)
at org.apache.kylin.job.cube.UpdateCubeInfoAfterBuildStep.doWork(UpdateCubeInfoAfterBuildStep.java:104)
at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
at org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:50)
at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:132)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)根据这个问题,https:// github.com/KylinOLAP/ Kylin / issue /101发生以上错误,因为Kylin试图在hive的输出中找到此模式:"HDFS读:(\d+) HDFS写:(\d+)成功“。
来自多维数据集构建教程的正确输出: github.com/KylinOLAP/Kylin/wiki/Kylin-Cube-Build-and-Job-Monitoring-Tutorial:
http://i.imgur.com/71tEeIy.png
我在Kylin的输出看起来被破坏了:
http://i.imgur.com/GCbrN03.png
然而,当我检查Hive日志时,它看起来是ok的:
2015-05-27 08:40:13,419 INFO [main]: ql.Driver (Driver.java:execute(1285)) - Starting command:
INSERT OVERWRITE TABLE kylin_intermediate_Kubek_19700101000000_2922789940817071255_f23ac1b1_10fe_4112_ac9e_b4e6baf07654 SELECT
FACT_TABLE.DATE
,FACT_TABLE.MONEY_ADVERTISER
,FACT_TABLE.MONEY_PUBLISHER
FROM DEFAULT.ADVSTATS as FACT_TABLE
...
2015-05-27 08:45:05,132 INFO [main]: ql.Driver (SessionState.java:printInfo(824)) - MapReduce Jobs Launched:
2015-05-27 08:45:05,148 INFO [main]: ql.Driver (SessionState.java:printInfo(824)) - Stage-Stage-1: Map: 1 Cumulative CPU: 17.32 sec HDFS Read: 44644035 HDFS Write: 2347008 SUCCESS
2015-05-27 08:45:05,153 INFO [main]: ql.Driver (SessionState.java:printInfo(824)) - Total MapReduce CPU Time Spent: 17 seconds 320 msec
2015-05-27 08:45:05,167 INFO [main]: ql.Driver (SessionState.java:printInfo(824)) - OK我正在使用Hortonworks Sandbox 2.2:
hadoop-2.6.0
hbase-0.98.12
hive-0.14.0
zookeeper-3.4.6有人能告诉我为什么我的kylin日志预览是这样的吗?首先,它可能是最后一步错误的原因吗?
发布于 2015-05-28 12:17:59
几天前,kylin开发人员为这类问题进行了绕行。
https://github.com/apache/incubator-kylin/commit/a4692dba681bc2f136e02c64565639eb0080fcc9
因为有时hadoop可能无法获得计数器,即使从现在起作业成功,Kylin在没有获得多维数据集源时给出警告而不是错误。
我所要做的就是重建凯林。
https://stackoverflow.com/questions/30479688
复制相似问题