在http://kudu.apache.org/docs/quickstart.html上遵循kudu QuickStart时,我遇到了错误" error : AnalysisException:数据分布必须使用DISTRIBUTE子句指定。“同时尝试从黑斑马表passenger_data_raw创建kudu表passenger_data。
[quickstart.cloudera:21000] > CREATE TABLE passenger_data
> TBLPROPERTIES(
> 'storage_handler' = 'com.cloudera.kudu.hive.KuduStorageHandler',
> 'kudu.table_name' = 'passenger_data',
> 'kudu.master_addresses' = '127.0.0.1',
> 'kudu.key_columns' = 'id'
> ) AS SELECT * FROM passenger_data_raw;
Query: create TABLE passenger_data
TBLPROPERTIES(
'storage_handler' = 'com.cloudera.kudu.hive.KuduStorageHandler',
'kudu.table_name' = 'passenger_data',
'kudu.master_addresses' = '127.0.0.1',
'kudu.key_columns' = 'id'
) AS SELECT * FROM passenger_data_raw
ERROR: AnalysisException: A data distribution must be specified using a DISTRIBUTE BY clause.系统规格1. Macbook 2011 2. OS El-Capitan 3.按照快速入门指南的指示为kudu下载CDH VM。4. kudu 0.9.0 (版本5f2bf643d8ce3d042aa3903543a92841077a6874) uuid ca7e69c27e064aac8fa64db53cad71e5
有人能帮帮忙吗。
发布于 2016-07-04 03:16:06
幸运的是,通过谷歌搜索,我找到了http://www.cloudera.com/documentation/betas/kudu/0-5-0/PDF/cloudera-kudu.pdf。所以我试着使用“按散列分发”...我不知道我为什么要尝试它,也许是因为它与错误有关。这个查询对我很有效。
CREATE TABLE passenger_data
DISTRIBUTE BY HASH (id) INTO 16 BUCKETS
TBLPROPERTIES(
'storage_handler' = 'com.cloudera.kudu.hive.KuduStorageHandler',
'kudu.table_name' = 'passenger_data',
'kudu.master_addresses' = '127.0.0.1',
'kudu.key_columns' = 'id'
) AS SELECT * FROM passenger_data_raw;希望它对其他人有用。
https://stackoverflow.com/questions/38173477
复制相似问题