我已经设法在ubuntu 14.04无头虚拟机上安装了apache drill。
我已经放置了一个要对其执行查询的csv文件。
我读了教程,但当我只是想快速入门时,这些教程对我来说都没有意义。

请给我建议。
发布于 2015-08-24 06:23:42
如果您的CSV has没有标题行,请按如下方式查询文件:
select * from dfs.`/Users/khahn/drill/apache-drill-1.1.0/csv_no_header.csv`;
+------------------------+
| columns |
+------------------------+
| ["hello","1","2","3"] |
| ["hello","1","2","3"] |
| ["hello","1","2","3"] |
| ["hello","1","2","3"] |
| ["hello","1","2","3"] |
| ["hello","1","2","3"] |
| ["hello","1","2","3"] |
+------------------------+
7 rows selected (1.427 seconds)如果您的csv确实有标题行,则需要在存储插件(本例中为dfs)定义中添加skipFirstLine属性:
"csv": {
"type": "text",
"extensions": [
"csv"
],
"skipFirstLine": true,
"delimiter": ","
},在Apache Drill docs中描述了通过REST更新存储插件。
带有标题行的CSV:
name, num1, num2,num3
hello,1,2,3
hello,1,2,3
hello,1,2,3
hello,1,2,3
hello,1,2,3
hello,1,2,3
hello,1,2,3用于选择全部的查询将与没有报头的CSV的查询相同。输出也将是相同的。
要查询单个列,请使用use the COLUMNS[n] syntax。
根据您的CSV文件内容,可能需要进行其他存储插件配置更改。参见Configuring Drill to Read Text Files。
https://stackoverflow.com/questions/32164814
复制相似问题