问通过apache livy执行hudi delta streamer作业
EN

Stack Overflow用户

提问于 2021-10-07 11:42:40

回答 1查看 148关注 0票数 0

请帮助如何将--props文件和--source-class文件传递给LIVY API POST。

spark-submit --packages org.apache.hudi:hudi-utilities-bundle_2.11:0.5.3,org.apache.spark:spark-avro_2.11:2.4.4 \
 --master yarn \
 --deploy-mode cluster \
 --conf spark.sql.shuffle.partitions=100 \
 --driver-class-path $HADOOP_CONF_DIR \
 --class org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer \
 --table-type MERGE_ON_READ \
 --source-class org.apache.hudi.utilities.sources.JsonKafkaSource \
 --source-ordering-field tst  \
 --target-base-path /user/hive/warehouse/stock_ticks_mor \
 --target-table test \
 --props /var/demo/config/kafka-source.properties \
 --schemaprovider-class org.apache.hudi.utilities.schema.FilebasedSchemaProvider \
 --continuous

apache-spark

spark-submit

apache-hudi

回答 1

Stack Overflow用户

发布于 2021-10-21 03:31:58

我已经将您正在使用的配置转换为json文件，并将其传递给LIVY API

{
  "className": "org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer",
  "proxyUser": "root",
  "driverCores": 1,
  "executorCores": 2,
  "executorMemory": "1G",
  "numExecutors": 4,
  "queue": "default",
  "name": "stock_ticks_mor",
  "file": "hdfs://tmp/hudi-utilities-bundle_2.12-0.8.0.jar",
  "conf": {
    "spark.sql.shuffle.partitions": "100",
    "spark.jars.packages": "org.apache.hudi:hudi-spark-bundle_2.12:0.8.0,org.apache.spark:spark-avro_2.12:3.0.2",
    "spark.serializer": "org.apache.spark.serializer.KryoSerializer",
    "spark.task.cpus": "1",
    "spark.executor.cores": "1"
  },
  "args": [
    "--props","/var/demo/config/kafka-source.properties",
    "--table-type","MERGE_ON_READ",
    "--source-class", "org.apache.hudi.utilities.sources.JsonKafkaSource",
    "--target-base-path","/user/hive/warehouse/stock_ticks_mor",
    "--target-table","test",
    "--schemaprovider-class","org.apache.hudi.utilities.schema.FilebasedSchemaProvider",
    "--continuous"
  ]
}

您可以将此json提交给LIVY端点，如下所示

curl -H "X-Requested-By: admin" -H "Content-Type: application/json" -X POST -d @config.json http://localhost:8999/batches

参考：https://community.cloudera.com/t5/Community-Articles/How-to-Submit-Spark-Application-through-Livy-REST-API/ta-p/247502

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/69480533

复制

相似问题

问通过apache livy执行hudi delta streamer作业
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问通过apache livy执行hudi delta streamer作业EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问通过apache livy执行hudi delta streamer作业
EN