首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >戈布林: org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource :java.lang.ClassNotFoundException

戈布林: org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource :java.lang.ClassNotFoundException
EN

Stack Overflow用户
提问于 2020-02-22 09:54:01
回答 1查看 234关注 0票数 1

我正在尝试mysql使用妖精来摄取hdfs数据。在运行mysql到gobblin.put时,请使用以下步骤:

1)启动hadoop:

sbin\start-all.cmd

2)启动mysql服务:

sudo service mysql start

3)设置GOBBLIN_WORK_DIR:

export GOBBLIN_WORK_DIR=/mnt/c/users/name/incubator-gobblin/GOBBLIN_WORK_DIR

4)设置GOBBLIN_JOB_CONFIG_DIR

export GOBBLIN_JOB_CONFIG_DIR=/mnt/c/users/name/incubator-gobblin/GOBBLIN_JOB_CONFIG_DIR

5)独立启动

bin/gobblin.sh service standalone start --jars /mnt/C/Users/name/incubator-gobblin/build/gobblin-sql/libs/gobblin-sql-0.15.0.jar

给出以下误差

代码语言:javascript
复制
ERROR [JobScheduler-0] org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner  637 - Failed to run job GobblinMySql
org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql
Caused by: java.lang.ClassNotFoundException: org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource

下面是mysql到gobblin.put文件

代码语言:javascript
复制
# Job properties
job.name=GobblinMySql
job.group=MySql
job.description=Data pull from MySql

# Extract properties

extract.table.type=snapshot_only
extract.table.name=user

# Property to consider the extract as full dump
extract.is.full=true

# Source properties
# Source properties - source class to extract data from Mysql Source
source.class=org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource

# Source properties
source.max.number.of.partitions=1
source.querybased.partition.interval=1
source.querybased.is.compression=true
source.querybased.watermark.type=timestamp

# Converter properties - Record from mysql source will be processed by the below series of converters
converter.classes=gobblin.converter.avro.JsonIntermediateToAvroConverter

# date columns format
converter.avro.timestamp.format=yyyy-MM-dd HH:mm:ss'.0'
converter.avro.date.format=yyyy-MM-dd
converter.avro.time.format=HH:mm:ss

# Qualitychecker properties
qualitychecker.task.policies=gobblin.policies.count.RowCountPolicy,gobblin.policies.schema.SchemaCompatibilityPolicy
qualitychecker.task.policy.types=OPTIONAL,OPTIONAL

# Publisher properties
data.publisher.type=gobblin.publisher.BaseDataPublisher
source.querybased.schema=praveen_schema
source.entity=user
source.querybased.extract.type=snapshot

writer.builder.class=org.apache.gobblin.writer.SimpleDataWriterBuilder
writer.file.path.type=tablename
writer.destination.type=HDFS
writer.output.format=txt

data.publisher.type=org.apache.gobblin.publisher.BaseDataPublisher

mr.job.max.mappers=1

metrics.reporting.file.enabled=true
metrics.log.dir=/gobblin-kafka/metrics
metrics.reporting.file.suffix=txt

bootstrap.with.offset=earliest

fs.uri=hdfs://localhost:9000
writer.fs.uri=hdfs://localhost:9000
state.store.fs.uri=hdfs://localhost:9000

mr.job.root.dir=/gobblin-kafka/working
state.store.dir=/gobblin-kafka/state-store
task.data.root.dir=/jobs/kafkaetl/gobblin/gobblin-kafka/task-data
data.publisher.final.dir=/gobblintest/job-output

我正在从/mnt/c/users/name/incubator-gobblin/build/gobblin-distribution/distributions/gobblin-dist目录运行这个命令。

我在这里需要做什么改变?我该怎么解决呢?

EN

回答 1

Stack Overflow用户

回答已采纳

发布于 2020-03-06 12:23:03

解决方案是添加此jar或依赖项,以消除由: java.lang.ClassNotFoundException: java.lang.ClassNotFoundException引起的

代码语言:javascript
复制
<dependency>
    <groupId>com.linkedin.gobblin</groupId>
    <artifactId>gobblin-core</artifactId>
    <version>0.8.0</version>
</dependency>

从这个mvn 网站下载jar。

希望这能有所帮助。

票数 0
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/60350951

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档