文章/答案/技术大牛

发布

社区首页 >问答首页 >java.lang.UnsatisfiedLinkError: jep.Jep.init(Ljava/lang/ClassLoader;ZZ)

问java.lang.UnsatisfiedLinkError: jep.Jep.init(Ljava/lang/ClassLoader;ZZ)
EN

Stack Overflow用户

提问于 2020-01-07 10:06:26

回答 1查看 793关注 0票数 4

首先，我不明白为什么人们会把这个问题打成负数。要么解释我如何改进问题。我可以进一步阐述。这是我这边的反馈。虽然我是新来的，但我不想在不努力的情况下提问。

我试图在使用jep解释器的Google平台Dataproc集群上运行用Scala编写的星火作业。

我增加了jep作为依赖项。

使用Google平台Dataproc在Scala上运行jep的完整简短解决方案是什么？

"black.ninia" % "jep" % "3.9.0"

在我的install.sh脚本中，我写了

sudo -E pip install jep    
export JEP_PATH=$(pip show jep | grep "^Location:" | cut -d ':' -f 2,3 | cut -d ' ' -f 2)

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$JEP_PATH/jep

不过，我仍然得到以下错误(在java.library.path中没有jep )

20/01/07 09:07:23 WARN org.apache.spark.scheduler.TaskSetManager: Lost task 4.0 in stage 9.0 (TID 74, fs-xxxx-xxx-xxxx-test-w-1.c.xx-xxxx.internal, executor 1): java.lang.UnsatisfiedLinkError: no jep in java.library.path
    at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1867)
    at java.lang.Runtime.loadLibrary0(Runtime.java:870)
    at java.lang.System.loadLibrary(System.java:1122)
    at jep.MainInterpreter.initialize(MainInterpreter.java:128)
    at jep.MainInterpreter.getMainInterpreter(MainInterpreter.java:101)
    at jep.Jep.<init>(Jep.java:256)
    at jep.SharedInterpreter.<init>(SharedInterpreter.java:56)
    at dunnhumby.sciencebank.SubsCommons$$anonfun$getUnitVecEmbeddings$1.apply(SubsCommons.scala:33)
    at dunnhumby.sciencebank.SubsCommons$$anonfun$getUnitVecEmbeddings$1.apply(SubsCommons.scala:31)
    at org.apache.spark.sql.execution.MapPartitionsExec$$anonfun$6.apply(objects.scala:196)
    at org.apache.spark.sql.execution.MapPartitionsExec$$anonfun$6.apply(objects.scala:193)
    at org.apache.spark.rdd.RDD$$anonfun$mapPartitionsInternal$1$$anonfun$apply$25.apply(RDD.scala:827)
    at org.apache.spark.rdd.RDD$$anonfun$mapPartitionsInternal$1$$anonfun$apply$25.apply(RDD.scala:827)
    at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:38)
    at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:323)
    at org.apache.spark.rdd.RDD.iterator(RDD.scala:287)
    at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:38)
    at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:323)
    at org.apache.spark.rdd.RDD.iterator(RDD.scala:287)
    at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:38)
    at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:323)
    at org.apache.spark.rdd.RDD.iterator(RDD.scala:287)
    at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:87)
    at org.apache.spark.scheduler.Task.run(Task.scala:108)
    at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:338)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:748)

(编辑)：-

1.)我已经看到了具体的答案，现场机器，但不是谷歌云平台。

2.)我找到了https://github.com/ninia/jep/issues/141，但这没什么用

3.)我也找到了回答，但这没有得到回答，也没有被Google平台所接受。我甚至完成了从那里开始的所有步骤。

4.)如果问题少了一些快照，我会附上。但请提供一些评论。

(编辑：- 08012020我正在添加使用的install.sh )

#!/bin/bash

set -x -e

# Disable ipv6 since it seems to cause intermittent SocketTimeoutException when collecting data
# See CENG-1268 in Jira
printf "\nnet.ipv6.conf.default.disable_ipv6=1\nnet.ipv6.conf.all.disable_ipv6=1\n" >> /etc/sysctl.conf
sysctl -p

if [[ $(/usr/share/google/get_metadata_value attributes/dataproc-role) == Master ]]; then
    config_bucket="$(/usr/share/google/get_metadata_value attributes/dataproc-cluster-configuration-directory | cut -d'/' -f3)"
    dataproc_cluster_name="$(/usr/share/google/get_metadata_value attributes/dataproc-cluster-name)"
    hdfs dfs -mkdir -p gs://${config_bucket}/${dataproc_cluster_name}/spark_events
    systemctl restart spark-history-server.service
fi

tee -a /etc/hosts << EOM
$$(/usr/share/google/get_metadata_value /attributes/preprod-mjr-dataplatform-metrics-mig-ip) influxdb
EOM

echo "[global]
index-url = https://cs-anonymous:XXXXXXXX@artifactory.xxxxxxxx.com/artifactory/api/pypi/pypi-remote/simple" >/etc/pip.conf

PIP_REQUIREMENTS_FILE=gs://preprod-xxx-dpl-artif/dataproc/requirements.txt
PIP_TRANSITIVE_REQUIREMENTS_FILE=gs://preprod-xxx-dpl-artif/dataproc/transitive-requirements.txt

gsutil cp ${PIP_REQUIREMENTS_FILE} .
gsutil cp ${PIP_TRANSITIVE_REQUIREMENTS_FILE} .
gsutil -q cp gs://preprod-xxx-dpl-artif/dataproc/apt-transport-https_1.4.8_amd64.deb /tmp/apt-transport-https_1.4.8_amd64.deb

export http_proxy=http://preprod-xxx-securecomms.preprod-xxx-securecomms.il4.us-east1.lb.dh-xxxxx-media-55595.internal:3128
export https_proxy=http://preprod-xxx-securecomms.preprod-xxx-securecomms.il4.us-east1.lb.dh-xxxxx-media-55595.internal:3128
export no_proxy=google.com,googleapis.com,localhost
echo "deb https://cs-anonymous:Welcome123@artifactory.xxxxxxxx.com/artifactory/debian-main-remote stretch main" >/etc/apt/sources.list.d/main.list
echo "deb https://cs-anonymous:Welcome123@artifactory.xxxxxxxx.com/artifactory/maria-db-debian stretch main" >>/etc/apt/sources.list.d/main.list
echo 'Acquire::CompressionTypes::Order:: "gz";' > /etc/apt/apt.conf.d/02update
echo 'Acquire::http::Timeout "10";' > /etc/apt/apt.conf.d/99timeout
echo 'Acquire::ftp::Timeout "10";' >> /etc/apt/apt.conf.d/99timeout
sudo dpkg -i /tmp/apt-transport-https_1.4.8_amd64.deb
sudo apt-get install --allow-unauthenticated -y /tmp/apt-transport-https_1.4.8_amd64.deb
sudo -E apt-get update --allow-unauthenticated -y -o Dir::Etc::sourcelist="sources.list.d/main.list" -o Dir::Etc::sourceparts="-" -o APT::Get::List-Cleanup="0"

sudo -E apt-get --allow-unauthenticated -y install python-pip gcc python-dev python-tk curl
#requires index-url specifying because the version of pip installed by previous command
#installs an old version that doesn't seem to recognise pip.conf
sudo -E pip install --index-url https://cs-anonymous:xxxxxxx@artifactory.xxxxxxxx.com/artifactory/api/pypi/pypi-remote/simple --ignore-installed pip setuptools wheel

sudo -E pip install jep

sudo -E pip install gensim

JEP_PATH=$(pip show jep | grep "^Location:" | cut -d ':' -f 2,3 | cut -d ' ' -f 2)

cat << EOF >> /etc/spark/conf/spark-env.sh

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$JEP_PATH/jep
export LD_PRELOAD=$LD_PRELOAD:$JEP_PATH/jep
EOF

tee -a /etc/spark/conf/spark-defaults.conf << EOM
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$JEP_PATH/jep
export LD_PRELOAD=$LD_PRELOAD:$JEP_PATH/jep
EOM

tee -a /etc/*bashrc << EOM
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$JEP_PATH/jep
export LD_PRELOAD=$LD_PRELOAD:$JEP_PATH/jep
EOM

source /etc/*bashrc

sudo -E apt-get install --allow-unauthenticated -y \
  pkg-config \
  freetype* \
  python-matplotlib \
  libpq-dev \
  libssl-dev \
  libcrypto* \
  python-dev \
  libtext-csv-xs-perl \
  libmysqlclient-dev \
  libfreetype* \
  libzmq3-dev \
  libzmq3*


sudo -E pip install -r ./requirements.txt

apache-spark

google-cloud-platform

google-cloud-dataproc

jep

回答 1

Stack Overflow用户

回答已采纳

发布于 2020-01-07 18:00:23

假设您使用install.sh作为Dataproc的init操作，您的export命令只会在运行init操作的本地shell会话中导出这些环境变量，而不是持久地导出此后运行的所有Spark进程的环境变量。

让Spark使用自定义环境变量的方法是将它们添加到/etc/spark/conf/spark-env.sh中。这是一个星火中如何设置java.library.path的火花用户讨论。

本质上，您只需在导出环境变量的部分的init操作中使用heredoc即可。但是，如https://issues.apache.org/jira/browse/SPARK-1719中所示，环境变量不足以将库路径传播到纱线中的执行器中；激发显式设置库路径。而不是通过LD_LIBRARY_PATH传播，因此在spark-defaults.conf中也必须使用spark.executor.extraLibraryPath

JEP_PATH=$(pip show jep | grep "^Location:" | cut -d ':' -f 2,3 | cut -d ' ' -f 2)

# spark-env.sh for driver process.
cat << EOF >> /etc/spark/conf/spark-env.sh
# Note that backslash before $LD_LIBRARY_PATH on the right hand side;
# it is important that the variable is evaluated in spark-env.sh rather
# than clobbering it with the local $LD_LIBRARY_PATH of the init action
# running process.
export LD_LIBRARY_PATH=\$LD_LIBRARY_PATH:$JEP_PATH/jep
EOF

# For executor processes
cat << EOF >> /etc/spark/conf/spark-defaults.conf
spark.executor.extraLibraryPath=$JEP_PATH/jep
EOF

票数 3

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/59626224

复制

相似问题

问java.lang.UnsatisfiedLinkError: jep.Jep.init(Ljava/lang/ClassLoader;ZZ)
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问java.lang.UnsatisfiedLinkError: jep.Jep.init(Ljava/lang/ClassLoader;ZZ)EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问java.lang.UnsatisfiedLinkError: jep.Jep.init(Ljava/lang/ClassLoader;ZZ)
EN