我试图让R(通过笔记本或RStudio)连接到DatabrickAzure10.1上的MariaDB。但是,无论是将RMariaDB添加到集群的库选项卡中,还是通过install.packages("RMariaDB")在RStudio中添加,我都会失败,因为:
-----------------------------[ ANTICONF ]-----------------------------
Configure could not find suitable mysql/mariadb client library. Try installing:
* deb: libmariadb-dev (Debian, Ubuntu)
* rpm: mariadb-connector-c-devel | mariadb-devel | mysql-devel (Fedora, CentOS, RHEL)
* csw: mysql56_dev (Solaris)
* brew: mariadb-connector-c (OSX)
If you already have a mysql client library installed, verify that either
mariadb_config or mysql_config is on your PATH. If these are unavailable
you can also set INCLUDE_DIR and LIB_DIR manually via:
R CMD INSTALL --configure-vars='INCLUDE_DIR=... LIB_DIR=...'
--------------------------[ ERROR MESSAGE ]----------------------------
<stdin>:1:10: fatal error: mysql.h: No such file or directory
compilation terminated.
-----------------------------------------------------------------------我在databricks上安装了python、R和java文件,但没有安装C库。我找到了可以下载到笔记本电脑的ubuntu库,但是databricks中的“上传库”功能似乎只是想要jars。
有人知道如何让R在数据库中与MariaDB对话吗?或者,是否可以在笔记本的python单元(我有此功能)中执行查询并访问R单元中的数据?
谢谢
发布于 2021-12-17 16:21:05
在Spark/Databricks上这样做的最简单方法是使用spark.read.jdbc (参见文档) --您只需要提供JDBC、用户名和密码。
sparkR.session()
jdbcUrl <- "jdbc:mysql://<host>:3306/databasename"
df <- read.jdbc(jdbcUrl, "table", user = "username", password = "password")https://stackoverflow.com/questions/70394844
复制相似问题