我已经读了下面的问题,但不能很好地解决这个问题。尝试通过张贴表单登录网站。请阅读以下内容:
工作示例:
正在加载程序包:
install.packages("RHTMLForms", repos = "http://www.omegahat.org/R", type = "source") require(RHTMLForms)
require(RCurl)
require(XML)我正在连接到https://www.moodys.com/
url <- getURL("https://www.moodys.com/",
cainfo = system.file("CurlSSL",
"cacert.pem", package = "RCurl"))获取登录表单:
forms <- getHTMLFormDescription(url)并将表单发回:
fun <- createFunction(forms$aspnetForm)
results <- fun(MdcUserName = "xxx@xxx", MdcPassword="xxxx")这给出了以下错误消息:
Error in function (type, msg, asError = TRUE) :
Could not resolve host: NA; Host not found我知道复制/解决此错误可能需要有效的用户名和密码,但非常感谢。
类似的问题:
R - posting a login form using RCurl
https://stackoverflow.com/questions/19327001/https-php-login-via-rcurl-post
What if I want to web scrape with R for a page with parameters?
发布于 2014-06-20 04:01:48
您可以使用Selenium和RSelenium登录到该网页
library(RSelenium)
RSelenium::startServer()
appURL <- "http://www.moodys.com"
username <- "someuser"
password <- "somepass"
remDr <- remoteDriver()
remDr$open()
remDr$navigate(appURL)
logIn <- remDr$findElement("id", "LoginText")
logIn$clickElement()
userName <- remDr$findElement("id", "MdcUserName")
userName$sendKeysToElement(list(username))
passWord <- remDr$findElement("id", "MdcPassword")
passWord$sendKeysToElement(list(password))
logIn <- remDr$findElement("id", "LoginImageButton")
logIn$clickElement()https://stackoverflow.com/questions/24314622
复制相似问题