文章/答案/技术大牛

发布

社区首页 >问答首页 >RSelenium将刮起雅虎财经新闻的头条

问RSelenium将刮起雅虎财经新闻的头条
EN

Stack Overflow用户

提问于 2021-08-27 12:03:24

回答 1查看 27关注 0票数 1

我想从雅虎得到一家公司的新闻标题。我使用RSelenium启动远程浏览器并接受cookies。我找到了替代的css类"StretchedBox“，并且我可以通过浏览器检查从字面上看到标题。如何存储这些标题？接下来，我想使用RSelenium向下滚动并保存更多这些元素(比如保存几天)。

library('RSelenium')

# Start Remote Browser
rD <- rsDriver(port = 4840L, browser = c("firefox")) 
remDr <- rD[["client"]]

# Navigate to Yahoo Finance News for Specific Company
# This takes unusual long time 
remDr$navigate("https://finance.yahoo.com/quote/AAPL/news?p=AAPL")

# Get "accept all cookies" botton
webElems <- remDr$findElements(using = "xpath", "//button[starts-with(@class, 'btn primary')]") 

# We can check if we did get the proper button by checking the text of the element:
unlist(lapply(webElems, function(x) {x$getElementText()}))

# We found the two button, and we want to click the first one:
webElems[[1]]$clickElement()

# wait for page loading
Sys.sleep(5) 

# I am looking for news headline in or after the StretchedBox
boxes <- remDr$findElements(using = "class", "StretchedBox")
boxes[1] # empty

boxes[[1]]$browserName

web-scraping

xpath

css-selectors

rselenium

回答 1

Stack Overflow用户

回答已采纳

发布于 2021-08-30 11:43:41

最后，我找到了一个可以用来getElementText新闻文章标题的xpath。

library('RSelenium')

# Start Browser
rD <- rsDriver(port = 4835L, browser = c("firefox")) 
remDr <- rD[["client"]]

# Navigate to Yahoo Financial News
remDr$navigate("https://finance.yahoo.com/quote/AAPL/news?p=AAPL")

# Click Accept Cookies
webElems <- remDr$findElements(using = "xpath", "//button[starts-with(@class, 'btn primary')]")
unlist(lapply(webElems, function(x) {x$getElementText()}))
webElems[[1]]$clickElement()

# extract headlines from html/css by xpath 
headlines <- remDr$findElements(using = "xpath", "//h3[@class = 'Mb(5px)']//a")

# extract headline text
headlines <- sapply(headlines, function(x){x$getElementText()})
headlines[1]

[[1]]
[1] "What Kind Of Investors Own Most Of Apple Inc. (NASDAQ:AAPL)?"

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/68953103

复制

相似问题

问RSelenium将刮起雅虎财经新闻的头条
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问RSelenium将刮起雅虎财经新闻的头条EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问RSelenium将刮起雅虎财经新闻的头条
EN