我正在尝试从以下网站下载一个.xlsx文件:https://www.cor.pa.gov/Pages/COVID-19.aspx
在该网站上,有一个链接可以阅读最新的PA DOC囚犯和员工新冠肺炎测试数据。当您单击它时,它将下载.xlsx文件。它不会重定向到不同的页面。
我试过了
download.file("https://www.cor.pa.gov/Pages/COVID-19.aspx/Documents/PA-DOC-COVID-19-Testing.xlsx",destfile="PA-DOC-COVID-19-Testing (1).xlsx")
Penn <- read_excel("PA-DOC-COVID-19-Testing (1).xlsx")但是我得到了错误信息" error : Evaluation error: zip文件‘/myfiledirectory/PA-DOC-新冠肺炎-Testing (1).xlsx’无法打开。‘
我从html代码中获得了download.file URL (/Documents/PA-DOC-新冠肺炎-Testing.xlsx)。
如果有人对如何做到这一点有见解,我将不胜感激。谢谢!
发布于 2020-06-27 04:34:31
你弄错了链接。这是可行的:
download.file("https://www.cor.pa.gov/Documents/PA-DOC-COVID-19-Testing.xlsx", destfile="PA-DOC-COVID-19-Testing.xlsx")
Penn <- readxl::read_excel("PA-DOC-COVID-19-Testing.xlsx")
> head(Penn)
# A tibble: 6 x 23
LOCATION `Staff Testing` ...3 ...4 ...5 ...6 `Inmate Testing` ...8 ...9 ...10 ...11
<chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
1 NA Positive Nega… Pend… Deat… Reco… Positive Nega… Pend… Deat… Reco…
2 Central… 1 1 NA NA NA NA NA NA NA NA
3 Albion NA 7 5 NA NA NA 89 NA NA NA
4 Benner … NA 13 2 NA NA 1 91 NA 1 NA
5 Cambrid… NA 8 1 NA NA NA 163 NA NA NA
6 Camp Hi… 4 8 21 NA 3 1 1333 NA NA NA
# … with 12 more variables: `Inmate Testing Purpose` <chr>, ...13 <chr>, ...14 <chr>, ...15 <chr>,
# ...16 <chr>, ...17 <chr>, ...18 <chr>, ...19 <chr>, ...20 <chr>, ...21 <chr>, ...22 <lgl>,
# ...23 <chr>发布于 2020-06-27 04:31:37
您需要解压缩下载的文件。因为从网络下载的文件是压缩的。你可以在StackOverflow上找到类似帖子的链接:Using R to download zipped data file, extract, and import data
https://stackoverflow.com/questions/62602338
复制相似问题