我有一个包含部分NAs的变量。它是一个变量,表示从startyear到endyear的时间段,但是其中endyear和endyear where缺少到NA的周期范围。
# startyear endyear ongoingasofyear imposition sanctions_period
#1 1945 1947 NA 1 1945-1947
#2 1946 1949 NA 1 1946-1949
#3 1946 1993 NA 1 1946-1993
#4 1946 NA NA 1 1946-NA
#5 1946 1946 NA 1 1946-1946
#6 1946 NA NA 1 1946-NA如何删除sanctions_period中包含NA的观测值?谢谢!
发布于 2016-08-10 00:11:32
这里有一个内置的命令: complete.cases()
d1 <- read.table(header=TRUE,text="
startyear endyear ongoingasofyear imposition sanctions_period
1945 1947 NA 1 1945-1947
1946 1949 NA 1 1946-1949
1946 1993 NA 1 1946-1993
1946 NA NA 1 1946-NA
1946 1946 NA 1 1946-1946
1946 NA NA 1 1946-NA")
d2 <- d1[complete.cases(d1[c("startyear","endyear")]),]发布于 2016-08-10 01:52:27
您还可以使用sqldf
库(Sqldf)
df<- read.csv("df.csv",header= TRUE,sep = "")
sqldf(c("delete from df where sanctions_period like '%-NA%'","select * from df“)
https://stackoverflow.com/questions/38854964
复制相似问题