我正在尝试通过Python语言中的Rpy2在这个数据文件上调用R函数loess:http://filebin.ca/azuz9Piv0z8/test.data
当我使用数据的一个子集(前1000个点)时,它是有效的,但当我尝试使用整个文件时,我得到一个错误。我的代码:
import pandas
from rpy2.robjects import r
import rpy2.robjects as robjects
data = pandas.read_table(os.path.expanduser("~/test2.data"), sep="\t").values
small_data = data[0:1000, :]
print "small data loess:"
a, b = robjects.FloatVector(list(small_data[:, 0])), \
robjects.FloatVector(list(small_data[:, 1]))
df = robjects.DataFrame({"a": a, "b": b})
loess_fit = r.loess("b ~ a", data=df)
print loess_fit
print "large data loess:"
a, b = robjects.FloatVector(list(data[:, 0])), \
robjects.FloatVector(list(data[:, 1]))
df = robjects.DataFrame({"a": a, "b": b})
loess_fit = r.loess("b ~ a", data=df)
print loess_fit适用于small_data,但不适用于data。我得到了错误:
Error in simpleLoess(y, x, w, span, degree, parametric, drop.square, normalize, :
NA/NaN/Inf in foreign function call (arg 1)
loess_fit = r.loess("b ~ a", data=df)
File "/usr/local/lib/python2.7/dist-packages/rpy2-2.3.3-py2.7-linux-x86_64.egg/rpy2/robjects/functions.py", line 86, in __call__
return super(SignatureTranslatedFunction, self).__call__(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/rpy2-2.3.3-py2.7-linux-x86_64.egg/rpy2/robjects/functions.py", line 35, in __call__
res = super(Function, self).__call__(*new_args, **new_kwargs)
rpy2.rinterface.RRuntimeError: Error in simpleLoess(y, x, w, span, degree, parametric, drop.square, normalize, :
NA/NaN/Inf in foreign function call (arg 1)如何解决这个问题?我不确定是R函数loess有问题还是Rpy2接口有问题?谢谢。
发布于 2013-03-22 04:58:16
问题在于数据中的-Inf值:
DF <- read.table('http://filebin.ca/azuz9Piv0z8/test.data')
DF[!is.finite(DF[,1]) | !is.finite(DF[,2]),]
# V1 V2
# 5952 -Inf -Inf发布于 2013-07-11 00:22:04
既然可以使用statsmodels package in Python for lowess smoothing,为什么还要调用R呢?
还有一个针对lowess的Bio.Statistics包,但它似乎不那么准确,而且我无法让它针对this lowess example收敛。
https://stackoverflow.com/questions/15557770
复制相似问题