我尝试将一个函数(weight.func)传递给另一个调用ddply的函数(包装器)。我希望ddply使用该函数(weight.func)作为其计算的一部分。当weight.func被设置为“全局”时,我得到了我想要的输出,但当它作为匿名函数传递给包装器时却没有。
我能让ddply做我想做的事吗?下面是一个代码示例:
> print(sampleData)
studentId problem part workerId rating
1 8001 problem26 partA A127R5QI5OGBIK 0.0
2 8001 problem26 partA A1FCLYRBAB430F 0.0
3 8001 problem26 partA A25FZQY34C6RVO 0.0
4 8001 problem26 partA A3G0MO562MHMZ3 0.5
5 8001 problem26 partA A3RB9ZOIUC3NWG 2.0
6 8001 problem26 partB A1FCLYRBAB430F 0.5
7 8001 problem26 partB A1XRDZKSJBWY8Q 0.5
8 8001 problem26 partB A22CRWMZUX7FFR 0.5
9 8001 problem26 partB A25FZQY34C6RVO 1.0
10 8001 problem26 partB A3G0MO562MHMZ3 0.5
11 8001 problem27 partA A1ET309DW6M2XA 2.0
12 8001 problem27 partA A1FCLYRBAB430F 0.0
13 8001 problem27 partA A22CRWMZUX7FFR 0.0
14 8001 problem27 partA A25FZQY34C6RVO 0.0
15 8001 problem27 partA A3G0MO562MHMZ3 0.0
16 8001 problem27 partB A1FCLYRBAB430F 1.0
17 8001 problem27 partB A22CRWMZUX7FFR 0.0
18 8001 problem27 partB A25FZQY34C6RVO 0.0
19 8001 problem27 partB A2U9676210WST5 0.0
20 8001 problem27 partB A3G0MO562MHMZ3 0.0
21 8002 problem26 partA A127R5QI5OGBIK 0.0
22 8002 problem26 partA A1FCLYRBAB430F 0.5
23 8002 problem26 partA A22CRWMZUX7FFR 0.0
24 8002 problem26 partA A25FZQY34C6RVO 2.0
25 8002 problem26 partA A3G0MO562MHMZ3 0.5
26 8002 problem26 partB A17EHJZNJGNRAN 2.0
27 8002 problem26 partB A1FCLYRBAB430F 0.0
28 8002 problem26 partB A2IPRDTE6B4TAB 0.0
29 8002 problem26 partB A3G0MO562MHMZ3 0.0
30 8002 problem26 partB A6SON3OS15XKA 0.0
31 8002 problem27 partA A1FCLYRBAB430F 0.0
32 8002 problem27 partA A25FZQY34C6RVO 0.0
33 8002 problem27 partA A2IPRDTE6B4TAB 0.0
34 8002 problem27 partA A2U9676210WST5 0.0
35 8002 problem27 partA A3G0MO562MHMZ3 0.0
36 8002 problem27 partB A1FCLYRBAB430F 0.0
37 8002 problem27 partB A1V52SSKROBV8E 2.0
38 8002 problem27 partB A25FZQY34C6RVO 2.0
39 8002 problem27 partB A2IPRDTE6B4TAB 0.0
40 8002 problem27 partB A3G0MO562MHMZ3 0.0
>
> #Make a wrapper
> wrapper <- function ( ratingData, weight.func ) {
+ print(weight.func) #prove that the function is being passed
+ ddply(ratingData, c('studentId','problem','part'), summarize,
+ sum.weights = sum ( weight.func(rating) ))
+ }
> wrapper( sampleData, weight.func=function(x) (x+.001)^-1 )
function(x) (x+.001)^-1
Error in data.frame(sum.weights = sum(weight.func(rating))) :
could not find function "weight.func"
>
> #'globally' declare weight.func
> weight.func <- function(x) (x+.001)^-1
> wrapper( sampleData, weight.func=NULL )
NULL
studentId problem part sum.weights
1 8001 problem26 partA 3002.495758
2 8001 problem26 partB 8.983033
3 8001 problem27 partA 4000.499750
4 8001 problem27 partB 4000.999001
5 8002 problem26 partA 2004.491766
6 8002 problem26 partB 4000.499750
7 8002 problem27 partA 5000.000000
8 8002 problem27 partB 3000.999500第二个输出是目标。感谢任何人的帮助!(包括一种非基于plyr的方法来完成相同的任务。)
上面的例子是一个玩具例子。这是我能重现行为的最简单的例子。
发布于 2010-11-27 11:02:29
您可以使用aggregate:
w2 <- function(d, f){
aggregate(rating~studentId+problem+part, function(x)sum(f(x)), data=d)
}
w2( sampleData, function(x) (x+.001)^-1 )请注意,聚合列的名称是自动确定的,因此,如果要命名,则需要自己命名。
你可以通过ddply来做同样的事情,而不需要总结。
wrapper <- function ( ratingData, weight.func ) {
ddply(ratingData, c('studentId','problem','part'), function(x)c(sum.weights=sum(weight.func(x$rating))))
}
wrapper( sampleData, weight.func=function(x) (x+.001)^-1 )在这种情况下,您可以在函数内部指定名称。
发布于 2010-12-01 02:53:37
这是plyr中的一个已知错误:https://github.com/hadley/plyr/issues#issue/3
发布于 2010-11-27 10:22:09
我不确定我做了哪项更改(去掉“>>”后面的空格,或者将NULL改为一个实函数或<< something >> ),但现在它可以工作了:
wrapper <- function ( ratingData, weight.func=weight.func) {
ddply(ratingData, .variables=c('studentId','problem','part'),
.fun=summarise, sum.weights = sum(weight.func(rating) ))
}
wrapper( sampleData, weight.func=weight.func )
studentId problem part sum.weights
1 8001 problem26 partA 3002.495758
2 8001 problem26 partB 8.983033
3 8001 problem27 partA 4000.499750
4 8001 problem27 partB 4000.999001
5 8002 problem26 partA 2004.491766
6 8002 problem26 partB 4000.499750
7 8002 problem27 partA 5000.000000
8 8002 problem27 partB 3000.999500https://stackoverflow.com/questions/4289602
复制相似问题