I have a large data frame, and I run this on it:
dcast(mydata, People ~ Categories, value.var = "Answer Count", fun.aggregate = sum)
This is the result:
People category1 category2
Marge 3,648 6,402
Homer 3,586 6,684
Bart 3,469 7,119
Lisa 4,045 6,758
Maggie 2,847 5,748
Also, this:
dcast(mydata, People ~ Categories, value.var = "Answer Count", fun.aggregate = length)
produces this:
People category1 category2
Marge 2,531 4,516
Homer 2,535 4,512
Bart 2,542 4,563
Lisa 2,501 4,488
Maggie 2,517 4,513
What I actually want to do is something like this:
dcast(mydata, People ~ Categories, value.var = "Answer Count", sum / length / 6)
and get these values:
People category1 category2
Marge 0.240221256 0.236271036
Homer 0.235765943 0.246897163
Bart 0.227445581 0.260026298
Lisa 0.269558843 0.250965538
Maggie 0.188518077 0.212275648
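I tried manipulating the fun.aggregate argument, but I'm not sure whether that is the right path or whether I simply don't know what I'm doing. Can anyone help me with this? (Side note: this sample has two categories. The real data has more than 40.)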
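
One possible direction, sketched under assumptions: since fun.aggregate accepts any function that takes a vector and returns a single value, the "sum / length / 6" idea can be written as one small function. The sketch below assumes dcast comes from the reshape2 package (the post does not say which package is used), and the toy data and the helper name scaled_mean are hypothetical, made up only for illustration.

```r
library(reshape2)  # assumption: dcast is reshape2::dcast

# Hypothetical toy data; only the column names mirror the question.
mydata <- data.frame(
  People = c("Marge", "Marge", "Marge", "Homer", "Homer", "Homer"),
  Categories = c("category1", "category1", "category2",
                 "category1", "category2", "category2"),
  `Answer Count` = c(2, 4, 6, 3, 12, 6),
  check.names = FALSE  # keep the space in "Answer Count"
)

# Hypothetical helper: sum divided by length, divided by 6,
# applied to the values that fall into each People x Categories cell.
scaled_mean <- function(x) sum(x) / length(x) / 6

result <- dcast(mydata, People ~ Categories,
                value.var = "Answer Count",
                fun.aggregate = scaled_mean)
print(result)
```

Because sum(x) / length(x) is the mean, the helper could equally be written as function(x) mean(x) / 6. The formula is unchanged, so the same call should cover 40 or more categories without listing them.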