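I have a relational data frame with 2 columns (customer and purchase). I want a data frame with one row per unique customer and one column per product, where each product column is an indicator variable showing whether that customer bought that product.
Example: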
df <- data.frame(customer=c("A", "A", "B", "B"), purchase = c("Milk", "Eggs", "Juice", "Milk"))
  customer purchase
1        A     Milk
2        A     Eggs
3        B    Juice
4        B     Milk
I want:
  customer Milk Eggs Juice
1        A    1    1     0
2        B    1    0     1
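We can use dcast from reshape2, with length as the aggregation function. It counts the rows for each customer/product pair; every pair occurs at most once in this example, so the counts are already 0/1 indicators: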
library(reshape2)
dcast(df, customer ~ purchase, fun.aggregate = length, value.var = "purchase")
#   customer Eggs Juice Milk
# 1        A    1     0    1
# 2        B    0     1    1
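If a customer can buy the same product more than once, a count-based reshape returns values greater than 1 instead of a strict indicator. The following is a minimal base R sketch of one way to guard against that, not the original answer's method. The extra "Milk" row for customer B and the names counts, wide and result are assumptions added for illustration.

```r
# Example data with a repeated purchase (B buys Milk twice); the extra row
# is an assumption added to show the difference between counts and indicators.
df <- data.frame(customer = c("A", "A", "B", "B", "B"),
                 purchase = c("Milk", "Eggs", "Juice", "Milk", "Milk"))

# Cross-tabulate customers against products (cells hold purchase counts).
counts <- table(df$customer, df$purchase)

# Turn counts into 0/1 indicators, then into an ordinary data frame.
wide <- as.data.frame.matrix((counts > 0) * 1L)

# Move the customer labels from the row names into a regular column.
result <- cbind(customer = rownames(wide), wide)
rownames(result) <- NULL

print(result)
```

Here B's two Milk rows collapse into a single 1 in the Milk column. As with dcast, the product columns come out in alphabetical order (Eggs, Juice, Milk) rather than in order of first appearance.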