Reshape multiple categorical variables to binary response variables
How much spice is too much? Here is a solution via tidyr
:
library(dplyr)
library(tidyr)
mydata %>%
gather(actor,name,starts_with("actor")) %>%
mutate(present = 1) %>%
select(-actor) %>%
spread(name,present,fill = 0)
movie Jack Kate Leo
1 Departed 1 0 1
2 Titanic 0 1 1
One way to reshape your data.frame
is with the reshape2
package, using melt
and dcast
. For example:
library(reshape2)
long.mydata <- melt(mydata, id.vars = "movie")
wide.mydata <- dcast(long.mydata, movie ~ value, function(x) 1, fill = 0)
Pay attention to the fun.aggregate
and fill
parameters in dcast
, which control what goes to fill in the interior after casting.