Thanks to everyone who took part! I recieved 124 responses in about 24 hours, which is super awesome. This document gives a quick writeup of the results.
Overall, function usage was much as I expected: ddply
is by far the most commonly used function followed by ldply
and dlply
, then llply
. This is reassuring because for the next iteration of plyr
, I'm planning to focus on ddply
, ldply
and dlply
.
I didn't perform a formal analysis of the free text “other functions”, but common themes were:
join
mutate
, summarise
, arrange
colwise
count
rbind.fill
Again, no formal analysis, but the common themes were:
You like plyr - thanks!
Make plyr faster - this is a big motivation for the next iteration, and initial explorations are promising: I should be able to get a 10-100x speedup for many cases.
Documentation and examples could be better - I know, but good documentation is hard!
A few things that you complained about that are fixed in the current dev version:
summarise now works sequentially (i.e. you can refer to columns you just created)
there's a new progress bar (thanks to Mike Lawrence) that estimates the amount of time remaining
a new here
function makes it easier to use ddply
+ summarise
/mutate
/subset
inside a function