I'll be using this data set to show how to compute descriptive statistics for groups within a data set when the data are in long format (one row per observation, with a column identifying the group) rather than wide format.
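To make the long versus wide distinction concrete, here is a small sketch using base R's reshape function on the built-in sleep data. It assumes the standard sleep data set from the datasets package, with columns extra, group and ID; the variable name sleep_wide is only for illustration.

```r
# The sleep data is long: one row per (ID, group) pair, 20 rows in total.
head(sleep)

# Reshape to wide: one row per ID, with one 'extra' column per group.
# sleep_wide is an illustrative name, not something used elsewhere in the post.
sleep_wide <- reshape(sleep,
                      idvar = "ID",       # identifies each subject
                      timevar = "group",  # the grouping column to spread out
                      direction = "wide")

head(sleep_wide)
```

In the wide version each subject has a single row, so a function that groups by a column has nothing to group on. That is why the long layout is the one we want here.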
First, we'll need to load up the psych package. The datasets package containing our data is probably already loaded.
library(psych)
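The describe.by function in the psych package does the work for us here. It splits our data by a grouping variable we give it and prints descriptive statistics for each group. (Recent versions of psych call this function describeBy; describe.by is the older, deprecated name.) Variables marked with an asterisk in the output are factors that were converted to numbers, so their statistics are not meaningful.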
> describe.by(sleep, sleep$group)
group: 1
       var  n mean   sd median trimmed  mad  min  max range skew kurtosis   se
extra    1 10 0.75 1.79   0.35    0.68 1.56 -1.6  3.7   5.3 0.42    -1.30 0.57
group*   2 10 1.00 0.00   1.00    1.00 0.00  1.0  1.0   0.0  NaN      NaN 0.00
ID*      3 10 5.50 3.03   5.50    5.50 3.71  1.0 10.0   9.0 0.00    -1.56 0.96
------------------------------------------------------------
group: 2
       var  n mean   sd median trimmed  mad  min  max range skew kurtosis   se
extra    1 10 2.33 2.00   1.75    2.24 2.45 -0.1  5.5   5.6 0.28    -1.66 0.63
group*   2 10 2.00 0.00   2.00    2.00 0.00  2.0  2.0   0.0  NaN      NaN 0.00
ID*      3 10 5.50 3.03   5.50    5.50 3.71  1.0 10.0   9.0 0.00    -1.56 0.96
Of course, there are other ways to find the descriptive statistics of groups. If you'll be doing further analysis on the groups and splitting the whole data set into subsets anyway, it may be easiest to just use describe on each subset. But that's a topic for another post. describe.by is an easy way to look at many groups at once, and a quick first look is exactly what descriptive statistics are for.
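As a rough sketch of the subset approach, the example below splits the sleep data by group and summarises each piece. It uses only base R functions so that it runs without psych; summarise_extra is a hypothetical helper name made up for this sketch, and it reports only a few of the statistics that describe would give you.

```r
# Split the long-format sleep data into one data frame per group.
subsets <- split(sleep, sleep$group)

# Summarise the 'extra' column of one subset with base R functions.
# summarise_extra is a hypothetical helper, not part of psych or base R.
summarise_extra <- function(d) {
  c(n      = length(d$extra),
    mean   = mean(d$extra),
    sd     = sd(d$extra),
    median = median(d$extra))
}

# Apply the helper to every subset; the result is a matrix with
# one column per group and one row per statistic.
group_stats <- sapply(subsets, summarise_extra)
print(round(group_stats, 2))
```

With psych loaded, you could swap the helper for describe(d$extra) to get the full set of statistics for each subset.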