Answering a biological question with statistics has a few steps that should be followed to communicate your result clearly and completely to your audience.


This example uses a dataset about vital capacity of the lungs in college students.


Step in process Example
1. State the question to be answered as clearly as possible. This may be done before or after gathering data. Do male and female students have equivalent lung capacities?
2. Characterize the sample and the variables, including their scale of measurement and a test for normality. Include graphs or tables as appropriate. Lung capacity is measured at the ratio scale. Sex is measured as two nominal-scale groups.

3. Decide on a statistical test and tailedness. This is based on the sampling distribution and the characteristics of your variable(s). I first used a Shapiro-Wilk test to see if the log-transformed lung capacities are normally-distributed and found that they were (W = 0.99, P = 0.09). I’ll compare these means with a two-sample t test using Welch’s method (which doesn’t assume equality of variances). This is a two-tailed test as the question is stated because I was looking for “different.”
4. State the null and alternative hypotheses to be tested

- \(H_0\): the hypothesis of “no difference” between the sample and random chance.
- Ha: phrased to be the only other choice if \(H_0\) is rejected
\(H_0\): Male and female students have equivalent lung capacities (\(\mu_{male}=\mu_{female}\)).

\(H_a\): Male and female lung capacities are different (\(\mu_{male}\ne\mu_{female}\)).
5. Determine the appropriate confidence level (\(\alpha\)) at or below which \(H_0\) would be rejected. Let alpha α = 0.05.
6. Calculate the test with the function in R. I’m showing the output here—that’s okay for homework—but I wouldn’t include this in the paper. t.test(lungs$lnvc ~ lungs$sex)

Welch Two Sample t-test

data: lungs$lnvc by lungs$sex t = -17.144, df = 108.87, p-value < 2.2e-16 alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: -0.4062970 -0.3220898
sample estimates:
mean in group female mean in group male
8.055377 8.419570
7. Make a decision about \(H_{0}\) by comparing the test statistic with the predetermined confidence level. The reported p-value is far smaller than 0.05, so I reject the null hypothesis.
8. Report your conclusion in words relevant to the original question, and include the test statistic, degrees of freedom (if applicable), and P-value (level of significance) whether \(H_0\) is rejected or not. Men and women have different log-transformed vital capacities (t = -17.144, df = 109, P < 0.001).
Note that even though P is wayyyy lower than 0.05, I wouldn’t recommend reporting a P-value more extreme than “P < 0.001.”


This actually isn’t the best example because, as we’ll see, it’s not sex that determine VC but body size. As you’d expect, most of my male students were taller than my female students, so they have bigger VCs. Women and men with the same height are not significantly different from each other. But we’ll wait to cover this until we do regression.