If assumptions of the t-distribution are true
- data come from a normal distribution (normality)
- both standard deviations are identical (equality of variance, homoscedasticity)
- and \(\mu_x\) = \(\mu_y\) or \(\mu_x - \mu_y = 0\)
Then, the difference in the means is distributed according to a t-distribution
\[ t(n_x + n_y -2) \sim \frac{\bar{x} - \bar{y}}{\text{se}} \]
with \(n - 2\) degrees of freedom.