- Refresher on independent samples t-test
- Deriving a t-test from the descriptive statistics (mean, standard deviation, sample size)
- Calculating the standard error
- Calculating the t-value
- Calculating the p-value
- Calculating the confidence interval
If assumptions of the t-distribution are true
Then, the difference in the means is distributed according to a t-distribution
\[ t(n_x + n_y -2) \sim \frac{\bar{x} - \bar{y}}{\text{se}} \]
with \(n - 2\) degrees of freedom.
The denominator of the t-statistic is known as the standard error which is calculated according to
\[ \text{se} = \sqrt{\frac{(n_x - 1) \times s_x^2 + (n_y - 1) \times s_y^2}{n_x + n_y - 2}} \times \sqrt{\frac{1}{n_x} + \frac{1}{n_y}} \]
Numerator:
\[ (n_x - 1) \times s_x^2 + (n_y - 1) \times s_y^2 \]
Denominator:
\[ n_x + n_y - 2 \] Fraction:
\[ \text{left term} = \sqrt\frac{\text{numerator}}{\text{denominator}} \]
\[ \text{right term} = \sqrt{\frac{1}{n_x} + \frac{1}{n_y}} \]
Product of the left and the right side of the SE equation.
\[ \text{se} = \text{left term} \times \text{right term} \]
The confidence interval for the difference in means is
\[ (\bar{x} - \bar{y}) \pm \tau \times \text{se} \] where \(\pm\tau\) are the lower and upper values in a t-distribution which contains 95%, 99% etc. of the area under the curve.
Again, the shape of the t-distribution depends on the degrees of freedom, so \(tau\) does too.
Set of all hypothetical values that we do not rule out at the \(\alpha < 0.05\) level of significance is known as the 95% confidence interval.
\[ (\bar{x} - \bar{y}) \pm \tau \times \text{se} \]
\(\pm\tau\) are values in t-distribution that embrace 95% of the area under the curve.
Set of all hypothetical values that we do not rule out at the \(\alpha < 0.05\) level of significance is known as the 95% confidence interval.
\[ (\bar{x} - \bar{y}) \pm \tau \times \text{se} \]
\(\pm\tau\) are values in t-distribution that embrace 95% of the area under the curve.
Set of all hypothetical values that we do not rule out at the \(\alpha < 0.05\) level of significance is known as the 95% confidence interval.
\[ (\bar{x} - \bar{y}) \pm \tau \times \text{se} \]
\(\pm\tau\) are values in t-distribution that embrace 95% of the area under the curve.
Set of all hypothetical values that we do not rule out at the \(\alpha < 0.05\) level of significance is known as the 95% confidence interval.
\[ (\bar{x} - \bar{y}) \pm \tau \times \text{se} \]
\(\pm\tau\) are values in t-distribution that embrace 95% of the area under the curve.
Set of all hypothetical values that we do not rule out at the \(\alpha < 0.05\) level of significance is known as the 95% confidence interval.
\[ (\bar{x} - \bar{y}) \pm \tau \times \text{se} \]
\(\pm\tau\) are values in t-distribution that embrace 95% of the area under the curve.
Set of all hypothetical values that we do not rule out at the \(\alpha < 0.05\) level of significance is known as the 95% confidence interval.
\[ (\bar{x} - \bar{y}) \pm \tau \times \text{se} \]
\(\pm\tau\) are values in t-distribution that embrace 95% of the area under the curve.