When we run a statistical test, we start with a null hypothesis \(H_0\) — basically an assumption that nothing interesting is going on.
The p-value is the probability of getting results as extreme (or more extreme) than what we observed, assuming the null hypothesis is true.
- Small p-value → the data is unlikely under \(H_0\) → evidence against \(H_0\)
- Large p-value → the data is consistent with \(H_0\) → not enough evidence to reject
A common threshold is \(\alpha = 0.05\). If \(p < \alpha\), we reject \(H_0\).
Important: A p-value is NOT the probability that \(H_0\) is true.