Recall a central goal of statistics is to learn about a parameter \(\theta\) that governs the population or underlying data generating mechanism from which the data came.
The objective of a statistical test is to test a hypothesis concerning the values of one or more of the population parameters. We generally have a theory, or a research hypothesis, about the parameter(s) that we are seeing if the data support.
The research hypothesis is often denoted by \(H_a\), also known as the alternative hypothesis.
The default state, or the hypothesis that we “fall back on” if \(H_a\) cannot be proven, is the null hypothesis, or \(H_0\).
Decision rules for testing hypotheses
Hypothesis tests are typically carried out in the following manner:
Specify \(H_0\) and \(H_a\), two mutually exclusive states of reality.
Obtain a test statistic \(T\), which contains information about \(\theta\).
Specify a critical value \(c\) and corresponding rejection region (RR).
If \(T\) falls in RR, reject \(H_0\). Otherwise fail to reject.
Decision errors
It’s of course possible that we make the incorrect decision, given the true state of reality.
The types of errors we can make under the two possible states of reality (the “null reality” and the “alternative reality”) can be summarized in 2x2 table format:
Decision
\(H_0\) true
\(H_0\) false
Reject \(H_0\)
Type-I Error
Correct decision
Fail to reject \(H_0\)
Correct decision
Type-II error
Decision probabilities
The probabilities of the two error types we notate with \(\alpha\) and \(\beta\).
\(\alpha\):
The probability of committing a Type-I error, i.e. \(P(Reject\ H_0| H_0\ true)\).
Equivalently: \(P(T \in RR | H_0\ true)\).
Often referred to as the significance level or the size of the test.
\(\beta\):
The probability of committing a Type-II error is \(P(Fail\ to\ reject\ H_0|H_0\ false)\).
Equivalently: \(P(T \not \in RR | H_0\ false)\).
The power of a test is \(1-\beta = P(Reject\ H_0|H_0\ false)\).
The \(\alpha\)-\(\beta\) relationship
Implication: Power can always be trivially increased if \(\alpha\) is increased
Consider: what sort of “brain dead” test would have \(Power=1\)? What is \(\alpha\) for this test?
In practice: set \(\alpha\) to something small, try to maximize power.
Finding \(\alpha\) and \(\beta\)
To find \(\alpha\):
Specify sampling distribution of \(T\) under \(H_0\) state of reality
Find \(P(T\in RR | H_0\ true)\)
To find \(\beta\)/power:
Specify sampling distribution of \(T\) under an \(H_a\) state of reality
A sample of size \(n=1\) is drawn from a population with the following pdf:
\[f_Y(y) = (1+\theta) y^\theta; 0\leq y \leq 1.\]
Test:
\[H_0: \theta = 2\]\[H_a: \theta > 2\]
Tasks:
Should we reject \(H_0\) for large or small \(Y\)?
What is the size of the test if the decision rule is to reject for \(Y>3/4\)?
What is the RR for a size-5% test?
Find the power function of the size-5% test as a function of \(\theta_a\).
Task 1: Reject for large or small \(Y\)?
Since \(Y\sim BETA(\theta+1,1)\):
\[E(Y) = \frac{\theta+1}{\theta+2}\]
Larger values of \(\theta\Rightarrow\) larger expected \(Y\); makes sense to reject \(H_0\) in favor of \(H_a: \theta= ``large"\) for large values of \(Y\).