M. Drew LaMar
February 5, 2021
“Statistics are used much like a drunk uses a lamppost: for support, not illumination.”
- Andrew Lang
Definition: The
sampling distribution represents the distribution of the point estimatesbased on samples of a fixed size from a certain population. It is useful to think of a particular point estimate as being drawn from such a distribution. Understanding the concept of a sampling distribution is central to understanding statistical inference.
Definition: The standard deviation associated with an estimate is called the
standard error . It describes the typical error or uncertainty associated with the estimate.
The standard error is also the standard deviation of the sampling distribution.
http://www.zoology.ubc.ca/~whitlock/kingfisher/SamplingNormal.htm
Definition: The standard error represents the standard deviation associated with the estimate, and roughly 95% of the time the estimate will be within 2 standard errors of the parameter.
An approximate 95% confidence interval for a point estimate is given by \[ \textrm{point estimate} \pm 1.96\times SE \]
Note: For a yuge number of computed 95% confidence intervals, the population parameter will be contained in 95% of the confidence intervals.
If a sample consists of at least 30 independent observations and the data are not strongly skewed, then the sampling distribution for the mean is well approximated by a normal model.
Definition:
Hypothesis testing compares data to what we would expect to see if a specific null hypothesis were true. If the data are too unusual, compared to what we would expect to see if the null hypothesis were true, then the null hypothesis is rejected.
Definition: A
null hypothesis is a specific statement about a population parameter made for the purpose of argument.
Definition: The
alternative hypothesis includes all other feasible values for the population parameter besides the value stated in the null hypothesis.
Can parents distinguish their own children by smell alone? To investigate, Porter and Moore (1981) gave new T-shirts to children of nine mothers. Each child wore his or her shirt to bed for three consecutive nights. During the day, from waking until bedtime, the shirts were kept in individually sealed plastic bags. No scented soaps or perfumes were used during the study. Each mother was then given the shirt of her child and that of another, randomly chosen child and asked to identify her own by smell.
Discuss: What is the
null hypothesis ?alternative hypothesis ?
Can parents distinguish their own children by smell alone? To investigate, Porter and Moore (1981) gave new T-shirts to children of nine mothers. Each child wore his or her shirt to bed for three consecutive nights. During the day, from waking until bedtime, the shirts were kept in individually sealed plastic bags. No scented soaps or perfumes were used during the study. Each mother was then given the shirt of her child and that of another, randomly chosen child and asked to identify her own by smell.
Discuss: What is the
null hypothesis ?alternative hypothesis ?
Answer: With \( p \) the probability of choosing correctly,
\[ H_{0}: \ p = 0.5 \] \[ H_{A}: \ p \neq 0.5 \]
Definition: The
test statistic is a number calculated from the data that is used to evaluate how compatible the data are with the result expected under the null hypothesis.
Definition: The
null distribution is the sampling distribution of outcomes for a test statistic under the assumption that the null hypothesis is true.
Definition: A
\( P \)-value is the probability of obtaining the data (or data showing as great or greater difference from the null hypothesis) if the null hypothesis were true.
Can parents distinguish their own children by smell alone? To investigate, Porter and Moore (1981) gave new T-shirts to children of nine mothers. Each child wore his or her shirt to bed for three consecutive nights. During the day, from waking until bedtime, the shirts were kept in individually sealed plastic bags. No scented soaps or perfumes were used during the study. Each mother was then given the shirt of her child and that of another, randomly chosen child and asked to identify her own by smell. Eight of nine mothers identified their children correctly.
Discuss: What
test statistic should you use?
Answer: The number of mothers with correct identifications.
The following figure shows the null distribution for the number of mothers out of nine guessing correctly.
Discuss: If \( H_{0} \) were true, what is the probability of exactly eight correct identifications?
Answer: Pr[number correct = 8] = 0.018
The following figure shows the null distribution for the number of mothers out of nine guessing correctly.
Discuss: If \( H_{0} \) were true, what is the probability of obtaining eight or more correct identifications?
Answer: Pr[number correct \( \geq \) 8] = 0.018 + 0.002 = 0.02
Discuss: What is the \( P \)-value?
Answer: \( P = 2\times(0.02) = 0.04 \)
Definition: The
significance level , \( \alpha \), is the probability used as a criterion for rejecting the null hypothesis. If the \( P \)-value is less than or equal to \( \alpha \), then the null hypothesis is rejected. If the \( P \)-value is greater than \( \alpha \), then the null hypothesis isnot rejected
Definition: A result is considered
statistically significant when \( P \)-value \( < \alpha \).
Definition: A result is considered
not statistically significant when \( P \)-value \( \geq \alpha \).
Can parents distinguish their own children by smell alone? To investigate, Porter and Moore (1981) gave new T-shirts to children of nine mothers. Each child wore his or her shirt to bed for three consecutive nights. During the day, from waking until bedtime, the shirts were kept in individually sealed plastic bags. No scented soaps or perfumes were used during the study. Each mother was then given the shirt of her child and that of another, randomly chosen child and asked to identify her own by smell. Eight of nine mothers identified their children correctly.
Discuss: Given \( \alpha = 0.05 \), \( \{H_{0}: \ p = 0.5\} \), and \( P \)-value of 0.04, what is the appropriate conclusion?
Answer: Reject \( H_{0} \). There is evidence that mothers consistently identify own children correctly by smell.
“We want to know if results are right, but a p-value doesn’t measure that. It can’t tell you the magnitude of an effect, the strength of the evidence or the probability that the finding was the result of chance.”
Christie Aschwanden
http://fivethirtyeight.com/pvalue
“Belief that "statistical significance” can alone discriminate between truth and falsehood borders on magical thinking.“
Cohen
Measure and report precision and effect size separately (the \( P \)-value is a summary measure that mixes them):