M. Drew LaMar
February 3, 2021
“…a hypothesis test tells us whether the observed data are consistent with the null hypothesis, and a confidence interval tells us which hypotheses are consistent with the data.”
- William C. Blackwelder
Definition: Aparameter is a quantity describing a population, whereas anestimate orstatistic is a related quantity calculated from a sample.
Parameter examples: Averages, proportions, measures of variation, and measures of relationship
The two sides of the statistical coin:
Definition: Astatistical hypothesis is a specific claim regarding a population parameter.
Definition:Hypothesis testing uses data to evaluate evidence for or against statistical hypotheses.
The two sides of the statistical coin:
Example: A trapping study measures the rate of fruit fall in forest clear-cuts.
The two sides of the statistical coin:
Example: A clinical trial is carried out to determine whether taking large doses of vitamin C benefits health of advanced cancer patients.
…well, most of the time.
For your question, there is desired and undesired information in your data.
Goals:
Definition:Bias is a systematic discrepancy between the estimates we would obtain, if we could sample a population again and again, and the true population characteristic.
For your question, there is desired and undesired information in your data.
Goals:
Definition:Sampling error is the difference between an estimate and the population parameter being estimated caused by chance.
For your question, there is desired and undesired information in your data.
Goals:
“The aim … is to provide a clear and rigorous basis for determining when a causal ordering can be said to hold between two variables or groups of variables in a model…”
- H. Simon
The main assumptions of all statistical techniques is that your data come from a random sample.
Definition: In a
random sample , each member of a population has an equal and independent chance of being selected.
Random sampling
Definition: The
sampling distribution represents the distribution of the point estimatesbased on samples of a fixed size from a certain population. It is useful to think of a particular point estimate as being drawn from such a distribution. Understanding the concept of a sampling distribution is central to understanding statistical inference.
Definition: The standard deviation associated with an estimate is called the
standard error . It describes the typical error or uncertainty associated with the estimate.
The standard error is also the standard deviation of the sampling distribution.
http://www.zoology.ubc.ca/~whitlock/kingfisher/SamplingNormal.htm
Definition: The standard error represents the standard deviation associated with the estimate, and roughly 95% of the time the estimate will be within 2 standard errors of the parameter.
An approximate 95% confidence interval for a point estimate is given by \[ \textrm{point estimate} \pm 1.96\times SE \]
Note: For a yuge number of computed 95% confidence intervals, the population parameter will be contained in 95% of the confidence intervals.