Aimee Schwab
June 10, 2013
One of my goals for this class is to improve your “statistical literacy” and critical thinking about facts and figures presented to you in everyday life. To do this, we need to start with a few definitions.
Statistics is the art and science of learning from data.
Statisticians go through a three-step process. We…
Population: the total set of subjects in which we are interested
Sample: the subset of the population for which we have (or plan to collect) data
Example: For each scenario, identify the population and describe the sample.
Random Sampling: a sampling method in which every member of the population has the same chance of being included in that sample.
There are lots of ways to get a random sample! Random samples tend to be representative of the population, so we can draw conclusions from them.
One way to draw conclusions from data is to calculate certain values.
Parameter: a numerical summary of the population
Statistic: a numerical summary of the sample
Ideally, we can use a sample statistic to help us find a “best estimate” for the population parameter.
Example: Identify whether the scenario is talking about a parameter or a statistic.
Example: Internet sites report that about 13% of Americans are left handed. Is this true at UNL? During a chemistry exam, the instructor walks around the room and counts 15 left-handed students out of 98 in the class. Identify the following:
Example: Internet sites report that about 13% of Americans are left handed. Is this true at UNL? During a chemistry exam, the instructor walks around the room and counts 15 left-handed students out of 98 in the class. Identify the following:
- variable of interest: proportion of people who are left-handed
- population: all Americans
- sample: students in UNL chemistry class
- parameter: 13%
- statistic: 15/98
- This is not a random sample of all Americans! We could possibly generalize these results to UNL students.
So how can we make conclusions from data?
Descriptive statistics: methods for summarizing data collected from a sample or a population.
Descriptive statistics allow us to summarize what's happening in our sample or population. If our sample is small and our population is large, uncertainty can become a problem!
Inferential statistics: methods of making decisions about a population, based on data obtained from a sample of that population.
In practice, both descriptive and inferential methods are used on the same data. Descriptive methods are incredibly straightforward, and most of you have used them before. The majority of our time will be spent on inference.