Language of Statistics

Prof. Eric A. Suess
March 28, 2017

Introduction

Statistics is about Learning From Data.

The four-step process by which we can learn from data:

  1. Define the Problem
  2. Collect the Data
  3. Summarize the Data
  4. Analyze the Data, Interpret the Analyses, and Communicate the Results

The language of Statistics

  • The vocabulary of Statistics can be difficult to learn because many common dictionary words are used as technical terms.
  • Examples: mean, variation, correlation, sample, population, random, hypothesis, confidence, and the list goes on.

  • We need to keep a list of the vocabulary so that we understand what is being asked in statistics-related questions.
  • Start with sample and population.

Population: the set of all measurements of interest

Sample: any subset of measurements selected from the population
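
The two definitions can be illustrated with a short R sketch. The lightbulb lifetimes here are simulated and purely hypothetical (the names population and bulb_sample, the mean of 1000 hours, and the sample size of 50 are assumptions made for illustration, not values from the course).

```r
# Hypothetical population: lifetimes (in hours) of 10,000 lightbulbs.
# These values are simulated for illustration only.
set.seed(1)
population <- rnorm(10000, mean = 1000, sd = 100)

# A sample: 50 measurements selected at random from the population,
# without replacement (the default for sample()).
bulb_sample <- sample(population, size = 50)

# In practice the population mean is usually unknown;
# the sample mean is used to estimate it.
mean(population)
mean(bulb_sample)
```

Every value in bulb_sample is also a value in population, which is what makes it a subset. Random selection is one way to choose a sample, though the definition above allows any subset.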

Example Problems

  1. Monitoring the quality of a lightbulb manufacturing facility. This is an example where Quality Control/Six Sigma is used.
  2. Relationship between quitting smoking and gaining weight. This is an example of Medical Research.
  3. What effect does nitrogen fertilizer have on wheat production? This is an example of Agricultural Research.
  4. Determining public opinion toward a question, issue, product, or candidate. This is an example of Polling.

Gallup Poll

Why study Statistics?

Slide With R Code

summary(cars)
     speed           dist       
 Min.   : 4.0   Min.   :  2.00  
 1st Qu.:12.0   1st Qu.: 26.00  
 Median :15.0   Median : 36.00  
 Mean   :15.4   Mean   : 42.98  
 3rd Qu.:19.0   3rd Qu.: 56.00  
 Max.   :25.0   Max.   :120.00  
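
As a rough sketch of where these numbers come from, the individual entries in the summary can be reproduced one at a time with base R functions. The variable names speed and dist below are just local copies of the two columns of the built-in cars data set.

```r
# cars ships with base R (datasets package): columns speed and dist.
speed <- cars$speed
dist  <- cars$dist

mean(speed)                            # 15.4, the Mean entry for speed
median(dist)                           # 36, the Median entry for dist
quantile(dist, probs = c(0.25, 0.75))  # 26 and 56, the 1st and 3rd quartiles
range(speed)                           # 4 and 25, the Min. and Max. for speed
```

This corresponds to step 3 of the process, Summarize the Data: each column is reduced to a handful of numbers that describe its center and spread.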

Slide With Plot

[Figure: plot produced by R chunk unnamed-chunk-2; image not available]