M. Drew LaMar
September 5, 2018
“Numerical quantities focus on expected values, graphical summaries on unexpected values.”
- John Tukey
Quote: We don’t include men in our calculation because they are not at risk for cervical cancer and by the same measure, we shouldn’t include women who don’t have a cervix.
- Anne F. Rositch
Quote: The researchers found that black women have a mortality rate of 10.1 per 100,000. For white women, the rate is 4.7 per 100,000. Past estimates had those rates at 5.7 and 3.2, respectively. The new death rate for black women in the US is on par with that of developing countries.
Discuss: Does this warrant the “much deadlier” headline?
Discuss: Why do you think the death rate for black women in the US is higher?
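As a starting point for the discussion, the rates quoted above can be compared directly. The sketch below uses only the four numbers from the quote; the variable names `new_rate` and `past_rate` are chosen here for illustration. It suggests the black-to-white ratio moved from roughly 1.8 to roughly 2.1, but it says nothing about why.

```r
# Mortality rates per 100,000 women, taken from the quote above
new_rate  <- c(black = 10.1, white = 4.7)
past_rate <- c(black = 5.7,  white = 3.2)

# Ratio of black to white mortality under each set of estimates
ratio_new  <- new_rate[["black"]] / new_rate[["white"]]
ratio_past <- past_rate[["black"]] / past_rate[["white"]]

# Percent change in each group's rate from the past to the new estimate
pct_change <- 100 * (new_rate - past_rate) / past_rate

print(round(c(new = ratio_new, past = ratio_past), 2))
print(round(pct_change, 1))
```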
Definition:
Variables are characteristics that differ among objects of interest.
Definition:
Data are the measurements of one or more variables made on a sample of objects of interest.
Data, essentially, is any measurement of the real world since
Categorical variable (qualitative)
Remember the `factor` data type in R?
Numerical variable (quantitative)
Remember the `numeric` data type in R?
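A minimal sketch of how the two variable types map onto R data types. The values of `species` and `mass_g` are invented for illustration only.

```r
# Hypothetical measurements on four birds (values invented for illustration)
species <- factor(c("robin", "sparrow", "robin", "finch"))  # categorical
mass_g  <- c(77.2, 24.1, 80.5, 21.3)                        # numerical

print(class(species))   # the R class of a categorical variable
print(levels(species))  # the distinct categories
print(class(mass_g))    # the R class of a numerical variable

# Arithmetic summaries make sense for numerical variables;
# counts per category make sense for categorical ones
print(mean(mass_g))
print(table(species))
```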
Discuss: Would the fraction of birds in a large sample infected with avian flu virus be a discrete or continuous numerical variable?
Answer: Neither! The variable of interest here is actually categorical (nominal). Why?
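One way to see the distinction is to write down what is recorded for each bird. In the sketch below, the `status` values are hypothetical. Each bird contributes one category, and the fraction infected only appears once the whole sample is summarized.

```r
# Hypothetical sample of 8 birds (status values invented for illustration)
status <- factor(c("infected", "healthy", "healthy", "infected",
                   "healthy", "healthy", "healthy", "infected"))

# The variable measured on each individual bird is categorical
print(levels(status))

# The fraction infected describes the sample as a whole,
# not any single bird
frac_infected <- mean(status == "infected")
print(frac_infected)
```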
Ask yourself the following questions:
Frequency distributions of univariate data
Type of data | Graphical method |
---|---|
Categorical | Bar graph |
Numerical | Histogram |
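
Both univariate graphs in the table are available in base R. The sketch below uses simulated data; `habitat` and `wing_mm` are hypothetical variables, not from any real study.

```r
# Simulated data for illustration only
set.seed(1)
habitat <- factor(sample(c("forest", "grassland", "wetland"),
                         size = 50, replace = TRUE))
wing_mm <- rnorm(50, mean = 70, sd = 5)

# Categorical variable: tally the categories, then draw a bar graph
counts <- table(habitat)
barplot(counts, xlab = "Habitat", ylab = "Frequency")

# Numerical variable: a histogram bins the values and plots bin counts
h <- hist(wing_mm, xlab = "Wing length (mm)", main = "")
```

Note that `barplot()` needs the tallied counts, while `hist()` takes the raw values and does the binning itself.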
Showing association of bivariate data
Type of data | Graphical method |
---|---|
Two numerical | Scatter plot |
Two numerical | Line plot |
Two numerical | Map |
Two categorical | Grouped bar graph |
Two categorical | Mosaic plot |
Mixed | Strip chart |
Mixed | Box plot |
Mixed | Multiple histograms |
Mixed | Cumulative frequency distributions |
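
Several of these bivariate graphs can be sketched in base R. The example uses the built-in `mtcars` data set purely for convenience, treating `cyl` and `am` as categorical variables; the choice of data set and variables is an assumption made here, not part of the lecture.

```r
# Two numerical variables: scatter plot
plot(mpg ~ wt, data = mtcars,
     xlab = "Weight (1000 lb)", ylab = "Miles per gallon")

# Two categorical variables: cross-tabulate first, then plot
tab <- table(cylinders = mtcars$cyl, transmission = mtcars$am)
barplot(tab, beside = TRUE, legend.text = TRUE,
        xlab = "Transmission (0 = automatic, 1 = manual)",
        ylab = "Frequency")          # grouped bar graph
mosaicplot(tab, main = "")           # mosaic plot

# Mixed (one numerical, one categorical): strip chart and box plot
stripchart(mpg ~ cyl, data = mtcars, method = "jitter", vertical = TRUE,
           xlab = "Cylinders", ylab = "Miles per gallon")
boxplot(mpg ~ cyl, data = mtcars,
        xlab = "Cylinders", ylab = "Miles per gallon")
```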
Data visualization is one step in exploratory data analysis.
Quote: …the first step in any data analysis or statistical procedure is to graph the data and look at it. Humans are a visual species, with brains evolved to process visual information. Take advantage of millions of years of evolution, and look at visual representations of your data before doing anything else.
- Whitlock & Schluter
To graph data, you will most likely need to manipulate it first to put it in the right form.
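A minimal sketch of that kind of manipulation, assuming raw data with one row per observation. The data frame `obs` and its contents are hypothetical, and ordering the bars by frequency is a presentation choice made here, not a requirement.

```r
# Hypothetical raw data: one row per observed animal
obs <- data.frame(
  species = c("deer", "fox", "deer", "hare", "deer", "fox"),
  stringsAsFactors = TRUE
)

# Step 1: tally the rows into a frequency table
freq <- table(obs$species)

# Step 2: order the categories by decreasing frequency
freq <- sort(freq, decreasing = TRUE)

# Step 3: the table is now in the form barplot() expects
barplot(freq, xlab = "Species", ylab = "Frequency")
```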