Data Visualization

M. Drew LaMar
January 29, 2016

alt text
https://xkcd.com/833/

Course Announcements

In the News - Schizophrenia's Cause?

alt text Penzes P, Cahill ME, Jones KA, VanLeeuwen J-E, Woolfrey KM. Dendritic spine pathology in neuropsychiatric disorders. Nat Neurosci. 2011 Mar;14(3):285–93.

In the News - Schizophrenia's Cause?

“Schizophrenia’s strongest genetic association…arises in part from many structurally diverse alleles of the complement component 4 (C4) genes. We found that these alleles generated widely varying levels of C4A and C4B expression in the brain, with each common C4 allele associating with schizophrenia in proportion to its tendency to generate greater expression of C4A… In mice, C4 mediated synapse elimination during postnatal development.”

Sekar A et al. Schizophrenia risk from complex variation of complement component 4. Nature. 2016 Jan 27.

In the News - Schizophrenia's Cause?

alt text Sekar A et al. Schizophrenia risk from complex variation of complement component 4. Nature. 2016 Jan 27.

Why is data visualization important?

Communicating with data visualization

Data is beautiful!

Data is ugly!

What is data?

Definition: Variables are characteristics that differ among objects of interest.

Definition: Data are the measurements of one or more variables made on a sample of objects of interest.


Data, essentially, is any measurement of the real world since

  • \( n=1 \) counts as a sample,
  • variables can technically have only one possible value (i.e. no variation)

Types of data

  • Categorical variable (qualitative)
    • Nominal (levels have no inherent ordering)
    • Ordinal (levels have an inherent ordering)
  • Numerical variable (quantitative)
    • Continuous
    • Discrete

Types of data (Class discussion)

Discuss: Would the fraction of birds in a large sample infected with avian flu virus be a discrete or continuous numerical variable?

Plots and data types

Frequency distributions of univariate data

Type of data Graphical method
Categorical Bar graph
Numerical Histogram

Showing association of bivariate data

Type of data Graphical method
Two numerical Scatter plot
Line plot
Map
Two categorical Grouped bar graph
Mosaic plot
Mixed Strip chart
Box plot
Multiple histograms
Cumulative frequency distributions