Sheila Weaver
6 July 2020
First, what is Big Data?
- Big Data is the latest ‘information explosion.’
- The printing press was probably the first major one.
- Took 300 years for the world to 'settle down' after its invention.
- We're still settling down with big data…
Here's one idea:
Who is Antonie Van Leeuwenhoek ?
We know that Standardized Test Scores usually have a bell curve:
library(tidyverse)
mpg %>% group_by(class) %>%
summarise(mileage = mean(hwy))
# A tibble: 7 x 2
class mileage
<chr> <dbl>
1 2seater 24.8
2 compact 28.3
3 midsize 27.3
4 minivan 22.4
5 pickup 16.9
6 subcompact 28.1
7 suv 18.1
ggplot(data = mpg,
mapping = aes(x = hwy, fill=class)) +
geom_density() + facet_grid(class~.) +
theme(legend.title = element_text(size=18),
legend.text = element_text(size = 16),
strip.text.y = element_blank())