- Let’s consider two R Built-in Data Sets: women and chickens
- The 1st data frame contains only 15 observations, the 2nd contains 578 observations.
- How will the plots differ? How does sample size effect our ability to draw conclusions?
2024-11-17
\(\mu=\int xf(x) dx\)
\(\sigma=\sqrt{V(X)}=\sqrt{\sum_{i} P_{i}(x_{i} - \mu)^2}=\sqrt{\frac{\sum_{i} (x_{i} - \mu)^2}{n}} \\ \sigma=\sqrt{V(X)}=\sqrt{\int (x-\mu)^{2}f(x) dx}\)
stand_dev_women = sd(women$weight, na.rm =TRUE) stand_dev_women
[1] 15.49869
stand_dev_chickens = sd(ChickWeight$weight, na.rm =TRUE) stand_dev_chickens
[1] 71.07196