RailTrail.hightemp and cloudcover is quite small. Would you be sure that the two variables are not related at all?The data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest:
smoking a factor with four levels “None”, “<10”, “10-20”, and “>20” (cigarettes per day)disease a factor with three levels “Alzheimer”, “Other dementias”, and “Other diagnoses”.## ── Attaching packages ───── tidyverse 1.3.0 ──
## ✓ ggplot2 3.3.0 ✓ purrr 0.3.3
## ✓ tibble 2.1.3 ✓ dplyr 0.8.5
## ✓ tidyr 1.0.2 ✓ stringr 1.4.0
## ✓ readr 1.3.1 ✓ forcats 0.5.0
## ── Conflicts ──────── tidyverse_conflicts() ──
## x dplyr::filter() masks stats::filter()
## x dplyr::lag() masks stats::lag()
## Loading required package: grid
The largest group that has other dementias had no cigarettes per day.
The group that had more cases than expected was the group that had >20 cigarettes per day and had other dementias.
It does not seem to matter when it comes to other dementias. The reason for this is because the largest group of people to get other dementias is did not smoke.
RailTrail.Hint: The RailTrail data set is from the mosaicData package.
Spring, fall, cloudcover, and precipitation all have a negative correlation with the number of trail user.
Fall seems to be the least popular season.
Hint: Use message, echo and results in the chunk options. Refer to the RMarkdown Reference Guide.