RailTrail.hightemp and cloudcover is quite small. Would you be sure that the two variables are not related at all?The data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest:
smoking a factor with four levels “None”, “<10”, “10-20”, and “>20” (cigarettes per day)disease a factor with three levels “Alzheimer”, “Other dementias”, and “Other diagnoses”.## smoking None <10 10-20 >20
## disease
## Alzheimer 126 15 30 27
## Other dementias 79 8 33 44
## Other diagnoses 104 5 47 20
The largest group that has other dementias smokes 0 cigarettes a day.
There were more cases than expected in other dementias where the person smokes more than 20 cigarettes per day.
I believe that smoking is more of a confounding variable and the correlation is based off of other sets of variables as the spread of cases in all of the diseases has a similar pattern with cigarettes per day.
RailTrail.Hint: The RailTrail data set is from the mosaicData package.
Spring, cloudcover, fall, and precipitation all have negative correlation with the number of trail users (volume).
The least popular season for trailusers is fall, at -0.25 correlation.
Hint: Use message, echo and results in the chunk options. Refer to the RMarkdown Reference Guide.