RailTrail.hightemp and cloudcover is quite small. Would you be sure that the two variables are not related at all?The data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest:
smoking a factor with four levels “None”, “<10”, “10-20”, and “>20” (cigarettes per day)disease a factor with three levels “Alzheimer”, “Other dementias”, and “Other diagnoses”.The largest group that has other dimentias consists of non-smokers who smoke zero cigarettes per day.
One group that has more cases than expected, given independence, would be smokers who smoke more than 20 cigarettes per day.
In determining other dimentias, smoking does not seem to matter. The mosaic chart supports this theory because it displays non-smokers, who smoke zero cigarettes per day, as the largest group.
RailTrail.Hint: The RailTrail data set is from the mosaicData package.
The four variables that have a negative correlation with the number of trail users in volume include the following: spring (-0.04), fall (-0.25), cloudcover (-0.37), and precipitation (-0.23).
The least popular season for trail users is Fall due to it having the largest negative correlation with users in comparison to the other seasons.
Hint: Use message, echo and results in the chunk options. Refer to the RMarkdown Reference Guide.