RailTrail.
hightemp and cloudcover is quite small. Would you be sure that the two variables are not related at all?
The data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest:
smoking: a factor with four levels “None”, “<10”, “10-20”, and “>20” (cigarettes per day)
disease: a factor with three levels “Alzheimer”, “Other dementias”, and “Other diagnoses”.
The largest group that has other dementias, the biggest grey box on the left, smokes no cigarettes.
One group that has more cases than expected given independence is the other dementias box. It is one of the larger boxes, and it is purple due to smoking over 20 cigarettes per day.
According to the mosaic chart, smoking doesn’t seem to matter in determining other dementias. The largest block in the other dementias row falls under the none column, so the largest group of people with other dementias doesn’t smoke at all.
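The comparison of observed and expected counts behind these answers can be sketched in R. This is a minimal sketch that assumes the data are the Alzheimer data frame from the coin package, which matches the description above but is not named in the text. The object names tab and indep are illustrative only.

```r
# Assumption: the data set described above is `Alzheimer` from the coin package.
data("Alzheimer", package = "coin")

# Cross-tabulate smoking (rows) by disease (columns).
tab <- xtabs(~ smoking + disease, data = Alzheimer)

# Chi-squared test of independence; it also returns the expected counts
# and the Pearson residuals for every cell.
indep <- chisq.test(tab)

# Expected counts if smoking and disease were independent.
round(indep$expected, 1)

# Pearson residuals: a positive value means more cases than expected.
round(indep$residuals, 2)

# Mosaic plot with cells shaded by their residuals.
mosaicplot(tab, shade = TRUE, main = "Smoking by disease")
```

Cells with large positive residuals are the ones with more cases than independence would predict. The box sizes in the plot reflect the raw counts, so a large box is not by itself evidence of an association.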
RailTrail.
Hint: The RailTrail data set is from the mosaicData package.
The four variables that have a negative correlation with the number of trail users are Spring, Fall, Precipitation and Cloud Cover.
Fall seems to be the least popular season for trail users.
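One way to check the correlations discussed above is to correlate every numeric column of RailTrail with the trail-user count. This is a minimal sketch that assumes the count is stored in the volume column. The names num and r are illustrative only.

```r
# RailTrail ships with the mosaicData package.
data("RailTrail", package = "mosaicData")

# Keep only the numeric columns, because cor() does not accept factors.
num <- RailTrail[sapply(RailTrail, is.numeric)]

# Correlation of every numeric variable with volume
# (assumed here to be the number of trail users).
r <- cor(num, use = "complete.obs")[, "volume"]

# Drop the trivial correlation of volume with itself.
r <- r[names(r) != "volume"]

# Sorted from most negative to most positive.
sort(r)

# Names of the variables that are negatively correlated with volume.
names(r)[r < 0]
```

A correlation coefficient only measures linear association, so a value close to zero does not rule out some other kind of relationship. A scatter plot of the two variables is a useful companion check.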
Hint: Use message, echo and results in the chunk options. Refer to the RMarkdown Reference Guide.
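As a rough illustration of the three options named in the hint, the sketch below sets them as document-wide defaults with knitr. The values are assumptions chosen for illustration, not the settings the exercise requires. The same option names can instead be written in the header of a single chunk to affect only that chunk.

```r
# Assumption: these values are illustrative, not the ones the exercise asks for.
knitr::opts_chunk$set(
  echo    = TRUE,     # show the R source code in the rendered document
  message = FALSE,    # suppress messages such as package start-up notes
  results = "markup"  # print results as ordinary output; "hide" suppresses them
)
```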