RailTrail.hightemp and cloudcover is quite small. Would you be sure that the two variables are not related at all?The data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest:
smoking a factor with four levels “None”, “<10”, “10-20”, and “>20” (cigarettes per day)disease a factor with three levels “Alzheimer”, “Other dementias”, and “Other diagnoses”.The largest group that has other dementias were non-smokers, 0 cigarettes per day.
One group that has more cases than expected given chance were people with other dementias who smoked more than 20 cigarettes per day.
After examining the plot, I can say that smoking does not matter in determining other dementias. There is only one group of people where there are more cases than expected which were people with other dementias who smoked more than 20 gigarettes per day. And the sample size is too small anyway so nothing is conclusive.
RailTrail.Hint: The RailTrail data set is from the mosaicData package.
The four variables that have negative correlation with the number of trail users are Spring, Fall, Clodcover, and Precipitation.
The Fall seems to be least popular for the trail users.
Hint: Use message, echo and results in the chunk options. Refer to the RMarkdown Reference Guide.