RailTrail.
hightemp and cloudcover is quite small. Would you be sure that the two variables are not related at all?

The data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest:
smoking: a factor with four levels “None”, “<10”, “10-20”, and “>20” (cigarettes per day)
disease: a factor with three levels “Alzheimer”, “Other dementias”, and “Other diagnoses”.

The largest group in the “other dementias” category is non-smokers.
The group that has more cases than expected given independence is people in the “other dementias” category who smoke more than 20 cigarettes a day.
I would say no, smoking does not seem to matter in determining “other dementias”. The numbers of people who smoke and who do not smoke are roughly even, which makes it hard to find a strong relationship between smoking and other dementia cases.
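The comparison between observed counts and the counts expected under independence can be sketched in base R. This is a minimal sketch, not the original solution: `expected_under_independence()` and `excess_over_independence()` are hypothetical helpers, and the `toy` table uses made-up counts purely to show the call. With the real data, the input would be the cross-tabulation of smoking by disease.

```r
# Hypothetical helper: expected cell counts if rows and columns were independent.
# expected[i, j] = (row total i) * (column total j) / (grand total)
expected_under_independence <- function(tab) {
  tab <- as.matrix(tab)
  outer(rowSums(tab), colSums(tab)) / sum(tab)
}

# Observed minus expected: positive cells hold more cases than independence predicts.
excess_over_independence <- function(tab) {
  as.matrix(tab) - expected_under_independence(tab)
}

# Made-up counts, for illustration only (NOT the study data).
toy <- matrix(c(30, 10,
                20, 40),
              nrow = 2, byrow = TRUE,
              dimnames = list(group = c("A", "B"), outcome = c("yes", "no")))

print(expected_under_independence(toy))
print(excess_over_independence(toy))

# The same question posed as a formal test of independence (base R).
print(chisq.test(toy, correct = FALSE))
```

Cells with a large positive difference are the ones with more cases than independence would predict. Whether the overall pattern is strong enough to matter is what the chi-squared test addresses.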
RailTrail.
Hint: The RailTrail data set is from the mosaicData package.
The four variables that are negatively correlated with the number of trail users (the volume) are the spring season, the fall season, cloudcover, and precipitation.
Fall seems to be the least popular season for trail users: of the seasons, it has the strongest negative correlation with volume in our data.
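One way to list the variables that are negatively correlated with volume is sketched below. `cor_with()` is a hypothetical helper, not part of mosaicData. The sketch assumes the package is installed and that `volume` is a numeric column of RailTrail, and it skips the data step otherwise.

```r
# Hypothetical helper: correlation of every numeric column with one target column.
cor_with <- function(df, target) {
  num <- df[vapply(df, is.numeric, logical(1))]   # keep numeric columns only
  others <- setdiff(names(num), target)
  cors <- vapply(others, function(v) cor(num[[v]], num[[target]]), numeric(1))
  sort(cors)                                      # most negative first
}

# Assumption: mosaicData is installed and RailTrail has a numeric `volume` column.
if (requireNamespace("mosaicData", quietly = TRUE)) {
  data("RailTrail", package = "mosaicData")
  cors <- cor_with(RailTrail, "volume")
  print(cors[cors < 0])   # variables negatively correlated with volume
}
```

Because the result is sorted, the first entry is the variable with the most negative correlation.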
Hint: Use message, echo and results in the chunk options. Refer to the RMarkdown Reference Guide.
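As a rough illustration of the three options named in the hint, the defaults can be set once with knitr in a setup chunk. The values below are assumptions chosen for illustration, not the settings the exercise requires.

```r
# Assumption: this runs in a setup chunk near the top of the .Rmd file.
knitr::opts_chunk$set(
  echo    = TRUE,   # show the R code in the rendered document
  message = FALSE   # hide messages such as package start-up notes
)

# `results` is usually set per chunk rather than globally, e.g. results = "hide"
# in a chunk header runs the code but suppresses its printed output.
```

The same options can also be given in an individual chunk header, where they override these defaults for that chunk only.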