RailTrail.hightemp and cloudcover is quite small. Would you be sure that the two variables are not related at all? Create scatter plot. After examing the scatter plot, would you conclude that the two variables are not related at all?The data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest:
smoking a factor with four levels “None”, “<10”, “10-20”, and “>20” (cigarettes per day)disease a factor with three levels “Alzheimer”, “Other dementias”, and “Other diagnoses”.The largest group that has Alzheimer’s is the group that smokes no cigarettes per day. You can tell by looking at the graph and the largest box is the ones that do not smoke.
The group that has more cases than expected would be other dimentias (only bos with blue) with more than 20 cigarettes a day.
Smoking does not seem to be a determining factor in having Alzheimer because the largest group of people in the mosaic plot were the non-smokers. If smoking was a determining factor in having Alzeihmer’s, more people who smoke cigarettes would have Alzheimer’s.
RailTrail.Hint: The RailTrail data set is from the mosaicData package.
Four variables in the correlation plot have a positive correlation with the amount of trail users or “volume”. They are hightemp, lowtemp, avgtemp, summer because they have positive numbers above .0.
The correlation plot shows that the most popular season for the trail users is summer.Summer, out of all the seasons, has the highest correlation for volume at .23.
Hint: Use message, echo and results in the chunk options. Refer to the RMarkdown Reference Guide.