In RailTrail, the correlation between hightemp and cloudcover is quite small. Would you be sure that the two variables are not related at all? Create a scatter plot. After examining the scatter plot, would you conclude that the two variables are not related at all?

The data set is from a case-control study of smoking and Alzheimer’s disease. It has two variables of main interest:
- smoking: a factor with four levels “None”, “<10”, “10-20”, and “>20” (cigarettes per day)
- disease: a factor with three levels “Alzheimer”, “Other dementias”, and “Other diagnoses”

The largest group with Alzheimer’s is the non-smoking (“None”) group, as it has the largest width compared to the other groups.
The group that has more cases than expected under independence is the “Other dementias” group that smokes more than 20 cigarettes a day (“>20”). Its box is blue, indicating more cases than expected.
Smoking does not seem to determine Alzheimer’s, because the largest group of people with Alzheimer’s do not smoke at all. The many gray boxes point the same way: in those cells the observed counts are close to what independence would predict.
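
The shaded mosaic plot discussed above can be reproduced roughly as follows. This is a sketch under assumptions: the text does not name the data object or the plotting function, so the code assumes the `Alzheimer` data set from the coin package and `mosaic()` from the vcd package, and the variable order in the original plot may differ.

```r
# Assumption: data come from coin::Alzheimer and the plot from vcd::mosaic();
# neither is named in the text above.
library(vcd)
data("Alzheimer", package = "coin")

# Cross-tabulate smoking level by diagnosis
tab <- xtabs(~ smoking + disease, data = Alzheimer)

# shade = TRUE colours each cell by its Pearson residual under independence:
# blue = more cases than expected, red = fewer, gray = close to expected
mosaic(tab, shade = TRUE)

# Expected counts under independence, to compare with the observed counts
expected <- suppressWarnings(chisq.test(tab))$expected
round(tab - expected, 1)
```

Cells with a positive difference in the last table are the ones that hold more cases than independence would predict.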
Hint: The RailTrail data set is from the mosaicData package.
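
Following the hint, a minimal sketch of loading the data and checking the hightemp and cloudcover relationship from the earlier question might look like this. It uses base graphics for brevity; the smoother is an illustrative addition, not something the assignment asks for.

```r
# RailTrail ships with the mosaicData package
data("RailTrail", package = "mosaicData")

# Correlation coefficient between the two variables
r <- cor(RailTrail$hightemp, RailTrail$cloudcover)
r

# Scatter plot; a small correlation only rules out a strong *linear* trend,
# so a lowess smoother is added to reveal any non-linear pattern
plot(cloudcover ~ hightemp, data = RailTrail,
     xlab = "hightemp", ylab = "cloudcover")
lines(lowess(RailTrail$hightemp, RailTrail$cloudcover), col = "blue")
```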
## Q5 What variables have a positive correlation with the number of trail users (volume)?
The variables that have a positive correlation with the number of trail users are high temperature (hightemp) and average temperature (avgtemp). In the table, both are colored red in relation to volume.
The most popular season for trail users is summer. The variable for summer has a positive relation to volume, whereas fall has a negative relationship.
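
The colors in the correlation table can be cross-checked numerically. The sketch below assumes the season indicators are stored as numeric 0/1 columns, and it simply keeps every numeric column rather than reproducing the exact variable set used in the table.

```r
data("RailTrail", package = "mosaicData")

# Keep numeric columns only (temperatures, season indicators, volume, ...)
num <- RailTrail[sapply(RailTrail, is.numeric)]

# Correlation of every numeric variable with volume
cors <- cor(num, use = "complete.obs")[, "volume"]

# Drop volume's correlation with itself and sort from most positive to most negative
sort(cors[names(cors) != "volume"], decreasing = TRUE)
```

Positive entries correspond to the red cells in the volume row of the table, and negative entries to the cells of the opposite color.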
Hint: Use message, echo and results in the chunk options. Refer to the RMarkdown Reference Guide.
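
As a rough illustration of these three options, they can be set for the whole document in a setup chunk. The values below are only examples, not the settings the assignment requires.

```r
# Assumption: illustrative values only; choose what the assignment asks for
library(knitr)
opts_chunk$set(
  message = FALSE,    # hide messages such as package start-up notes
  echo    = TRUE,     # show the R source code in the output
  results = "markup"  # show printed results; "hide" would suppress them
)
```

The same options can be given for a single chunk in its header, for example `{r, message=FALSE, echo=FALSE, results='hide'}`.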