The data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest:

## ── Attaching packages ─────────────────── tidyverse 1.3.0 ──
## ✓ ggplot2 3.3.0     ✓ purrr   0.3.3
## ✓ tibble  2.1.3     ✓ dplyr   0.8.5
## ✓ tidyr   1.0.2     ✓ stringr 1.4.0
## ✓ readr   1.3.1     ✓ forcats 0.5.0
## ── Conflicts ────────────────────── tidyverse_conflicts() ──
## x dplyr::filter() masks stats::filter()
## x dplyr::lag()    masks stats::lag()
## Loading required package: grid

Q1 Describe the largest group that has other dementias. Discuss it by number of cigarettes per day.

The largest group of all dementias is non smoker alzhhiemers. The plot is larger than the smoking alzhiemers patients ## Q2 Describe one group that has more cases than expected given independence (by chance). Discuss it by number of cigarettes per day.

People with dimentia that smoke more than 20 cigarettes per day.

Q3 Does smoking seem to matter in determining other dementias? Discuss your reason using the masaic chart above.

If you smoke you seem to be less likely to have alzhimers, as the majority of alzhiemers patients do not smoke

Q4 Create correlation plot for RailTrail.

Hint: The RailTrail data set is from the mosaicData package.

Q5 List all four variables that have negative correlation with the number of trail users (volume).

High, average, low tenp and summer have high correlation

Q7 The correlation coefficient between hightemp and cloudcover is quite small. Would you be sure that

the two variables are not related at all? Hint: One word answer (e.g., yes or no) is NOT enough. Explain why. Yes, this is most likely because its not usually cloudy when its hot because clouds burn off.

Q8 Hide the messages, the code and its results on the webpage.

Hint: Use message, echo and results in the chunk options. Refer to the RMarkdown Reference Guide.

Q9 Display the title and your name correctly at the top of the webpage.

Q10 Use the correct slug.