RailTrail.hightemp and cloudcover is quite small. Would you be sure thatThe data set is from a case-control study of smoking and Alzheimer’s disease. The data set has two variables of main interest:
smoking a factor with four levels “None”, “<10”, “10-20”, and “>20” (cigarettes per day)disease a factor with three levels “Alzheimer”, “Other dementias”, and “Other diagnoses”.## ── Attaching packages ─────────────────── tidyverse 1.3.0 ──
## ✓ ggplot2 3.3.0 ✓ purrr 0.3.3
## ✓ tibble 2.1.3 ✓ dplyr 0.8.5
## ✓ tidyr 1.0.2 ✓ stringr 1.4.0
## ✓ readr 1.3.1 ✓ forcats 0.5.0
## ── Conflicts ────────────────────── tidyverse_conflicts() ──
## x dplyr::filter() masks stats::filter()
## x dplyr::lag() masks stats::lag()
## Loading required package: grid
The largest group of all dementias is non smoker alzhhiemers. The plot is larger than the smoking alzhiemers patients ## Q2 Describe one group that has more cases than expected given independence (by chance). Discuss it by number of cigarettes per day.
People with dimentia that smoke more than 20 cigarettes per day.
If you smoke you seem to be less likely to have alzhimers, as the majority of alzhiemers patients do not smoke
RailTrail.Hint: The RailTrail data set is from the mosaicData package.
High, average, low tenp and summer have high correlation
spring is the least popular seeason for the trail
hightemp and cloudcover is quite small. Would you be sure thatthe two variables are not related at all? Hint: One word answer (e.g., yes or no) is NOT enough. Explain why. Yes, this is most likely because its not usually cloudy when its hot because clouds burn off.
Hint: Use message, echo and results in the chunk options. Refer to the RMarkdown Reference Guide.