summary(cars)
## speed dist
## Min. : 4.0 Min. : 2.00
## 1st Qu.:12.0 1st Qu.: 26.00
## Median :15.0 Median : 36.00
## Mean :15.4 Mean : 42.98
## 3rd Qu.:19.0 3rd Qu.: 56.00
## Max. :25.0 Max. :120.00
library(tidyverse)
## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
## ✔ dplyr 1.1.4 ✔ readr 2.1.5
## ✔ forcats 1.0.0 ✔ stringr 1.5.1
## ✔ ggplot2 3.5.1 ✔ tibble 3.2.1
## ✔ lubridate 1.9.4 ✔ tidyr 1.3.1
## ✔ purrr 1.0.2
## ── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
## ✖ dplyr::filter() masks stats::filter()
## ✖ dplyr::lag() masks stats::lag()
## ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
Students must formulate a plausible and testable hypothesis from the data they have selected. Deliverables will include a null hypothesis and an alternative hypothesis that can plausibly explain the data in question. Details on how to craft a testable hypothesis will be discussed in class. This assignment can be submitted via Rmarkdown or word document.
Define variables:
Dependent variable (the outcome you are interested in) (y) overdose rates ( measured by incidents per 100,000 population).
Independent variable(s) (the variables which you think affect the outcome) (x) educational level including (graduate,high school or less, some college credit but no degree, undergraduate, unknown). Additional variable to be consider ethnicity including (Hispanic, non Hispanic Black, non Hispanic White,and non Hispanic Other).
Define your hypotheses.: In Bexar County overdose rates vary significantly across different racial groups, with socioeconomic factors such as level of education mediating this relationship.
Null hypothesis - there is NO EFFECT of y on x There is no difference in overdose rates (y) across education level, racial groups (x)
Alternative hypothesis - there is SOME EFFECT of y on x There is a notable difference in overdose rates (Y) across at least one or more racial groups, across one more education level (x) The lower the level of education (x), the higher the chance of overdose.(Y)
Give me a few good paragraphs explaining your hypothesis and identifying your variables.
Hypothesis: In Bexar County overdose rates vary significantly across different racial groups, with socioeconomic factors such as level of education mediating this relationship. To explain this hypothesis I believe the data will show that the lower the level of education the higher the over dose rate. Comparison of the overrates among races (Hispanic, non Hispanic Black, non Hispanic White,and non Hispanic Other) further supports the notion of socioeconomic factors contributing to overall drug use or overdose within Bexar County. This dataset is courtesy of DSHS.