The College Scorecard is a dataset collected and published by the US Department of Education. Each row of the data is a post-secondary educational institution, and each column describes something about that institution.
Here's a summary of what I find interesting about the data.
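```{r summary statistics, message=FALSE, warning=FALSE}
library(dplyr)
library(ggplot2)

# Headline maxima; na.rm = TRUE skips institutions with missing values
scorecard %>%
  summarise(`most expensive tuition` = max(COSTT4_A, na.rm = TRUE),
            `most students` = max(as.numeric(UGDS), na.rm = TRUE),
            `highest acceptance rate` = max(ADM_RATE, na.rm = TRUE))

# Distribution of the MARRIED column across institutions
scorecard %>%
  ggplot(aes(x = MARRIED)) +
  geom_histogram(bins = 50) +
  labs(title = "")
```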
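
To show why the summary needs `na.rm = TRUE` and `as.numeric()`, here's a small sketch on a made-up data frame. The `toy` data and its values are invented for illustration and are not taken from the Scorecard. I'm also assuming, as the `as.numeric(UGDS)` call suggests, that some columns may have been read in as text.

```r
# Toy stand-in for `scorecard`; all values below are invented for illustration.
toy <- data.frame(
  INSTNM   = c("College A", "College B", "College C"),
  COSTT4_A = c(12000, NA, 30500),
  UGDS     = c("1500", "NULL", "22000"),  # numbers stored as text, one entry not a number
  stringsAsFactors = FALSE
)

# Without na.rm = TRUE, a single missing value makes the whole result NA.
max(toy$COSTT4_A)

# With na.rm = TRUE, missing values are dropped before taking the maximum.
max(toy$COSTT4_A, na.rm = TRUE)

# as.numeric() turns text that is not a number into NA (with a warning),
# so na.rm = TRUE is still needed after the conversion.
ugds_num <- suppressWarnings(as.numeric(toy$UGDS))
max(ugds_num, na.rm = TRUE)
```

The same pattern is what each `max(..., na.rm = TRUE)` inside `summarise()` relies on.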