##Import Data

library(tidyverse)
## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.2 ──
## ✔ ggplot2 3.4.0     ✔ purrr   1.0.1
## ✔ tibble  3.1.8     ✔ dplyr   1.1.0
## ✔ tidyr   1.3.0     ✔ stringr 1.5.0
## ✔ readr   2.1.3     ✔ forcats 1.0.0
## ── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
## ✖ dplyr::filter() masks stats::filter()
## ✖ dplyr::lag()    masks stats::lag()
data <- read_csv("../00_data/Mydata.csv") %>%
    janitor::clean_names()
## New names:
## Rows: 41152 Columns: 17
## ── Column specification
## ──────────────────────────────────────────────────────── Delimiter: "," chr
## (11): Name, Sex, Event, Equiptment, Age Class, Division, Weight Class KG... dbl
## (5): Age, Bodyweight, Best Sqaut KG, Best Bench KG, Best Deadlifting lgl (1):
## ...17
## ℹ Use `spec()` to retrieve the full column specification for this data. ℹ
## Specify the column types or set `show_col_types = FALSE` to quiet this message.
## • `` -> `...17`

Explain Data and Variables:

The data I chose is the max bench press for male and female power lifters in different weight classes. The variables that will go the code below are “Sex”, “Best Bench KG” “Bodyweight”, “Age”.

What is the question I am looking to answer?

What factor be it age, sex or body weight had the biggest impact on average bench press strength.

Below is a scatter plot of the ammount someone weighs vs their strength both male and female.

ggplot(data, aes(x = weight_class_kg, y = best_bench_kg)) +
    geom_point() +
    labs(title = "Powerlifting Data: Bodyweight vs. Bench")
## Warning: Removed 2462 rows containing missing values (`geom_point()`).

Below is a chart that shows the ammount males can bench vs females.

ggplot(data, 
       aes(x = best_bench_kg, 
           fill = sex)) +
  geom_density(alpha = 0.4) +
  labs(title = "Male Vs Female Bench Press")
## Warning: Removed 2462 rows containing non-finite values (`stat_density()`).

Below is a scatter ploy of the age class vs bench press strength of both males and females.

ggplot(data, 
       aes(x = best_bench_kg, 
           y = age_class)) +
  geom_point()
## Warning: Removed 2462 rows containing missing values (`geom_point()`).

Interpret Data.

I would have to say that sex has the biggest impact on an individuals strength. Males had significantly more density when compared to female. The next biggest variable is age. There was a bell curve of sorts where the maxium bench poress was hit around the ages of 24-34 and the least strong being 13-15 and 80+. Bodyweight had a pretty preportionate corolation and that seemed to be the least big factor. There is a saying in the weight lifting comunity that “mass moves mass” so this chart just helped solidify that point.