Import data

# excel file
data <- read_excel("../00_data/data/myData.xlsx")
data
## # A tibble: 9,355 × 12
##    work_year job_title    job_category      salary_currency salary salary_in_usd
##        <dbl> <chr>        <chr>             <chr>            <dbl>         <dbl>
##  1      2023 AI Architect Machine Learning… USD             305100        305100
##  2      2023 AI Architect Machine Learning… USD             146900        146900
##  3      2023 AI Architect Machine Learning… USD             330000        330000
##  4      2023 AI Architect Machine Learning… USD             204000        204000
##  5      2023 AI Architect Machine Learning… USD             330000        330000
##  6      2023 AI Architect Machine Learning… USD             204000        204000
##  7      2023 AI Architect Machine Learning… EUR             200000        215936
##  8      2023 AI Architect Machine Learning… USD             330000        330000
##  9      2023 AI Architect Machine Learning… USD             204000        204000
## 10      2023 AI Architect Machine Learning… USD             200000        200000
## # ℹ 9,345 more rows
## # ℹ 6 more variables: employee_residence <chr>, experience_level <chr>,
## #   employment_type <chr>, work_setting <chr>, company_location <chr>,
## #   company_size <chr>

State one question

My Question: How did the salary of Data Analysts change over the past three years.

Plot data

analytics <- filter(data, job_category == "Data Analysis")
analytics
## # A tibble: 1,457 × 12
##    work_year job_title       job_category  salary_currency salary salary_in_usd
##        <dbl> <chr>           <chr>         <chr>            <dbl>         <dbl>
##  1      2023 BI Data Analyst Data Analysis USD              25000         25000
##  2      2023 BI Data Analyst Data Analysis USD              50000         50000
##  3      2023 BI Data Analyst Data Analysis USD              85000         85000
##  4      2023 BI Data Analyst Data Analysis USD              70000         70000
##  5      2023 BI Data Analyst Data Analysis USD              60000         60000
##  6      2023 BI Data Analyst Data Analysis EUR              67000         72338
##  7      2022 BI Data Analyst Data Analysis EUR              58000         60938
##  8      2022 BI Data Analyst Data Analysis EUR             100000        105066
##  9      2022 BI Data Analyst Data Analysis USD              57000         57000
## 10      2022 BI Data Analyst Data Analysis AUD              65000         45050
## # ℹ 1,447 more rows
## # ℹ 6 more variables: employee_residence <chr>, experience_level <chr>,
## #   employment_type <chr>, work_setting <chr>, company_location <chr>,
## #   company_size <chr>
ggplot(data = analytics, aes(x = work_year, y = salary_in_usd)) +
    geom_bar(stat = "identity")

Interpret

Obviously, there was an increase in Salary in the Data Analytics category. However, I couldn’t figure out how to change the number format so it looks like there is a massive change in salary what’s probably not the case.