library(tidyverse)
library(readxl)
# Import the Excel file
data <- read_excel("../00_data/data/myData.xlsx")
data
## # A tibble: 9,355 × 12
## work_year job_title job_category salary_currency salary salary_in_usd
## <dbl> <chr> <chr> <chr> <dbl> <dbl>
## 1 2023 AI Architect Machine Learning… USD 305100 305100
## 2 2023 AI Architect Machine Learning… USD 146900 146900
## 3 2023 AI Architect Machine Learning… USD 330000 330000
## 4 2023 AI Architect Machine Learning… USD 204000 204000
## 5 2023 AI Architect Machine Learning… USD 330000 330000
## 6 2023 AI Architect Machine Learning… USD 204000 204000
## 7 2023 AI Architect Machine Learning… EUR 200000 215936
## 8 2023 AI Architect Machine Learning… USD 330000 330000
## 9 2023 AI Architect Machine Learning… USD 204000 204000
## 10 2023 AI Architect Machine Learning… USD 200000 200000
## # ℹ 9,345 more rows
## # ℹ 6 more variables: employee_residence <chr>, experience_level <chr>,
## # employment_type <chr>, work_setting <chr>, company_location <chr>,
## # company_size <chr>
My Question: How did the salary of Data Analysts change over the past three years?
analytics <- filter(data, job_category == "Data Analysis")
analytics
## # A tibble: 1,457 × 12
## work_year job_title job_category salary_currency salary salary_in_usd
## <dbl> <chr> <chr> <chr> <dbl> <dbl>
## 1 2023 BI Data Analyst Data Analysis USD 25000 25000
## 2 2023 BI Data Analyst Data Analysis USD 50000 50000
## 3 2023 BI Data Analyst Data Analysis USD 85000 85000
## 4 2023 BI Data Analyst Data Analysis USD 70000 70000
## 5 2023 BI Data Analyst Data Analysis USD 60000 60000
## 6 2023 BI Data Analyst Data Analysis EUR 67000 72338
## 7 2022 BI Data Analyst Data Analysis EUR 58000 60938
## 8 2022 BI Data Analyst Data Analysis EUR 100000 105066
## 9 2022 BI Data Analyst Data Analysis USD 57000 57000
## 10 2022 BI Data Analyst Data Analysis AUD 65000 45050
## # ℹ 1,447 more rows
## # ℹ 6 more variables: employee_residence <chr>, experience_level <chr>,
## # employment_type <chr>, work_setting <chr>, company_location <chr>,
## # company_size <chr>
ggplot(data = analytics, aes(x = work_year, y = salary_in_usd)) +
  geom_bar(stat = "identity")
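The bars get taller from year to year, which suggests that salaries in the Data Analysis category increased. However, geom_bar(stat = "identity") stacks the individual salaries, so each bar shows the total of all salaries recorded for that year, and its height also depends on how many records the year contains. I also couldn't figure out how to change the number format on the y-axis. Together, this makes the change in salary look massive, which is probably not the case.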
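
One possible way to address both points is to summarise the salaries per year before plotting and to format the axis labels with the scales package. The sketch below is only an illustration under assumptions: `analytics_demo` is a hypothetical stand-in for `analytics` with made-up values (not taken from myData.xlsx), and using the mean as the summary is my choice here; the median may be a better fit if the salaries are skewed.

```r
library(dplyr)
library(ggplot2)
library(scales)

# Hypothetical stand-in for the `analytics` tibble; the values are made up
# for illustration and are NOT taken from myData.xlsx.
analytics_demo <- tibble(
  work_year     = c(2021, 2021, 2022, 2022, 2022, 2023, 2023, 2023, 2023),
  salary_in_usd = c(60000, 80000, 70000, 90000, 110000,
                    80000, 100000, 120000, 140000)
)

# One row per year: mean and median salary plus the number of records
salary_by_year <- analytics_demo %>%
  group_by(work_year) %>%
  summarise(
    mean_salary   = mean(salary_in_usd),
    median_salary = median(salary_in_usd),
    n             = n()
  )

# geom_col() draws one bar per summarised row, so nothing is stacked;
# label_dollar() prints the axis values as plain dollar amounts.
p <- ggplot(salary_by_year, aes(x = factor(work_year), y = mean_salary)) +
  geom_col() +
  scale_y_continuous(labels = label_dollar()) +
  labs(x = "Work year", y = "Mean salary (USD)")
p
```

To try this on the real data, `analytics_demo` would be replaced with `analytics`. The `n` column shows how many records each year contributes, which helps judge how much weight a yearly average deserves.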