Assignment 3 - Storytelling with Open Data
Pay disparities among data science professionals based on factors such as Experience, Designation, Employment Type, etc.
Geetesh Sanjeev Matreja (s3931386)
Understanding the Data
1st variable refers to working year, and is in year format like “2021” “2022
2nd variable refers to designation, which says job profile
3rd variable refer to experience, which says the level of experience as “EN” = entry level, “MI” = mid level/intermediate, “SE” = senior level/expert, “EX” = executive level/Director.
4th variable refers to employment status as “FT”(full time), “PT”(part time), “CT”(contract), “FL”(freelance)
5th variable refers salary in Indian rupees.
6th variable refers location of an employee as IN = india, HK = hong kong,ES = spain, etc.
7th variable refers to office location as IN = india, PT = portugal,GB = united kingdom, etc.
8th variable refers company size as “s”(small - less than 50 employees), “m”(medium - 50-250 employees), “l”(large - more than 250 employees)
9th variable refers remote working ratio as “0”- no remote work, “50” - partially remote, “100” - fully remote
Average Pay as per Work Experience
workex
Average Pay as per Employment Status
(stat_graph)
Average Pay as per Company Size
size_graph
Average Pay as per Designation
des_plot
Reference
Aman Chauhan.(Sept,2022). Data science Fields Salary Categorization [Data Set]. Kaggle. Retrived from
https://www.kaggle.com/datasets/whenamancodes/data-science-fields-salary-categorization