The aim of our project is to investigate Data Science Jobs in the US. We are interested in the topic since we are studying to become Data Scientists, and therefore, would like to know more about it. We are going to use the data about Data Science Job Posting on Glassdoor. It was collected by web-scraping job posts from Glassdoor for data science jobs.
We have decided to start our analysis with a graph that could give a sort of general overview of Data Science Jobs (and related) and salaries. From the graph above, it can be seen that being a Manager is definitely the position that, on average, pays the most. Data Architect immediately follows with an average yearly salary of $200,000. Contrary to our expectation, Data Modeler does not earn that much and can be found at almost the end of the ranking.
In this part of the project, we are going to analyze which sectors have the highest salaries. We think this is a useful because it highlights the trend that, even in the field of data science, there can be significant differences in salary depending on the field that an individual may choose to go into.
The above graph shows the average salary for data-related job postings across different sectors. Highest paying sectors are Media and Retail. However, the total number of openings from those sectors in the data set are only 5 and 7 respectively. The sector with the highest opportunity for data related jobs seem to be Business Services and Information Technology, because, despite being on the average salary range across sectors, they have the highest number of openings and the biggest job opportunity.
After exploring the relationship between sectors and salaries, we next try to assess the popularity of data science roles, and how it differentiates by state. We are assuming that Google search trends for data science related search terms (such as data modeler, data architect and data engineer) are an effective proxy for interest. We then compare how interest in data science roles, as measured by Google trends, is related to the actual number of job openings. For this purpose, we pulled Google Search trends for the following job titles across US states. Google trends provide a relative number of search popularity, not the absolute number of total searches.
We mostly see a strong correlation between the interest in job titles (as per Google search numbers) and the available jobs per state. California, without any surprise, is where the highest number of job openings and biggest interest is at. Interestingly, Virginia comes second in terms of openings and interest, rather than NY.
Another important insight is that there are many cases where the interest is not met by the available jobs. Texas, Illinois and Florida have very high Google search numbers for Data Science jobs but in terms of the number of job postings, they are in the smallest range.
The last bit of insight this graph shows is the interest for different job titles. We see that Data Analyst and Data Scientist searches on Google are quite high, on a scale of 0-50, while for Machine Learning Engineer and Data Engineer, this relative spectrum is quite smaller.
After analyzing the statewide breakdown of interest and availability of data science jobs, we thought it would be interesting to visualize on a map which cities and states have the most job openings and highest salaries.
Note: grey states have no openings
From the first map above we can see that the States with the more openings are California, Virginia, and Massachusetts. City-wide, the jobs are well distributed over the Country. However, the city with more openings are San Francisco with 69 and New York with 50. In the third place comes Washington D.C. with just 26 openings. Nonetheless, it has to be taken into consideration that the openings in the dataset are community specific. This means that there are a lot of openings in the surrounding areas of the big cities that are not, however, counted as if in their metro areas. Zooming in to San Francisco gives a clearer idea. About thirty-plus areas with openings surround the San Francisco’s metropolitan area. Santa Clara has 9 openings, Redwood City has 7, San Jose 4, Cupertino 3, and many more.
## [1] "The median salary in the dataset is:"
## [1] 115.4