Tidy up the LinkedIn Dataset

Load the LinkedIn CSV file into R. Below are the things we need to tidy:

Matrix of raw file

The leading and trailing white spaces

Linkedin <- read.csv('https://raw.githubusercontent.com/suswong/dataset-version-2/main/Job%20details%20by%20search_LinkedIn%20version%202(1).csv')

head(Linkedin,1)
##   Keyword      Location    Job_title
## 1    Data United States Data Analyst
##                                                                                                                                                                                                                  Job_link
## 1 https://www.linkedin.com/jobs/view/data-analyst-at-young-life-3521907674?refId=j1Bxa6%2F%2BWbkCr9i%2FQ091RQ%3D%3D&trackingId=vi3sxWiThZ5AMIhSHt0oWQ%3D%3D&position=1&pageNum=0&trk=public_jobs_jserp-result_search-card
##                                        Company
## 1 \n                Young Life\n              
##                                                                   Company_link
## 1 https://www.linkedin.com/company/young-life?trk=public_jobs_topcard-org-name
##                                  Job_location
## 1 \n              United States\n            
##                                                                                                                                                                                                                           Post_time
## 1 \n        \n\n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n\n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n    \n\n      1 day ago\n  \n      
##                                                                               Applicants_count
## 1 \n          \n          \n            Be among the first 25 applicants\n          \n        
##   Job_description
## 1 Mission/Authority\n\nServe mission leadership and staff by curating and analyzing key mission data to deliver insights and decision support to key Young Life stakeholders. Alongside technical and information leaders, deliver the data necessary to support effective decisions, discover efficiencies in mission data systems, and evaluate business data to determine effective analytics solutions.\n\nEssential Duties\n\nBuild and maintain reporting and analytics is Workday, Salesforce, and Power BI related to Young Life’s Finance, Human Resources, Learning, and Product Development Team.\nAssist in growing data literacy and self-service reporting capabilities for Young Life Finance, Human Resources, Learning, and Field organizations.\nInterpret, analyze, and utilize statistical analysis techniques to discover patterns and key insights in mission critical data.\nExecute the curation of data delivered by the data team and exploratory data for analytics consumption and deployment.\nOrganize and optimize all data residing in mission data schemas.\nSupport department embedded analysts by delivering new data sets to meet their reporting and analytics needs.\nAssist the Data Architect in developing the data models necessary to support a wide variety of information needs.\nCollaborate with platform and data resources to ensure a high level of data availability and performance for all datasets used in reporting and analytics.\nCollaborate with the data quality staff to ensure clean and accurate data for reporting and analytical models.\nCollaborate with key business staff and leaders to design and build analytical solutions to meet the operational reporting and decision support need of the mission.\nAssist in building a business intelligence center of excellence to support a wide variety of analytical and reporting needs for mission leadership and staff.\nAssist in pioneering data activities in response to prospective use cases driven by IS and Strategic Leadership.\nEngage in 
assigned data related projects across the analytics and data ecosystem.\n\nWorking Relationships\n\nMultiple regular interactions with key technical and ministry operations stakeholders to maintain a sense of awareness and understanding of current ministry needs.\nRegular involvement in the cross-departmental teams.\nCollaborate with Project Management to provide data expertise and perspective on key mission projects.\nAssists the Director of Mission Analytics and Decision Support in furthering the goals of the mission.\n\nEducation\n\nBachelor’s, Master’s Degree, or equivalent experience in a business, technical, or data related field required.\n\nWorking Conditions\n\nOffice Environment at the Service Center in Colorado Springs, CO with hybrid or remote options.\nTravel 1-3 times a year may be required.\nThis position offers a competitive salary range of $59,800 - $74,749 annually commensurate with relevant experience, qualifications, and education.\n\nQualifications Required For The Job\n\n5-7 years minimum work experience.\nProficiency in modern business intelligence and enterprise platform tools is required (Workday, Salesforce, Power BI, Excel, or similar).\nStrong working knowledge of relational databases and/or SQL is required.\nExperience with Snowflake and Python is preferred.\n3 or more years of related data, business intelligence, or data analyst experience required.\nHighly organized, self-motivated, and attentive to detail.\nStrong interpersonal, communication, and problem-solving skills needed.\n\nApplication\n\nPlease submit your application, resume, and cover letter by April 3rd, 2023.
##                         Seniority_level                 Employment_type
## 1 \n            Entry level\n           \n          Full-time\n        
##                                                             Job_function
## 1 \n            Strategy/Planning and Information Technology\n          
##                                       Industries
## 1 \n          Religious Institutions\n

Datatable of raw file

library(DT)
datatable(head(Linkedin))

Remove leading and trailing white spaces

We need to remove the leading and trailing white spaces in the following columns:

  • Company

  • Job_location

  • Post_time

  • Applicants_count

  • Seniority_level

  • Employment_type

  • Job_function

  • Industries
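As a minimal sketch of the trimming step, the snippet below uses a small toy data frame rather than the real file. The object names `raw` and `trimmed` and the column `n_applicants` are hypothetical and exist only for illustration. Because `trimws` returns character vectors, this variant trims only the character columns, which leaves the other column types unchanged.

```r
# Toy data frame standing in for the raw data (illustrative values only)
raw <- data.frame(
  Company = c("\n   Young Life\n  ", "  WaitWhat "),
  n_applicants = c(25L, 34L),  # hypothetical numeric column
  stringsAsFactors = FALSE
)

# Trim only character columns so that other column types are preserved
trimmed <- raw
trimmed[] <- lapply(trimmed, function(col) {
  if (is.character(col)) trimws(col) else col
})

str(trimmed)
```

Assigning into `trimmed[]` keeps the data frame structure and column names, so no call to `data.frame()` is needed afterwards.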

Matrix

rem_WS_Linkedin <- Linkedin
rem_WS_Linkedin <- data.frame(lapply(rem_WS_Linkedin, trimws), stringsAsFactors = FALSE)
# At first, I removed the white spaces manually (see below). However, the link below shows how to remove leading and trailing white spaces for the entire dataframe. https://stackoverflow.com/questions/20760547/removing-whitespace-from-a-whole-data-frame-in-r

# tidied_Linkedin <- Linkedin
# tidied_Linkedin$Company <- str_trim(tidied_Linkedin$Company)
# tidied_Linkedin$Job_location <- str_trim(tidied_Linkedin$Job_location)
# tidied_Linkedin$Post_time <- str_trim(tidied_Linkedin$Post_time)
# tidied_Linkedin$Applicants_count <- str_trim(tidied_Linkedin$Applicants_count)
# tidied_Linkedin$Seniority_level <- str_trim(tidied_Linkedin$Seniority_level)
# tidied_Linkedin$Employment_type <- str_trim(tidied_Linkedin$Employment_type)
# tidied_Linkedin$Job_function <- str_trim(tidied_Linkedin$Job_function)
# tidied_Linkedin$Industries <- str_trim(tidied_Linkedin$Industries)

head(rem_WS_Linkedin,1)
##   Keyword      Location    Job_title
## 1    Data United States Data Analyst
##                                                                                                                                                                                                                  Job_link
## 1 https://www.linkedin.com/jobs/view/data-analyst-at-young-life-3521907674?refId=j1Bxa6%2F%2BWbkCr9i%2FQ091RQ%3D%3D&trackingId=vi3sxWiThZ5AMIhSHt0oWQ%3D%3D&position=1&pageNum=0&trk=public_jobs_jserp-result_search-card
##      Company
## 1 Young Life
##                                                                   Company_link
## 1 https://www.linkedin.com/company/young-life?trk=public_jobs_topcard-org-name
##    Job_location Post_time                 Applicants_count
## 1 United States 1 day ago Be among the first 25 applicants
##   Job_description
## 1 Mission/Authority\n\nServe mission leadership and staff by curating and analyzing key mission data to deliver insights and decision support to key Young Life stakeholders. Alongside technical and information leaders, deliver the data necessary to support effective decisions, discover efficiencies in mission data systems, and evaluate business data to determine effective analytics solutions.\n\nEssential Duties\n\nBuild and maintain reporting and analytics is Workday, Salesforce, and Power BI related to Young Life’s Finance, Human Resources, Learning, and Product Development Team.\nAssist in growing data literacy and self-service reporting capabilities for Young Life Finance, Human Resources, Learning, and Field organizations.\nInterpret, analyze, and utilize statistical analysis techniques to discover patterns and key insights in mission critical data.\nExecute the curation of data delivered by the data team and exploratory data for analytics consumption and deployment.\nOrganize and optimize all data residing in mission data schemas.\nSupport department embedded analysts by delivering new data sets to meet their reporting and analytics needs.\nAssist the Data Architect in developing the data models necessary to support a wide variety of information needs.\nCollaborate with platform and data resources to ensure a high level of data availability and performance for all datasets used in reporting and analytics.\nCollaborate with the data quality staff to ensure clean and accurate data for reporting and analytical models.\nCollaborate with key business staff and leaders to design and build analytical solutions to meet the operational reporting and decision support need of the mission.\nAssist in building a business intelligence center of excellence to support a wide variety of analytical and reporting needs for mission leadership and staff.\nAssist in pioneering data activities in response to prospective use cases driven by IS and Strategic Leadership.\nEngage in 
assigned data related projects across the analytics and data ecosystem.\n\nWorking Relationships\n\nMultiple regular interactions with key technical and ministry operations stakeholders to maintain a sense of awareness and understanding of current ministry needs.\nRegular involvement in the cross-departmental teams.\nCollaborate with Project Management to provide data expertise and perspective on key mission projects.\nAssists the Director of Mission Analytics and Decision Support in furthering the goals of the mission.\n\nEducation\n\nBachelor’s, Master’s Degree, or equivalent experience in a business, technical, or data related field required.\n\nWorking Conditions\n\nOffice Environment at the Service Center in Colorado Springs, CO with hybrid or remote options.\nTravel 1-3 times a year may be required.\nThis position offers a competitive salary range of $59,800 - $74,749 annually commensurate with relevant experience, qualifications, and education.\n\nQualifications Required For The Job\n\n5-7 years minimum work experience.\nProficiency in modern business intelligence and enterprise platform tools is required (Workday, Salesforce, Power BI, Excel, or similar).\nStrong working knowledge of relational databases and/or SQL is required.\nExperience with Snowflake and Python is preferred.\n3 or more years of related data, business intelligence, or data analyst experience required.\nHighly organized, self-motivated, and attentive to detail.\nStrong interpersonal, communication, and problem-solving skills needed.\n\nApplication\n\nPlease submit your application, resume, and cover letter by April 3rd, 2023.
##   Seniority_level Employment_type                                 Job_function
## 1     Entry level       Full-time Strategy/Planning and Information Technology
##               Industries
## 1 Religious Institutions

Datatable

datatable(rem_WS_Linkedin)
## Warning in instance$preRenderHook(instance): It seems your data is too big
## for client-side DataTables. You may consider server-side processing: https://
## rstudio.github.io/DT/server.html

Tidy ‘Job_location’ column

Split the ‘Job_location’ column into city and state

Most values in the ‘Job_location’ column contain both a city and a state, separated by a comma. We need to split ‘Job_location’ at that comma.
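The idea can be sketched on a few toy values. This is an illustration in base R under assumed inputs, not the code used in this report. The vector `loc` and the variables `city` and `state` are hypothetical names. Values without a comma get `NA` for the state, which makes the gaps easy to find later.

```r
# Toy values standing in for the 'Job_location' column (illustrative only)
loc <- c("New York, NY", "California", "Boston, MA")

# Does the value contain a comma at all?
has_comma <- grepl(",", loc, fixed = TRUE)

# Everything before the first comma is treated as the city
city <- trimws(sub(",.*$", "", loc))

# Everything after the first comma is treated as the state; NA when there is no comma
state <- ifelse(has_comma, trimws(sub("^[^,]*,", "", loc)), NA_character_)

data.frame(Job_location = loc, City = city, State = state)
```

Note the `trimws` call on the state: splitting at a bare comma leaves the space that follows it, so "NY" would otherwise come out as " NY".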

library(dplyr)
## 
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
## 
##     filter, lag
## The following objects are masked from 'package:base':
## 
##     intersect, setdiff, setequal, union
library(tidyr)
library(stringr)
split_location_Linkedin <- rem_WS_Linkedin
# tidied_Linkedin %>% 
#    separate(Job_location,c("City","State"),sep=",")

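# Split at the first comma; ',\\s*' also drops the space after it, so the state is "NY" rather than " NY"
split_location_Linkedin[c('Job_location_City', 'Job_location_State')] <- str_split_fixed(split_location_Linkedin$Job_location, ',\\s*', 2)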

#colnames(split_location_Linkedin)

# Drop the 'Location' and 'Job_location' columns and reorder the remaining columns
tidied_Linkedin <- split_location_Linkedin[c('Keyword', 'Job_title', 'Job_link', 'Company','Company_link','Job_location_City','Job_location_State','Post_time','Applicants_count','Seniority_level','Employment_type','Job_function','Industries','Job_description')]

datatable(head(tidied_Linkedin))

Tidy ‘Job_location_State’

Some values in the raw file contain only a state or a metropolitan area, with no comma. When I split the ‘Job_location’ column at the comma, those values were placed in the city column, so the corresponding entries in ‘Job_location_State’ are empty. We fill them in by hand.
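As an alternative to one assignment per value, the fill-in can be sketched with lookups. This is a hedged illustration on toy data, not the method used in this report. `toy` and `metro_lookup` are hypothetical names, while `state.name` and `state.abb` are built-in vectors from R's datasets package that pair full state names with their abbreviations.

```r
# Toy data frame standing in for tidied_Linkedin (illustrative values only)
toy <- data.frame(
  Job_location_City  = c("Texas", "Boston", "Greater Seattle Area", "Ohio"),
  Job_location_State = c("", "MA", "", ""),
  stringsAsFactors = FALSE
)

# Hypothetical lookup for metro-area labels, which are not state names
metro_lookup <- c("Greater Seattle Area" = "WA",
                  "Greater Chicago Area" = "IL")

# Full state names map to abbreviations through the built-in vectors
from_state <- state.abb[match(toy$Job_location_City, state.name)]
from_metro <- unname(metro_lookup[toy$Job_location_City])

# Prefer the state-name match, then fall back to the metro lookup
fill <- ifelse(is.na(from_state), from_metro, from_state)

# Only overwrite entries that are empty and for which a match was found
missing <- toy$Job_location_State == "" & !is.na(fill)
toy$Job_location_State[missing] <- fill[missing]
toy
```

A lookup keeps each city-to-state pair in one place, which makes a wrong abbreviation easier to spot than in a long run of near-identical assignments.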

#unique(tidied_Linkedin$Job_location_State)
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "New York"] <- "NY"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "New York City Metropolitan Area"] <- "NY"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "Utica-Rome Area"] <- "NY"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "Denver Metropolitan Area"] <- "CO"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "California"] <- "CA"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "North Carolina"] <- "NC"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "Florida"] <- "FL"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "Texas"] <- "TX"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "Dallas-Fort Worth Metroplex"] <- "TX"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "Minnesota"] <- "MN"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "Ohio"] <- "OH"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "Salt Lake City Metropolitan Area"] <- "UT"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "Greater Chicago Area"] <- "IL"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "Greater Seattle Area"] <- "WA"
tidied_Linkedin$Job_location_State[tidied_Linkedin$Job_location_City == "Des Moines Metropolitan Area"] <- "IA"
datatable(tidied_Linkedin)
## Warning in instance$preRenderHook(instance): It seems your data is too big
## for client-side DataTables. You may consider server-side processing: https://
## rstudio.github.io/DT/server.html
head(tidied_Linkedin)
##   Keyword                    Job_title
## 1    Data                 Data Analyst
## 2    Data Data Analyst (Digital Media)
## 3    Data Human Resources Data Analyst
## 4    Data Human Resources Data Analyst
## 5    Data                 Data Analyst
## 6    Data                 Data Analyst
##                                                                                                                                                                                                                                             Job_link
## 1                            https://www.linkedin.com/jobs/view/data-analyst-at-young-life-3521907674?refId=j1Bxa6%2F%2BWbkCr9i%2FQ091RQ%3D%3D&trackingId=vi3sxWiThZ5AMIhSHt0oWQ%3D%3D&position=1&pageNum=0&trk=public_jobs_jserp-result_search-card
## 2                https://www.linkedin.com/jobs/view/data-analyst-digital-media-at-waitwhat-3526057051?refId=j1Bxa6%2F%2BWbkCr9i%2FQ091RQ%3D%3D&trackingId=uK5xKyhQEnQqddLCNQRMXg%3D%3D&position=2&pageNum=0&trk=public_jobs_jserp-result_search-card
## 3 https://www.linkedin.com/jobs/view/human-resources-data-analyst-at-los-angeles-dodgers-3526119048?refId=j1Bxa6%2F%2BWbkCr9i%2FQ091RQ%3D%3D&trackingId=08k9p0f7z%2BALiRZVg5jcWg%3D%3D&position=3&pageNum=0&trk=public_jobs_jserp-result_search-card
## 4                               https://www.linkedin.com/jobs/view/data-analyst-at-experfy-3526062049?refId=j1Bxa6%2F%2BWbkCr9i%2FQ091RQ%3D%3D&trackingId=KNlhCQN5Nm1QtugiflSJsg%3D%3D&position=4&pageNum=0&trk=public_jobs_jserp-result_search-card
## 5                        https://www.linkedin.com/jobs/view/data-analyst-at-rsa-conference-3526162382?refId=j1Bxa6%2F%2BWbkCr9i%2FQ091RQ%3D%3D&trackingId=NETC157DPeETcCAyPEBSEA%3D%3D&position=5&pageNum=0&trk=public_jobs_jserp-result_search-card
## 6                                https://www.linkedin.com/jobs/view/data-analyst-at-clutch-3526178035?refId=j1Bxa6%2F%2BWbkCr9i%2FQ091RQ%3D%3D&trackingId=727LCwCWaecFqZwPvsoNVw%3D%3D&position=6&pageNum=0&trk=public_jobs_jserp-result_search-card
##               Company
## 1          Young Life
## 2            WaitWhat
## 3 Los Angeles Dodgers
## 4 Los Angeles Dodgers
## 5      RSA Conference
## 6              Clutch
##                                                                            Company_link
## 1          https://www.linkedin.com/company/young-life?trk=public_jobs_topcard-org-name
## 2          https://www.linkedin.com/company/wait-what-?trk=public_jobs_topcard-org-name
## 3 https://www.linkedin.com/company/los-angeles-dodgers?trk=public_jobs_topcard-org-name
## 4 https://www.linkedin.com/company/los-angeles-dodgers?trk=public_jobs_topcard-org-name
## 5      https://www.linkedin.com/company/rsa-conference?trk=public_jobs_topcard-org-name
## 6           https://www.linkedin.com/company/clutchnow?trk=public_jobs_topcard-org-name
##   Job_location_City Job_location_State   Post_time
## 1     United States                      1 day ago
## 2          New York                 NY 3 weeks ago
## 3       Los Angeles                 CA   1 day ago
## 4       Los Angeles                 CA   1 day ago
## 5            Boston                 MA 2 hours ago
## 6     United States                     1 hour ago
##                   Applicants_count  Seniority_level Employment_type
## 1 Be among the first 25 applicants      Entry level       Full-time
## 2 Be among the first 25 applicants   Not Applicable       Full-time
## 3                    34 applicants Mid-Senior level       Part-time
## 4                    34 applicants Mid-Senior level       Part-time
## 5                    29 applicants Mid-Senior level       Full-time
## 6                    31 applicants      Entry level        Contract
##                                   Job_function
## 1 Strategy/Planning and Information Technology
## 2                       Information Technology
## 3                              Human Resources
## 4                              Human Resources
## 5                       Information Technology
## 6                       Information Technology
##                             Industries
## 1               Religious Institutions
## 2 Technology, Information and Internet
## 3                     Spectator Sports
## 4                     Spectator Sports
## 5                  Internet Publishing
## 6              Staffing and Recruiting
##   Job_description
## 1 Mission/Authority\n\nServe mission leadership 
and staff by curating and analyzing key mission data to deliver insights and decision support to key Young Life stakeholders. Alongside technical and information leaders, deliver the data necessary to support effective decisions, discover efficiencies in mission data systems, and evaluate business data to determine effective analytics solutions.\n\nEssential Duties\n\nBuild and maintain reporting and analytics is Workday, Salesforce, and Power BI related to Young Life’s Finance, Human Resources, Learning, and Product Development Team.\nAssist in growing data literacy and self-service reporting capabilities for Young Life Finance, Human Resources, Learning, and Field organizations.\nInterpret, analyze, and utilize statistical analysis techniques to discover patterns and key insights in mission critical data.\nExecute the curation of data delivered by the data team and exploratory data for analytics consumption and deployment.\nOrganize and optimize all data residing in mission data schemas.\nSupport department embedded analysts by delivering new data sets to meet their reporting and analytics needs.\nAssist the Data Architect in developing the data models necessary to support a wide variety of information needs.\nCollaborate with platform and data resources to ensure a high level of data availability and performance for all datasets used in reporting and analytics.\nCollaborate with the data quality staff to ensure clean and accurate data for reporting and analytical models.\nCollaborate with key business staff and leaders to design and build analytical solutions to meet the operational reporting and decision support need of the mission.\nAssist in building a business intelligence center of excellence to support a wide variety of analytical and reporting needs for mission leadership and staff.\nAssist in pioneering data activities in response to prospective use cases driven by IS and Strategic Leadership.\nEngage in assigned data related projects across the 
analytics and data ecosystem.\n\nWorking Relationships\n\nMultiple regular interactions with key technical and ministry operations stakeholders to maintain a sense of awareness and understanding of current ministry needs.\nRegular involvement in the cross-departmental teams.\nCollaborate with Project Management to provide data expertise and perspective on key mission projects.\nAssists the Director of Mission Analytics and Decision Support in furthering the goals of the mission.\n\nEducation\n\nBachelor’s, Master’s Degree, or equivalent experience in a business, technical, or data related field required.\n\nWorking Conditions\n\nOffice Environment at the Service Center in Colorado Springs, CO with hybrid or remote options.\nTravel 1-3 times a year may be required.\nThis position offers a competitive salary range of $59,800 - $74,749 annually commensurate with relevant experience, qualifications, and education.\n\nQualifications Required For The Job\n\n5-7 years minimum work experience.\nProficiency in modern business intelligence and enterprise platform tools is required (Workday, Salesforce, Power BI, Excel, or similar).\nStrong working knowledge of relational databases and/or SQL is required.\nExperience with Snowflake and Python is preferred.\n3 or more years of related data, business intelligence, or data analyst experience required.\nHighly organized, self-motivated, and attentive to detail.\nStrong interpersonal, communication, and problem-solving skills needed.\n\nApplication\n\nPlease submit your application, resume, and cover letter by April 3rd, 2023.
## 2 About WaitWhat\n\nWaitWhat creates and owns one of the most valuable independent portfolios of premium content designed to elevate human potential — Masters of Scale (business/leadership), Meditative Story (well-being/mental fitness), and Spark & Fire (creativity/innovation).\n\nToday, WaitWhat’s portfolio is downloaded millions of times every month, represents one of the largest bodies of timeless wisdom, and helps people around the world access their own potential. We design each media property to scale beyond the first format, with an expansive suite of profitable learning products, content experiences, and live events — both for consumer and enterprise markets – that reach a passionately loyal audience.\n\nIn everything WaitWhat creates, we aim to elicit the contagious emotions of wonder and curiosity — infusing the content with “Wait, what?” moments that help us achieve mastery and live at the top of our talent.\n\nWaitWhat’s vibrant culture is rooted in optimism, possibility thinking, and an uncommon willingness to embrace risk with an intention to push the outer bounds of creativity and experimentation. Purposefully rich in diversity, we treat each other with kindness and respect, and always support “yes, and” style conversations. Our passion for creating new business models, and building inventive partnerships, makes allies out of competitors.\n\nWaitWhat’s strategic focus on diversity (including gender balance) among guests creates network effects, drawing an ever-widening circle of diverse A-list guests. Heading into 2023, we enjoy an extraordinary group of diverse, purpose-driven, and deeply admired investors, a strong balance sheet, and a clear thesis to chart our path to the next level of scale.\n\nAbout the role:\n\nWaitWhat is seeking a Data Analyst (digital media) to work collaboratively across the organization towards building a more data-informed culture.\n\nData is a developing channel at WaitWhat. 
You’ll have the opportunity to make your mark — working collaboratively across the organization with executive leadership, marketing, product, content, technology and design to build and deliver intelligence that fuels our growth.\n\nYou’ll aggregate and analyze content data from a growing number of sources, and distill complex analyses into clear, actionable solutions for teams throughout the company.\n\nThis role will work alongside the Head of Audience Development & Growth as a key member of the team. We’ll work creatively & collaboratively together. We all bring ideas to the table and we build on them to bring them to life.\n\nWe want to deepen your skills and stretch your talents in new ways; give you ownership and responsibility for projects; inspire and delight your ability to impact the world through the media we create; and welcome you into a very special humble, high-collaborative startup environment.\n\nThe areas where you’ll leave your mark:\n\nAggregate and analyze data from various sources for all of WaitWhat’s media properties to assess the effectiveness of existing marketing, advertising and communications programs. Channels include (but are not limited to) podcasting, web, email, social media, video and subscription apps.\nPrepare and present data in a way that is visual, uncomplicated and accessible to everyone to understand.\nWork collaboratively across the organization to identify and develop solutions for the tracking of KPIs across departments.\nCreate and evaluate methods for gathering data, including surveys, interviews, questionnaires and opinion polls.\nBe a key stakeholder in the scoping & implementation of new MarTech systems.\n\n\nRequirements\n\nYou’ve got 3-5 years of experience in a data analytics role at a digital media company.\nYou understand how content behaves all across the digital landscape. 
In particular, podcasting, web, subscriptions, video and apps.\nYou have an insatiable curiosity about how and why things work and why sometimes they don’t.\nYou are an impeccable executor with a knack for getting things done on schedule, but always with excellence because you sweat the details.\nYou have the mind and the ambition to meet high-reaching growth goals for our media properties, both in their first format and their extensions.\n\n\nBenefits\n\nThis is a full-time position in NYC / remote. The salary range for this role is $100k - $150k. This is the lowest to highest salary we reasonably and in good faith believe we would pay for this role at the time of this posting. We may ultimately pay more or less than the posted range, and the range may be modified in the future. An employee’s pay position within the salary range will be based on several factors including, but not limited to, relevant education, qualifications, certifications, experience, skills, geographic location, performance, and business or organizational needs. Candidates with expectations outside of this range or experience outside of the requirements listed are still encouraged to apply as there may be other positions that could be a fit.\n\nWe offer a benefits package that’s generous and competitive with larger companies. Benefits include 3 weeks of company holiday (2 weeks in summer + one week between Christmas and New Years), 5 personal days, healthcare (PPO, vision and dental), short- and long-term disability, life insurance and 401 (k). We’re an anti-racist organization, committed to equity and diversity of all kinds — on our team, among our investors, and as represented in our media properties. Most importantly, we only hire extremely kind people.
## 3                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 Job Details\n\n\n\n\nDescription\n\n\n\n\nTitle: Human Resources Data Analyst\n\n\n\n\nDepartment: Human Resources\n\n\n\n\nReports to: Director, Compensation & Benefits\n\n\n\n\nStatus: Part-Time, Temporary (3-month assignment)\n\n\n\n\nPay Rate: $28.00 - $30.00/hour*\n\n\n\n\nCompensation rates vary based on job-related factors, including experience, job skills, education, and training.\n\n\n\n\nThe Human Resources Data 
Analyst will collect, compile, and analyze HR data, metrics, and statistics, and apply this data to make recommendations. Performs administrative and professional work organizing the administrative and technical aspects for HR program area(s), develops and establishes work processes and procedures to deliver program services, evaluates effectiveness of program services and efficiency of processes, and implements prescribed program guidelines and objectives. This position will be fully remote.\n\n\n\n\nEssential Duties/Responsibilities\n\n\n\n\nAnalyze and document data structures\nPursue in-depth knowledge of systems, processes and reports to ensure full understanding\nPrioritize and deliver on multiple in-flight projects at the same time across multiple HR focus areas\nProactively communicate with project stakeholders to meet project deadlines and expectations\nAnalyze information and make correct inferences or draw accurate conclusions\nReview and evaluate Human Resources data, provide recommendations for enhancements to workflow and work processes to effectively use HRIS (UKG)\nEvaluate issues and opportunities identified in assignments improve performance and processes within data and UKG platform. 
Document and present changes for development and delivery\nMaintain knowledge of HR data and configurations and how they impact workflows and results\nAnalyze data and statistics for trends and patterns with attention to current HR practices and compliance with employment laws and regulations\nMay conduct or assist with conducting audits and additional projects as assigned\n\n\n\n\nBasic Requirements/Qualifications\n\n\n\n\nCan identify rules, principles, or relationships that explain facts, data, or other information\nSkills in statistical analysis related to testing validation.\nStrong analytical skill to analyze a variety of different data structures\nUnderstanding of business analytics fundamentals\nStrong problem solving skills\nProficient in Microsoft Office with advanced Excel skills including pivot tables, match, indirect, etc.\nThe ability to maintain strict confidentiality\nAbility to learn compliance reporting requirements\nInsights into strategic metrics\nProficient with or the ability to quickly learn the organization's HRIS (UKG), payroll, and similar employee management software.\nGood communication skills\nBachelor's degree in Human Resources, Business Administration, Finance or related field required\n\n\n\n\nCurrent Los Angeles Dodgers employees should apply via the internal job board in UltiPro by following these prompts:\n\n\n\n\nMENU > MYSELF > MY COMPANY > VIEW OPPORTUNITIES > select the position > CONSENT > APPLY NOW\n\n\n\n\nLOS ANGELES DODGERS LLC is firmly committed to providing equal opportunity for all qualified applicants from every race, creed, and background. 
The Organization is also firmly committed to complying with all applicable laws and governmental regulations at the state and local levels which prohibit discrimination.\n\n\n\n\nLOS ANGELES DODGERS LLC considers all applicants without regard to national origin, race, color, religion, age, sex, sexual orientation, disability, military status, citizenship status, pregnancy or related medical conditions, marital status, ancestry-ethnicity, or any other characteristic protected by applicable state or federal civil rights law. The Immigration Reform and Control Act require that the Organization obtain documentation from every individual who is employed which verifies identity and authorizes their right to work in the United States.
## 4                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        Job DetailsDescriptionTitle: Human Resources Data Analyst Department: Human ResourcesReports to: Director, Compensation & BenefitsStatus: Part-Time, Temporary (3-month assignment)Pay Rate: $28.00 - $30.00/hour*Compensation 
rates vary based on job-related factors, including experience, job skills, education, and training.The Human Resources Data Analyst will collect, compile, and analyze HR data, metrics, and statistics, and apply this data to make recommendations. Performs administrative and professional work organizing the administrative and technical aspects for HR program area(s), develops and establishes work processes and procedures to deliver program services, evaluates effectiveness of program services and efficiency of processes, and implements prescribed program guidelines and objectives. This position will be fully remote.Essential Duties/ResponsibilitiesAnalyze and document data structuresPursue in-depth knowledge of systems, processes and reports to ensure full understandingPrioritize and deliver on multiple in-flight projects at the same time across multiple HR focus areasProactively communicate with project stakeholders to meet project deadlines and expectationsAnalyze information and make correct inferences or draw accurate conclusionsReview and evaluate Human Resources data, provide recommendations for enhancements to workflow and work processes to effectively use HRIS (UKG)Evaluate issues and opportunities identified in assignments improve performance and processes within data and UKG platform. 
Document and present changes for development and deliveryMaintain knowledge of HR data and configurations and how they impact workflows and resultsAnalyze data and statistics for trends and patterns with attention to current HR practices and compliance with employment laws and regulationsMay conduct or assist with conducting audits and additional projects as assignedBasic Requirements/QualificationsCan identify rules, principles, or relationships that explain facts, data, or other informationSkills in statistical analysis related to testing validation.Strong analytical skill to analyze a variety of different data structuresUnderstanding of business analytics fundamentalsStrong problem solving skillsProficient in Microsoft Office with advanced Excel skills including pivot tables, match, indirect, etc.The ability to maintain strict confidentialityAbility to learn compliance reporting requirementsInsights into strategic metricsProficient with or the ability to quickly learn the organization's HRIS (UKG), payroll, and similar employee management software.Good communication skillsBachelor's degree in Human Resources, Business Administration, Finance or related field requiredCurrent Los Angeles Dodgers employees should apply via the internal job board in UltiPro by following these prompts:MENU > MYSELF > MY COMPANY > VIEW OPPORTUNITIES > select the position > CONSENT > APPLY NOWLOS ANGELES DODGERS LLC is firmly committed to providing equal opportunity for all qualified applicants from every race, creed, and background. 
The Organization is also firmly committed to complying with all applicable laws and governmental regulations at the state and local levels which prohibit discrimination.LOS ANGELES DODGERS LLC considers all applicants without regard to national origin, race, color, religion, age, sex, sexual orientation, disability, military status, citizenship status, pregnancy or related medical conditions, marital status, ancestry-ethnicity, or any other characteristic protected by applicable state or federal civil rights law. The Immigration Reform and Control Act require that the Organization obtain documentation from every individual who is employed which verifies identity and authorizes their right to work in the United States.
## 5                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                                            We are seeking a proven Data Analyst that can support the marketing team by providing data driven insights regarding markets, competitors, and campaign effectiveness. Working alongside a Senior Data Strategist and Marketing Director, this role will be responsible for maintaining, enhancing, and executing a comprehensive marketing analytics strategy and tracking program. This position will provide analysis on a formal and ad hoc basis using standardized and automated reporting with the objective of measuring performance and supporting business decisions. Working with the Data Strategist, the Data Analyst will work on data cleansing, modeling, joining, querying, and automation for the purpose of analysis and visualization.\n\n\n\n\nResponsibilities\n\n\n\n\nProvide business analytics for marketing, technology, and editorial teams.\nWork with stakeholders to understand their unique challenges, translate those challenges into data problems, and then provide insight back to the business based on a quantitative perspective.\nSummarize analytical results, drawing out key learnings, making recommendations and presenting to key stakeholders in a clear and precise manner.\nWork with Data Strategist to define data requirements for modelling and automation projects.\nWork with all-things Google Analytics 4, Google Tag Manager, BigQuery, and Tableau.\n\n\n\n\nRequired Experience\n\n\n\n\nCandidates should have extensive experience in the following:\n\n\n\n\nConfiguring and maintaining a complex Google Analytics 4 implementation.\nConfiguring and maintaining a complex Google Tag Manager implementation, including creating 
a tag firing logic based on user consent.\nGoogle Tag Manager server-side implementations.\nTableau environment and creating interactive visually compelling dashboards.\n\n\n\n\nCandidate should be proficient in the following:\n\n\n\n\nAnalyzing and visualizing organic/paid search, email, social, online transactions, and website behavior data.\nConducting advanced attribution and ROI modeling using campaign data across channels and user journey.\nBreaking data silos by creating resilient modelling of large dataset using disparate sources.\nPerforming clustering/cohort analysis on large datasets for the purpose identifying market segments and future product initiatives.\nWriting ad-hoc queries to assist in analysis of customer segmentation and reporting.\nAssisting in the maintenance and expansion on existing data documentation, including data definition, meta data, sampling, and cleaning procedures.\nAssisting in the design and creation of data transformation processes prior to visualization and establish convention for post-transformation data storage.\nTableau Data Management and Prep Conductor for the purpose of manipulating and setting auto-refresh data pipelines is a plus.\n\n\n\n\nQualifications\n\n\n\n\nBachelor's degree in statistics, business administration, finance, mathematics or a related field.\nMinimum of 5 years in business analytics including experience in Analytics, Marketing Analytics and strategy development within a B2B environment.\n\n\n\n\nThis is a Remote (work from home) position.\n\n\n\n\nPowered by JazzHR\n\n\n\n\nMUBxkLeagW
## 6                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 Role/Title: Data Analyst\n\n\n\n\nSalary 
Range: up to 40hr\n\n\n\n\nRemote\n\n\n\n\nClutch Creative is a rapidly growing staffing agency. We support our clients by identifying unreachable talent that helps their businesses grow.\n\n\n\n\nOur candidates work with us because they know we have the best opportunities available to them and will help them navigate their next career move.\n\n\n\n\nWe care, we're honest, and we hustle—that's what makes us Clutch.\n\n\n\n\nClutch is looking for a Data Analyst for a contract position with a client of ours.\n\n\n\n\nResponsibilities\n\n\n\n\nCollect and analyze data from various sources.\nDevelop and maintain databases and data systems.\nInterpret data, analyze results, and provide insights to support decision-making processes.\nCreate reports and visualizations to communicate findings to stakeholders.\nDownload reports into a spreadsheet in a clean, easy-to-read format.\nRun MRI Simmons reports.\nIdentify patterns and trends in data.\nCollaborate with cross-functional teams to develop and implement data-driven strategies.\nDevelop and maintain data quality and accuracy standards.\n\n\n\n\nRequired Skills\n\n\n\n\nBachelor's degree in a relevant field.\n2+ years of experience in data analysis.\nProficiency in Microsoft Excel and Google Sheets.\nFamiliarity with data visualization tools.\nAbility to work independently and as part of a team.\nStrong attention to detail.\nExcellent written and verbal communication skills.\nAbility to prioritize and manage multiple projects simultaneously.\n\n\n\n\nIf you are interested in this opportunity, please apply today.

Extract salary

#install.packages("XQuartz", dependencies = TRUE)
#install.packages('priceR')
#library(priceR)
#tidied_Linkedin$salary <- extract_salary(tidied_Linkedin$Job_description, salary_range_handling = "min")
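
The priceR call above is left commented out. As a rough alternative, dollar amounts can be pulled out of free text with a regular expression in base R. This is only a sketch: the helper `extract_first_dollar_amount` and the sample strings are hypothetical, and it does not handle salary ranges, hourly versus annual pay, or amounts written without a `$` sign (such as "up to 40hr").

```r
# Hypothetical helper: return the first "$"-prefixed amount in each string
# as a number, or NA when no such amount is present.
extract_first_dollar_amount <- function(text) {
  pattern <- "\\$[0-9][0-9,]*(\\.[0-9]+)?"
  match_pos <- regexpr(pattern, text)
  amounts <- rep(NA_character_, length(text))
  # regmatches() returns one value per element that matched
  amounts[match_pos != -1] <- regmatches(text, match_pos)
  as.numeric(gsub("[$,]", "", amounts))
}

# Made-up descriptions for illustration only
sample_descriptions <- c(
  "Salary range: $65,000 - $80,000 per year",
  "Pay: $40.50/hr, remote",
  "Salary Range: up to 40hr"
)
extract_first_dollar_amount(sample_descriptions)
```

Descriptions without a recognizable amount come back as `NA`, so they can be filtered or inspected separately.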

Tidy up Glassdoor Dataset

glassdoor <- read.csv('https://raw.githubusercontent.com/suswong/dataset-version-2/main/Job%20listing_Glassdoor%20version%202.csv')

Location

Remove the ‘Location’ column

The ‘Location’ column does not contain any values, so we remove it. The ‘Place’ column contains the job location.

rem_location <- glassdoor[,-2] #Remove the 2nd column, which is 'Location'
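
Dropping by position works only while ‘Location’ stays in the second position. A minimal sketch of checking that the column is empty and then dropping it by name, using a small made-up data frame (`toy_glassdoor` and its values are hypothetical stand-ins for the real file):

```r
# Hypothetical stand-in for the Glassdoor data
toy_glassdoor <- data.frame(
  Keyword  = c("Data", "Data"),
  Location = c(NA, NA),
  Place    = c("New York, NY", "Austin, TX")
)

# Confirm the column really is empty (NA or empty string) before removing it
location_is_empty <- all(is.na(toy_glassdoor$Location) | toy_glassdoor$Location == "")

# Drop the column by name rather than by index
keep_cols <- setdiff(names(toy_glassdoor), "Location")
toy_rem_location <- toy_glassdoor[, keep_cols, drop = FALSE]
names(toy_rem_location)
```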

Split the ‘Place’ column into ‘Job_location_City’ and ‘Job_location_State’

We split the ‘Place’ column into ‘Job_location_City’ and ‘Job_location_State’, then rearrange the columns. In the process, I dropped the ‘Page’ and ‘Place’ columns. The ‘Page’ column records which page of the Glassdoor search results the job posting was found on, so it is not necessary for our analysis. The ‘Place’ column is redundant once it has been split.

library(stringr)
split_Place_Glassdoor <- rem_location
# tidied_Linkedin %>% 
#    separate(Job_location,c("City","State"),sep=",")

split_Place_Glassdoor[c('Job_location_City', 'Job_location_State')] <- str_split_fixed(split_Place_Glassdoor$Place, ',', 2)
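
To see what this split produces, here is a small sketch on made-up values (`toy_places` is hypothetical). Splitting on a bare comma leaves the space that follows the comma at the start of the state, and a value with no comma gets an empty string for the state. `str_trim()` is one way to remove the stray space if it matters for later grouping.

```r
library(stringr)

# Made-up values for illustration only
toy_places <- c("New York, NY", "Austin, TX", "Remote")

# Split each value at the first comma into exactly two pieces
parts <- str_split_fixed(toy_places, ",", 2)

city  <- str_trim(parts[, 1])
state <- str_trim(parts[, 2])  # removes the leading space, e.g. " NY" becomes "NY"

data.frame(Job_location_City = city, Job_location_State = state)
```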

colnames(split_Place_Glassdoor)
##  [1] "Keyword"            "Page"               "company"           
##  [4] "rating"             "Job_title"          "Place"             
##  [7] "salary"             "post_date"          "Job_description"   
## [10] "Job_location_City"  "Job_location_State"
# Keep only the columns we need, in the new order; 'Page', 'Place' and 'salary' are dropped by not selecting them
tidied_glassdoor <- split_Place_Glassdoor[c('Keyword','Job_title', 'company','rating','Job_location_City','Job_location_State','post_date','Job_description')]

Combine both datasets into one dataframe

To combine both datasets into one data frame, the columns they share must have identical names. I also want a column that indicates whether each job posting came from Glassdoor or Linkedin. Since we do not need the ‘Keyword’ column, I replaced all of its values with ‘Glassdoor’ or ‘Linkedin’ and renamed the column to ‘Search_Engine’.

final_glassdoor <- tidied_glassdoor
final_glassdoor$Keyword <- "Glassdoor"
colnames(final_glassdoor)[1] <- "Search_Engine"
colnames(final_glassdoor)
## [1] "Search_Engine"      "Job_title"          "company"           
## [4] "rating"             "Job_location_City"  "Job_location_State"
## [7] "post_date"          "Job_description"
final_Linkedin <- tidied_Linkedin
final_Linkedin$Keyword <- "Linkedin"
colnames(final_Linkedin)[1] <- "Search_Engine"
colnames(final_Linkedin)
##  [1] "Search_Engine"      "Job_title"          "Job_link"          
##  [4] "Company"            "Company_link"       "Job_location_City" 
##  [7] "Job_location_State" "Post_time"          "Applicants_count"  
## [10] "Seniority_level"    "Employment_type"    "Job_function"      
## [13] "Industries"         "Job_description"
colnames(final_glassdoor) <- c('Search_Engine','Job_title', 'Company','Rating','Job_location_City','Job_location_State','post_date','Job_description')

final_glassdoor <- final_glassdoor[c('Search_Engine','Job_title', 'Company','Job_location_City','Job_location_State','Job_description')]

final_Linkedin <- final_Linkedin[c('Search_Engine','Job_title', 'Company','Job_location_City','Job_location_State','Job_description')]
total <- rbind(final_Linkedin, final_glassdoor)
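
`rbind()` on data frames matches columns by name, which is why both tables are first reduced to the same six columns. A small sketch with made-up rows (`toy_linkedin` and `toy_glassdoor` are hypothetical) shows the idea, including a quick check that the names agree before binding:

```r
# Hypothetical one-row tables for illustration only
toy_linkedin <- data.frame(
  Search_Engine = "Linkedin",
  Job_title     = "Data Analyst",
  Company       = "Company A"
)
toy_glassdoor <- data.frame(
  Company       = "Company B",       # different column order on purpose
  Search_Engine = "Glassdoor",
  Job_title     = "Data Scientist"
)

# Both data frames must have the same set of column names
stopifnot(setequal(names(toy_linkedin), names(toy_glassdoor)))

# Columns are aligned by name; the result keeps the first table's column order
toy_total <- rbind(toy_linkedin, toy_glassdoor)
table(toy_total$Search_Engine)
```

Running the same `setequal()` check on `final_Linkedin` and `final_glassdoor` before the real `rbind()` would catch a misspelled or missing column early.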

Create a csv file

write.csv(total, "C:\\Desktop\\DATA 607\\Combined_Linkedin_Glassdoor_Version2.csv", row.names=FALSE)
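
The path above is specific to one Windows machine. A hedged sketch of a more portable variant builds the path with `file.path()`. Here a temporary directory and a made-up data frame (`toy_total`) stand in for the real output folder and the combined data:

```r
# Hypothetical stand-in for the combined data frame
toy_total <- data.frame(
  Search_Engine = c("Linkedin", "Glassdoor"),
  Job_title     = c("Data Analyst", "Data Scientist")
)

# tempdir() is used only for illustration; substitute your own output folder
out_path <- file.path(tempdir(), "Combined_Linkedin_Glassdoor_Version2.csv")
write.csv(toy_total, out_path, row.names = FALSE)

# Read the file back to confirm that it round-trips
check <- read.csv(out_path)
dim(check)
```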