Congratulations! You have completed the people analytics course. As a final step in this course, every student should complete the case as their capstone project. In this article, we will explain the task and rubrics of people analytics capstone project.
There will be 2 datasets: train and test dataset.
The train dataset will be used to train and evaluate the model, while the test dataset is used for the final evaluation. The final evaluation requires you to submit your prediction of the test dataset to the app.R in order to obtain the final model evaluation (more details are provided below). The data scheme is illustrated as follows:
A company which is active in Big Data and Data Science wants to hire data scientists among people who successfully pass some courses which conduct by the company. Many people signup for their training. Company wants to know which of these candidates are really wants to work for the company after training or looking for a new employment because it helps to reduce the cost and time as well as the quality of training or planning the courses and categorization of candidates. Information related to demographics, education, experience are in hands from candidates signup and enrollment1.
This dataset designed to understand the factors that lead a person to leave current job for HR researches too. By model(s) that uses the current credentials,demographics,experience data you will predict the probability of a candidate to look for a new job or will work for the company, as well as interpreting affected factors on employee decision.
We provide the train dataset as follows:
The observation data consists of the following variables:
enrollee_id: Unique ID for candidatecity_ development _index: Developement index of the city (scaled)gender: Gender of candidaterelevent_experience: Relevant experience of candidateeducation_level: Education level of candidatemajor_discipline:Education major discipline of candidateexperience: Candidate total experience in yearslastnewjob: Difference in years between previous job and current jobtraining_hours: training hours completedtarget: 0 â Not looking for job change, 1 â Looking for a job changeTotal points of this people analytics course is 20 points. You can achieve full points when you meet the criteria below:
(2 Points) Demonstrated how to properly do data Data Preparation.
(2 Points) Explored the proportion of the target variable.
(2 Points) Demonstrated how to prepare cross-validation data for this case.
(2 Points) Demonstrated how to properly do model fitting and evaluation.
(2 Points) Demonstrated how to properly do model selection by comparing models or making adjustment to single model.
Put your model RDS in given template dashboard and see how better your model performance is.
(2 Points) Write the conclusion of your capstone project
After finishing your work of data preprocessing, modeling, and model evaluation, the next step will be;