## Warning: package 'ggplot2' was built under R version 4.4.3
Το dataset Employee Attrition and Factors περιέχει δεδομένα εργαζομένων μιας επιχείρησης και χρησιμοποιείται για την ανάλυση της αποχώρησης προσωπικού. Περιλαμβάνει δημογραφικές πληροφορίες, επαγγελματικά χαρακτηριστικά και δεδομένα ικανοποίησης εργαζομένων. Η πηγή του Dataset είναι το https://www.kaggle.com/search.
Αυτό το dataset είναι χρήσιμο για:
-Πρόβλεψη αποχωρήσεων εργαζομένων (Attrition Prediction).
-Ανάλυση παραγόντων που επηρεάζουν την ικανοποίηση των εργαζομένων.
-Αξιολόγηση της επιρροής παραμέτρων όπως μισθός, εκπαίδευση, εργασιακή ισορροπία στην παραμονή των εργαζομένων.
summary(Employee.Attrition.and.Factors)
## Age Attrition BusinessTravel DailyRate
## Min. :18.00 Length:1470 Length:1470 Min. : 102.0
## 1st Qu.:30.00 Class :character Class :character 1st Qu.: 465.0
## Median :36.00 Mode :character Mode :character Median : 802.0
## Mean :36.92 Mean : 802.5
## 3rd Qu.:43.00 3rd Qu.:1157.0
## Max. :60.00 Max. :1499.0
## Department DistanceFromHome Education EducationField
## Length:1470 Min. : 1.000 Min. :1.000 Length:1470
## Class :character 1st Qu.: 2.000 1st Qu.:2.000 Class :character
## Mode :character Median : 7.000 Median :3.000 Mode :character
## Mean : 9.193 Mean :2.913
## 3rd Qu.:14.000 3rd Qu.:4.000
## Max. :29.000 Max. :5.000
## EmployeeCount EmployeeNumber EnvironmentSatisfaction Gender
## Min. :1 Min. : 1.0 Min. :1.000 Length:1470
## 1st Qu.:1 1st Qu.: 491.2 1st Qu.:2.000 Class :character
## Median :1 Median :1020.5 Median :3.000 Mode :character
## Mean :1 Mean :1024.9 Mean :2.722
## 3rd Qu.:1 3rd Qu.:1555.8 3rd Qu.:4.000
## Max. :1 Max. :2068.0 Max. :4.000
## HourlyRate JobInvolvement JobLevel JobRole
## Min. : 30.00 Min. :1.00 Min. :1.000 Length:1470
## 1st Qu.: 48.00 1st Qu.:2.00 1st Qu.:1.000 Class :character
## Median : 66.00 Median :3.00 Median :2.000 Mode :character
## Mean : 65.89 Mean :2.73 Mean :2.064
## 3rd Qu.: 83.75 3rd Qu.:3.00 3rd Qu.:3.000
## Max. :100.00 Max. :4.00 Max. :5.000
## JobSatisfaction MaritalStatus MonthlyIncome MonthlyRate
## Min. :1.000 Length:1470 Min. : 1009 Min. : 2094
## 1st Qu.:2.000 Class :character 1st Qu.: 2911 1st Qu.: 8047
## Median :3.000 Mode :character Median : 4919 Median :14236
## Mean :2.729 Mean : 6503 Mean :14313
## 3rd Qu.:4.000 3rd Qu.: 8379 3rd Qu.:20462
## Max. :4.000 Max. :19999 Max. :26999
## NumCompaniesWorked Over18 OverTime PercentSalaryHike
## Min. :0.000 Length:1470 Length:1470 Min. :11.00
## 1st Qu.:1.000 Class :character Class :character 1st Qu.:12.00
## Median :2.000 Mode :character Mode :character Median :14.00
## Mean :2.693 Mean :15.21
## 3rd Qu.:4.000 3rd Qu.:18.00
## Max. :9.000 Max. :25.00
## PerformanceRating RelationshipSatisfaction StandardHours StockOptionLevel
## Min. :3.000 Min. :1.000 Min. :80 Min. :0.0000
## 1st Qu.:3.000 1st Qu.:2.000 1st Qu.:80 1st Qu.:0.0000
## Median :3.000 Median :3.000 Median :80 Median :1.0000
## Mean :3.154 Mean :2.712 Mean :80 Mean :0.7939
## 3rd Qu.:3.000 3rd Qu.:4.000 3rd Qu.:80 3rd Qu.:1.0000
## Max. :4.000 Max. :4.000 Max. :80 Max. :3.0000
## TotalWorkingYears TrainingTimesLastYear WorkLifeBalance YearsAtCompany
## Min. : 0.00 Min. :0.000 Min. :1.000 Min. : 0.000
## 1st Qu.: 6.00 1st Qu.:2.000 1st Qu.:2.000 1st Qu.: 3.000
## Median :10.00 Median :3.000 Median :3.000 Median : 5.000
## Mean :11.28 Mean :2.799 Mean :2.761 Mean : 7.008
## 3rd Qu.:15.00 3rd Qu.:3.000 3rd Qu.:3.000 3rd Qu.: 9.000
## Max. :40.00 Max. :6.000 Max. :4.000 Max. :40.000
## YearsInCurrentRole YearsSinceLastPromotion YearsWithCurrManager
## Min. : 0.000 Min. : 0.000 Min. : 0.000
## 1st Qu.: 2.000 1st Qu.: 0.000 1st Qu.: 2.000
## Median : 3.000 Median : 1.000 Median : 3.000
## Mean : 4.229 Mean : 2.188 Mean : 4.123
## 3rd Qu.: 7.000 3rd Qu.: 3.000 3rd Qu.: 7.000
## Max. :18.000 Max. :15.000 Max. :17.000
ggplot(Employee.Attrition.and.Factors, aes(x=Age, y=MonthlyIncome, color = Age)) + geom_point(size = 2) + labs(title= "Scatterplot", x="Age", y="Monthly Income", color = "Age")
Παρατηρήσεις:
ggplot(Employee.Attrition.and.Factors, aes(x=JobLevel, y=MonthlyIncome, fill=factor(JobLevel))) + geom_boxplot(outlier.color = "red", outlier.shape = 16) +labs(title= "Boxplot", x="Job Level ", y="Monthly Income")
Παρατηρήσεις:
ggplot(Employee.Attrition.and.Factors, aes(x=Age)) + geom_histogram(binwidth=0.9, fill="purple", color="black") + labs(title="Histogram", x="Age", y="Frequency")
Παρατηρήσεις
ggplot(Employee.Attrition.and.Factors, aes(EducationField)) + geom_bar(width=0.7, fill="purple", color="black") + labs(title="Bar chart", x="Education field", y="Frequency")
Παρατηρήσεις
Εκπαίδευση και εργασιακές τάσεις
Δημογραφικά Στοιχεία και Μισθοί
Εργασιακή Εμπειρία και Προαγωγές
Εργασιακή Απόσταση και Παράγοντες Παραίτησης