HR Analytics EDA

ggplot version available @ http://rpubs.com/pgp34301/hr_analytics_eda

LOADING DATA INTO R ENVIRONMENT

Number of rows & columns in the dataframe

## [1] 8995   16

Column names of the dataframe

##  [1] "DOJExtended"              "DurationToAcceptOffer"   
##  [3] "NoticePeriod"             "OfferedBand"             
##  [5] "PercentHikeExpectedInCTC" "PercentHikeOfferedInCTC" 
##  [7] "PercentDifferenceCTC"     "JoiningBonus"            
##  [9] "CandidateRelocateActual"  "Gender"                  
## [11] "CandidateSource"          "RexInYrs"                
## [13] "LOB"                      "Location"                
## [15] "Age"                      "Status"

Descriptive Statistics of the dataframe

DISCRETE DATA DISTRIBUTION

Percentage of the candidates (Joined / Not joined)

## Status
##    Joined NotJoined 
##      81.3      18.7

Percentage of the candidates Joined / Did Not Join the Company, Split by DOJ extended

##            Status
## DOJExtended Joined NotJoined
##         No   81.08     18.92
##         Yes  81.55     18.45

Percentage of Candidates Who (Joined / Did Not Join), split by Notice Period

##             Status
## NoticePeriod Joined NotJoined
##          0    93.44      6.56
##          30   85.17     14.83
##          45   75.48     24.52
##          60   73.22     26.78
##          75   68.18     31.82
##          90   66.19     33.81
##          120  52.38     47.62

Percentage of the Candidates Who (Joined / Did Not Join), split by Joining Bonus

##             Status
## JoiningBonus Joined NotJoined
##          No   81.34     18.66
##          Yes  80.58     19.42

Percentage of the Candidates Who (Joined / Did Not Join), Split by Gender

##         Status
## Gender   Joined NotJoined
##   Female  82.40     17.60
##   Male    81.07     18.93

Percentage of the Candidates Who (Joined / Did Not Join), Split by Candidate Source

##                    Status
## CandidateSource     Joined NotJoined
##   Agency             75.82     24.18
##   Direct             82.00     18.00
##   Employee Referral  88.00     12.00

Percentage of the Candidates Who (Joined / Did Not Join), Split by Offered Band

##            Status
## OfferedBand Joined NotJoined
##          E0  76.30     23.70
##          E1  81.30     18.70
##          E2  80.97     19.03
##          E3  85.15     14.85

Percentage of the Candidates Who (Joined / Did Not join), Split by Line of Business (LOB)

##             Status
## LOB          Joined NotJoined
##   AXON        77.46     22.54
##   BFSI        75.86     24.14
##   CSMP        81.52     18.48
##   EAS         73.41     26.59
##   ERS         78.11     21.89
##   ETS         83.07     16.93
##   Healthcare  82.26     17.74
##   INFRA       87.79     12.21
##   MMS        100.00      0.00

CONTINUOUS DATA DISTRIBUTION

Average Age of the Candidates (Joined/Did Not join)

Mean Plot for the Age, Split by Status

## Warning: package 'gplots' was built under R version 3.6.1

Average Age of the Candidates (Joined/Did Not join)

Mean Plot for Notice Period, Split by Status (Joined / Did Not join)

## Warning in arrows(x, li, x, pmax(y - gap, li), col = barcol, lwd = lwd, :
## zero-length arrow is of indeterminate angle and so skipped
## Warning in arrows(x, ui, x, pmin(y + gap, ui), col = barcol, lwd = lwd, :
## zero-length arrow is of indeterminate angle and so skipped

Average (Relevant Years of Experience) of the candidates (Joined / Not joined)

Average of DurationToAcceptOffer (Number of days taken by the candidate to accept the offer) of candidates (Joined / Not joined)

Average (Age, Relevant Years of Experience and Number of days taken by the candidate to accept the offer) of candidates (Joined / Not joined) by Gender (Male / Female)

CORRELATION

Correlation Matrix for all the Continuous Variable

##                         DurationToAcceptOffer NoticePeriod
## DurationToAcceptOffer                    1.00         0.36
## NoticePeriod                             0.36         1.00
## PercentHikeOfferedInCTC                  0.01        -0.01
## PercentDifferenceCTC                    -0.01        -0.02
## RexInYrs                                 0.11         0.18
## Age                                      0.02         0.00
##                         PercentHikeOfferedInCTC PercentDifferenceCTC
## DurationToAcceptOffer                      0.01                -0.01
## NoticePeriod                              -0.01                -0.02
## PercentHikeOfferedInCTC                    1.00                 0.60
## PercentDifferenceCTC                       0.60                 1.00
## RexInYrs                                  -0.11                 0.08
## Age                                       -0.08                 0.04
##                         RexInYrs   Age
## DurationToAcceptOffer       0.11  0.02
## NoticePeriod                0.18  0.00
## PercentHikeOfferedInCTC    -0.11 -0.08
## PercentDifferenceCTC        0.08  0.04
## RexInYrs                    1.00  0.57
## Age                         0.57  1.00
## 
## n= 8995 
## 
## 
## P
##                         DurationToAcceptOffer NoticePeriod
## DurationToAcceptOffer                         0.0000      
## NoticePeriod            0.0000                            
## PercentHikeOfferedInCTC 0.4883                0.2019      
## PercentDifferenceCTC    0.3730                0.1531      
## RexInYrs                0.0000                0.0000      
## Age                     0.0562                0.6376      
##                         PercentHikeOfferedInCTC PercentDifferenceCTC
## DurationToAcceptOffer   0.4883                  0.3730              
## NoticePeriod            0.2019                  0.1531              
## PercentHikeOfferedInCTC                         0.0000              
## PercentDifferenceCTC    0.0000                                      
## RexInYrs                0.0000                  0.0000              
## Age                     0.0000                  0.0003              
##                         RexInYrs Age   
## DurationToAcceptOffer   0.0000   0.0562
## NoticePeriod            0.0000   0.6376
## PercentHikeOfferedInCTC 0.0000   0.0000
## PercentDifferenceCTC    0.0000   0.0003
## RexInYrs                         0.0000
## Age                     0.0000

Plotting Correlation Matrix

## Warning: package 'PerformanceAnalytics' was built under R version 3.6.1
## Warning: package 'xts' was built under R version 3.6.1

Team #7 (Golden Dawn)

7/23/2019