DATA 621 Blog 3: Predicting NBA Players’ Efficiency

David Quarshie

Intro

In my previous blogs I looked at how the different offensive and defensive stats play a role in a NBA player’s overall efficiency. Offensively, we now know that getting more points won’t necessarily raise your overall efficiency rate but increasing the field goal rates translates to a better efficiency score. Defensively, we saw that the actual stat numbers a player got did more for increasing efficiency. So now that we know which offensive and defensive stats will separately make a player a player more efficient, how do those stats work together? In today’s blog we’re going to see what a NBA player can improve to increase his overall efficiency.

Offense Review

For our offensive model we noted that percentages help improve efficiency over total stats. Looking at the summary of the model we see that we can keep the stats true shooting precentage, field goal precentage, effective field goal percentage, and points.

## 
## Call:
## lm(formula = PER ~ ., data = stats_off)
## 
## Residuals:
##      Min       1Q   Median       3Q      Max 
## -11.1462  -1.7146  -0.2006   1.4715  11.7290 
## 
## Coefficients:
##               Estimate Std. Error t value Pr(>|t|)    
## (Intercept)  -8.444752   1.016989  -8.304 8.68e-16 ***
## TS.          45.844088   5.275127   8.691  < 2e-16 ***
## FG.          51.700570   4.268011  12.114  < 2e-16 ***
## X3P.          2.605305   1.213459   2.147   0.0323 *  
## X2P.        -11.067119   2.433360  -4.548 6.73e-06 ***
## eFG.        -44.764416   5.812190  -7.702 6.76e-14 ***
## FT.          -1.735443   1.090412  -1.592   0.1121    
## PTS           0.004940   0.000263  18.785  < 2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 2.59 on 523 degrees of freedom
##   (64 observations deleted due to missingness)
## Multiple R-squared:  0.7588, Adjusted R-squared:  0.7556 
## F-statistic: 235.1 on 7 and 523 DF,  p-value: < 2.2e-16

Defense Review

For the defensive model we see that the number of offensive and defensive rebounds, along with the number of steals a player gets are very signficant in determining their efficiency.

## 
## Call:
## lm(formula = PER ~ ., data = stats_def)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -41.808  -2.495   0.068   2.460  18.020 
## 
## Coefficients: (1 not defined because of singularities)
##              Estimate Std. Error t value Pr(>|t|)    
## (Intercept)  6.507567   0.590925  11.013  < 2e-16 ***
## ORB.        -0.613222   0.384177  -1.596 0.110985    
## DRB.        -1.264917   0.419477  -3.015 0.002677 ** 
## TRB.         2.353034   0.789708   2.980 0.003006 ** 
## STL.         0.416348   0.219731   1.895 0.058610 .  
## BLK.         0.358059   0.138297   2.589 0.009863 ** 
## ORB         -0.032064   0.009144  -3.507 0.000488 ***
## DRB          0.024923   0.004078   6.112  1.8e-09 ***
## TRB                NA         NA      NA       NA    
## STL          0.039041   0.010946   3.567 0.000391 ***
## BLK         -0.005164   0.013753  -0.375 0.707427    
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 4.699 on 585 degrees of freedom
## Multiple R-squared:  0.4043, Adjusted R-squared:  0.3952 
## F-statistic: 44.12 on 9 and 585 DF,  p-value: < 2.2e-16

Overall Model

Now that we know which offensive and defensive stats we effect efficiency the most we can make an overall model using them.

## 
## Call:
## lm(formula = PER ~ ., data = stats_final)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -13.372  -1.821  -0.330   1.553  13.960 
## 
## Coefficients:
##               Estimate Std. Error t value Pr(>|t|)    
## (Intercept) -7.627e+00  7.374e-01 -10.342  < 2e-16 ***
## TS.          3.163e+01  3.635e+00   8.700  < 2e-16 ***
## FG.          3.439e+01  3.700e+00   9.295  < 2e-16 ***
## eFG.        -2.805e+01  4.759e+00  -5.893 6.39e-09 ***
## PTS          5.884e-03  5.249e-04  11.211  < 2e-16 ***
## ORB          5.466e-05  5.200e-03   0.011    0.992    
## DRB         -4.162e-04  2.471e-03  -0.168    0.866    
## STL         -7.837e-03  6.621e-03  -1.184    0.237    
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 3.021 on 585 degrees of freedom
##   (2 observations deleted due to missingness)
## Multiple R-squared:  0.7245, Adjusted R-squared:  0.7212 
## F-statistic: 219.7 on 7 and 585 DF,  p-value: < 2.2e-16

Using R’s lm feature we once again get a model for getting efficiency (PER). But this time our results show us that the defensive stats are not that significant in determining PER. However, points, a stat that we knew was significant but had a low intercept, showing that getting more of it would raise PER by much; is now still significant but has an even lower intercept. Other offensive stats like true shooting percentage and field goal percentage are also still signficant but their influence on PER, shown by their intercepts, are extremely lower than when we only looked at offensive stats. For example, true shooting percentage’s intercept decreased 31% when comparing the overall model to the offense only model.

So, when we use the stats we picked to determine PER we see that defense doesn’t really matter. Our model shows us that if you focus on taking and making good shots you’ll be more efficent. It may be terrible for defensive minded players to hear but scoring would most likely make you a better player.