Comparing NBA Player Statistics from 2009-2019

William Wallace

Introduction

Today I will be comparing statistics between players in the NBA seasons from 2009-2019. I want to see what statistics are necessary for an MVP candidate, and ultimately figure out what stats a player would need to achieve to get MVP.

Details of the dataset

The data for this analysis is derived from the NBA seasons 2009-2019. The data contains the 22 variables described and consists of 4,847 rows. Each row of the data represents a player’s statistics for a season in the NBA, and contains information like points per game, games played, what season it was, etc. A summary of the data follows.

Variable Description
player_name The name of the player
team_abbreviation The abbreviation of the team the player played for in their respective season
age The age of the player during that season
player_height The player’s height (in centimeters)
player_weight The player’s weight (in kilograms)
college The college the player went to
country The country the player is from
draft_year The year the player was drafted
draft_round The round the player was drafted
draft_number The number in which the player was picked in the round
gp Number of games played throughout the season
pts Average number of points scored per game
reb Average number of rebounds per game
ast Average number of assists per game
net_rating Team’s point differential per 100 possessions while the player is on the court
oreb_pct Percentage of available offensive rebounds the player grabbed while he was on the floor
dreb_pct Percentage of available defensive rebounds the player grabbed while he was on the floor
usg_pct Percentage of team plays used by the player while he was on the floor (FGA + Possession Ending FTA + TO) / POSS)
ts_pct Measure of the player’s shooting efficiency that takes into account free throws, 2 and 3 point shots
ast_pct Percentage of teammate field goals the player assisted while he was on the floor
season The NBA season the statistic relate to
mvp Indicates whether the player won MVP. 1 = won MVP, 0 = did not win MVP

Available data for the NBA Dataset

This searchable table contains all the data available for the NBA players from 2009-2019

To summarize what an MVP looks like, here are the MVP winners for the last 10 seasons

player_name team_abbreviation age gp pts reb ast net_rating usg_pct ts_pct season mvp
LeBron James CLE 25 76 29.7 7.3 8.6 10.8 0.333 0.604 2009-10 1
Derrick Rose CHI 22 81 25.0 4.1 7.7 8.3 0.319 0.550 2010-11 1
LeBron James MIA 27 62 27.1 7.9 6.2 10.7 0.317 0.605 2011-12 1
LeBron James MIA 28 76 26.8 8.0 7.3 14.1 0.298 0.640 2012-13 1
Kevin Durant OKC 25 81 32.0 7.4 5.5 8.0 0.327 0.635 2013-14 1
Stephen Curry GSW 27 80 23.8 4.3 7.7 17.0 0.283 0.638 2014-15 1
Stephen Curry GSW 28 79 30.1 5.4 6.7 18.3 0.320 0.669 2015-16 1
Russell Westbrook OKC 28 81 31.6 10.7 10.4 3.3 0.408 0.554 2016-17 1
James Harden HOU 28 72 30.4 5.4 8.8 10.0 0.353 0.619 2017-18 1
Giannis Antetokounmpo MIL 24 72 27.7 12.5 5.9 12.5 0.314 0.644 2018-19 1

The average statistics for MVPs

To get a better idea instead of multiple instances, I decided to average all the statistics to show on average what the MVP numbers put up every year

Average Age Average Games Played Average Points per Game Average Rebounds per Game Average Assists per Game Average Net Rating Average Usage Percent Average True Shooting Percentage MVP
26.2 76 28.42 7.3 7.48 11.3 0.3272 0.6158 1

How the main statistics look over the years

It is safe to say that the statistics people mainly look at are points per game, rebounds per game, and assists per game. I thought it would be useful to show how these statistics have varied over the past 10 years.

Points per Game over the years

Rebounds per Game over the years

Assists per Game over the years

It is interesting to see the variability of these stats, but with these graphs, you get an idea of what it takes to be in MVP in the NBA. The most variable is rebounds per game. I would assume because guards don’t get as many rebounds as forwards or centers, so the low rebound per game years were most likely mvps won by guards (like Stephen Curry).

Seeing the public opinion on the Top 3 NBA MVP prospects of 2022

The top 3 prospects of the NBA 2022 MVP race are Nikola Jokic, Joel Embiid, and Giannis Antetokounmpo. I wanted to see what twitter was saying about them:

Here are common words in tweets for the top 3 NBA MVP prospects

All the common words among them make sense. They are all top player in the playoffs right now, so you see some similarities in their common words other than the teams they are playing on and against.

Sentinments for the Top 3 MVP Prospects

I want to use sentiments to see how the public opinion of twitter varies from player to player

Looking at these sentiments, I would say Nikola Jokic is favored the most. It is important to know that Embiid recently suffered an injury, so many of the negative sentiments could come in to play there. It is also worth noting that Embiid had the most tweets about him. Embiid had 1000 observations, while Jokic had 847 and Giannis had 978 respectively. Even though taking in all those factors, in my opinion, Nikola Jokic is the most favored by the public opinion of Twitter.

Prescriptive Analysis

Here, I want to see what are the most important statistics in terms of winning the MVP award as an NBA player. I have a general idea based on the stats and graphs above, but I want to see what a regression model would describe about it.

The formula I will use for this regression is: \[MVP = \alpha_i + age_i + gp_i + pts_i + reb_i + ast_i + netrating_i + usgpct_i + tspct_i \]

## 
## Call:
## lm(formula = mvp ~ I(age) + I(gp) + I(pts) + I(reb) + I(ast) + 
##     I(net_rating) + I(usg_pct) + I(ts_pct), data = nba_mvp_analysis)
## 
## Residuals:
##      Min       1Q   Median       3Q      Max 
## -0.04182 -0.00555  0.00019  0.00359  0.97069 
## 
## Coefficients:
##                 Estimate Std. Error t value Pr(>|t|)    
## (Intercept)   -6.415e-05  6.463e-03  -0.010 0.992080    
## I(age)        -1.590e-04  1.538e-04  -1.034 0.301226    
## I(gp)         -1.189e-04  3.376e-05  -3.523 0.000431 ***
## I(pts)         1.147e-03  2.548e-04   4.503 6.85e-06 ***
## I(reb)        -1.991e-04  3.633e-04  -0.548 0.583676    
## I(ast)         1.815e-03  5.086e-04   3.569 0.000362 ***
## I(net_rating)  9.646e-05  5.785e-05   1.667 0.095495 .  
## I(usg_pct)     3.057e-03  1.790e-02   0.171 0.864414    
## I(ts_pct)      3.851e-04  7.366e-03   0.052 0.958311    
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.04469 on 4838 degrees of freedom
## Multiple R-squared:  0.03179,    Adjusted R-squared:  0.03019 
## F-statistic: 19.86 on 8 and 4838 DF,  p-value: < 2.2e-16

Takeaways

With the results, it shows 3 significant variables: gp (games played), pts (points per game), and ast (assist per game). These all make sense, and here is why: For games played, you need to play most of a season as an NBA player to put together a respectable season to be considered as an MVP. The least amount of games ever played by an NBA MVP is 49, but usually you need to play at least 70 games (and that is on the low end). If you haven’t played many games, you won’t be considered for an NBA MVP. For points per game in the last 10 seasons of this dataset, the least amount of PPG by an NBA player who won MVP was 25. It is probably one of the highest volume and most important statistics in the NBA, shown by it’s INSANELY small p value. For assists per game, this is a statistic that shows you are contributing to your team making baskets. In the last 10 seasons of the dataset, the smallest assists per game was about 6, which is high in terms of NBA standards. To put that in perspective, in the most recent NBA season, there were only 16 players who average over 6 assists per game. Summing up these statistics, to be an NBA MVP, you need to be around a top 10 scorer, while also being a top 15 contributor in the league. Not to mention that you also have to play almost the whole season without getting injured (which is difficult when the NBA season is 82 games long).

Some things about the analysis that I’m surprised about

Age wasn’t as significant as I thought it would be. The range of MVP winners is around 23-29; a pretty small window to go for an MVP. an MVP player has only a couple seasons in their prime to achieve this goal. The P value is lower than some of the other non-significant values, but it’s not significant. I am also surprised that the P value for usage percent is so high, usually every player who wins MVP is heavily used on their team, but I guess if you think about it, there is also probably 29 other players in the NBA who were highely used but did not win MVP.

Some things about the analysis that are about where I expected them to be

Many might be surprised that rebounds isn’t significant when it is one of the main statlines in the NBA. For example if someone said LeBron had 27/7/7, that means he had 27 points, 7 rebounds, and 7 assists. This didn’t turn out significant because the rebounding stat is generally more positional. Forwards or Centers that win the MVP will usually have a high amount of rebounds (10+), while guards, like Stephen Curry, won’t have a large rebound count. No one is really talking about how many rebounds Steph Curry got. Lastly, Net rating having a low p value but not quite being significant doesn’t suprise me either. It is almost significant because you will not be selected for the MVP if you are not on a winning team, but it is a statistic that can highly vary among players, as the best player on the best team doesn’t always win MVP.

For Fun

Since I did sentiment analysis on the top 3 MVP prospects this year, I thought I could try and use the knowledge I got from this assignment to predict this year’s MVP

Player G MP FG FGA FG. X3P X3PA X3P. X2P X2PA X2P. eFG. FT FTA FT. ORB DRB TRB AST STL BLK TOV PF PTS X
Nikola Jokic 74 33.5 10.3 17.7 0.583 1.3 3.9 0.337 9.0 13.8 0.652 0.620 5.1 6.3 0.810 2.8 11.0 13.8 7.9 1.5 0.9 3.8 2.6 27.1 NA
Giannis Antetokounmpo 67 32.9 10.3 18.6 0.553 1.1 3.6 0.293 9.2 15.0 0.616 0.582 8.3 11.4 0.722 2.0 9.6 11.6 5.8 1.1 1.4 3.3 3.2 29.9 NA
Joel Embiid 68 33.8 9.8 19.6 0.499 1.4 3.7 0.371 8.4 15.9 0.529 0.534 9.6 11.8 0.814 2.1 9.6 11.7 4.2 1.1 1.5 3.1 2.7 30.6 NA
Chris Paul 65 32.9 5.6 11.3 0.493 1.0 3.1 0.317 4.6 8.3 0.559 0.536 2.6 3.1 0.837 0.3 4.0 4.4 10.8 1.9 0.3 2.4 2.1 14.7 NA
Luka Doncic 65 35.4 9.9 21.6 0.457 3.1 8.8 0.353 6.8 12.8 0.528 0.529 5.6 7.5 0.744 0.9 8.3 9.1 8.7 1.2 0.6 4.5 2.2 28.4 NA
James Harden 65 37.2 6.3 15.3 0.410 2.3 6.9 0.330 4.0 8.4 0.476 0.485 7.2 8.2 0.877 0.8 6.8 7.7 10.3 1.3 0.6 4.4 2.4 22.0 NA
Devin Booker 68 34.5 9.7 20.9 0.466 2.7 7.0 0.383 7.0 13.9 0.508 0.530 4.6 5.3 0.868 0.7 4.4 5.0 4.8 1.1 0.4 2.4 2.6 26.8 NA
Rudy Gobert 66 32.1 5.5 7.7 0.713 0.0 0.1 0.000 5.5 7.6 0.718 0.713 4.6 6.7 0.690 3.7 11.0 14.7 1.1 0.7 2.1 1.8 2.7 15.6 NA
Trae Young 76 34.9 9.4 20.3 0.460 3.1 8.0 0.382 6.3 12.3 0.512 0.536 6.6 7.3 0.904 0.7 3.1 3.7 9.7 0.9 0.1 4.0 1.7 28.4 NA
Jayson Tatum 76 35.9 9.3 20.6 0.453 3.0 8.6 0.353 6.3 12.0 0.524 0.526 5.3 6.2 0.853 1.1 6.9 8.0 4.4 1.0 0.6 2.9 2.3 26.9 NA

Here are the top 10 MVP candidates for the most recent season, but really I am just looking at the top 3. I got this data from Basketball Reference. To be specific, here is the exact link: https://www.basketball-reference.com/friv/mvp.html

Looking at the top 3, my pick is Nikola Jokic. He is behind bot Giannis and Embiid in points per game which was the most significant variable from the analysis, but he almost doubles the other 2 candidates in assits per game. He is scoring and contributing to the whole team. I think here it is also fair to take into account rebounds per game, because the top 3 candidates are all bigs. Jokic and Embiid are centers, while Giannis is a power forward. With that taken into account, in my opinion, Nikola Jokic blows the others out of the water. Not to mention he is having one of the, if not the most, efficient seasons of all time (Wasn’t covered as much in this project but thought it was still worth mentioning).

Conclusion

I thought this project was really enjoyable, and I got exactly what I wanted from it. I asked myself what are the key statistics to winning an NBA MVP. The analysis told me that points per game, assists per game, and games played were the most significant; along with honorable mentions net rating and age. I found some surprises and answered my question. I even used what I had learned with this project to pick who I thought was best fit for this years MVP, Nikola Jokic. I also got to take a trip down memory lane and see what past MVP statistics were, and what an average MVP’s statistics may look like.