Final Project

Author

Billy Jackson

Impacts on Production and Individual Impact on a Game in The NBA

Basketball is a sport where a player who is 6’2 can have just as much impact on a game as a player who is 7’0. Height is important, but what else contributes to success on an individuals impact on a game, specifically on the offensive side of the ball, but overall as well? This is what I will be answering in this analysis. Figuring out what contributes most to offensive production and overall impact (+/-) can give teams an advantage when they are signing/drafting players. Answering these questions can help teams when they are forming their strategies in terms of team building, in-game strategy, and the true impact a player can have upon their team. A player can be known as a high point scorer, but if you look deeper, they take a lot of shots to score such high points, and also have an overall negative impact when on the floor. I will not answer all of these questions, but this analysis will show a few of the impacts that answering these types of questions can have.

Questions to be Answered

The main questions that I will focus on in this analysis are as follows: Does a players position impact their overall production? (PTS + REB + AST)? Does age make any difference in offensive production (PTS & AST)? Does number of shots attempted relate to a positive or negative +/-? Does high PPG translate to wins for that players team? Is there a trade-off between FG% and shot volume (FGA)?

Introduction to Scraped Data Set

I am going to be performing an analysis on seasonal NBA player data, provided by scraping the Basketball Reference website. The first data set records information about each player in the NBA from 2023-2025, along with that players individual average statistics from each season they have played in that time period. Each row is a player (there are instances of a player getting traded and their statistics from each team they were on that season being totaled * Team label is 2TM) and a season and each column is a statistic or descriptive information specific to each player (ex: awards won that year, age, team played for in that season). The statistics and information are explained below in the data dictionary. You can learn more about this data at this link: NBA 2023-2025 Player Statistics.

Data Dictionary

Variable Description
Rank Player’s ranking in MVP (Most Valuable Player) voting for that season.
Player Full name of the NBA player.
Age Player’s age during the season.
Team Abbreviation of the team the player played for.
Pos Player’s primary position (e.g., PG, SG, SF, PF, C).
G Games played during the season.
GS Games started during the season.
MP Total minutes played in the season.
FG Field goals made.
FGA Field goals attempted.
FG% Field goal percentage (FG / FGA).
3P 3-point field goals made.
3PA 3-point field goals attempted.
3P% 3-point shooting percentage (3P / 3PA).
2P 2-point field goals made.
2PA 2-point field goals attempted.
2P% 2-point field goal percentage (2P / 2PA).
eFG% Effective field goal percentage ((FG + 0.5 * 3P) / FGA).
FT Free throws made.
FTA Free throws attempted.
FT% Free throw percentage (FT / FTA).
ORB Offensive rebounds.
DRB Defensive rebounds.
TRB Total rebounds (ORB + DRB).
AST Assists.
STL Steals.
BLK Blocks.
TOV Turnovers.
PF Personal fouls.
PTS Total points scored.
Awards Honors received (e.g., All-NBA, DPOY).
Season Season represented (e.g., “2022”)

Summary Statistics

      PTS              AST              TRB               MP       
 Min.   : 0.000   Min.   : 0.000   Min.   : 0.000   Min.   : 0.50  
 1st Qu.: 3.700   1st Qu.: 0.700   1st Qu.: 1.700   1st Qu.:11.40  
 Median : 6.800   Median : 1.400   Median : 3.000   Median :18.50  
 Mean   : 8.472   Mean   : 2.002   Mean   : 3.403   Mean   :18.89  
 3rd Qu.:11.400   3rd Qu.: 2.700   3rd Qu.: 4.500   3rd Qu.:26.80  
 Max.   :34.700   Max.   :11.600   Max.   :13.900   Max.   :41.00  
 NA's   :3        NA's   :3        NA's   :3        NA's   :3      
      FG%              3P%              FT%              TOV       
 Min.   :0.0000   Min.   :0.0000   Min.   :0.0000   Min.   :0.000  
 1st Qu.:0.4080   1st Qu.:0.2860   1st Qu.:0.6853   1st Qu.:0.500  
 Median :0.4500   Median :0.3450   Median :0.7690   Median :0.800  
 Mean   :0.4544   Mean   :0.3207   Mean   :0.7498   Mean   :1.026  
 3rd Qu.:0.5000   3rd Qu.:0.3860   3rd Qu.:0.8390   3rd Qu.:1.400  
 Max.   :1.0000   Max.   :1.0000   Max.   :1.0000   Max.   :4.700  
 NA's   :15       NA's   :115      NA's   :138      NA's   :3      
      STL              BLK        
 Min.   :0.0000   Min.   :0.0000  
 1st Qu.:0.3000   1st Qu.:0.1000  
 Median :0.6000   Median :0.3000  
 Mean   :0.6138   Mean   :0.3808  
 3rd Qu.:0.9000   3rd Qu.:0.5000  
 Max.   :3.0000   Max.   :3.8000  
 NA's   :3        NA's   :3       

Introduction to Nejc Zavodnik’s Kaggle Data Set

I also utilized a data set found on Kaggle that provided similar statistics to the above data set, along with some additional statistics I believe are useful to answer the questions that I am posing/answering. The data is statistics/information, with each instance being a player from the 2024-2025 NBA Regular Season, with statistics being a season average. The statistics and information are explained below in the data dictionary. You can find the link to the Kaggle data set at this link: Kaggle NBA 2024-2025 Player Data

Data Dictionary

Variable Description
PPGRNK Player’s rank in points per game (PPG) among all players.
Player Full name of the NBA player.
Team Abbreviation of the NBA team the player was on.
Age Player’s age during the season.
GP Games played.
W Number of games won by the player’s team.
L Number of games lost by the player’s team.
Min Total minutes played.
PTS Total points scored.
FGM Field goals made.
FGA Field goals attempted.
FG% Field goal percentage (FGM / FGA).
3PM 3-point field goals made.
3PA 3-point field goals attempted.
3P% 3-point field goal percentage (3PM / 3PA).
FTM Free throws made.
FTA Free throws attempted.
FT% Free throw percentage (FTM / FTA).
OREB Offensive rebounds.
DREB Defensive rebounds.
REB Total rebounds (OREB + DREB).
AST Assists.
TOV Turnovers.
STL Steals.
BLK Blocks.
PF Personal fouls.
FP Fantasy points (a calculated score based on performance metrics).
DD2 Number of double-doubles (10+ in two statistical categories).
TD3 Number of triple-doubles (10+ in three statistical categories).
+/- Plus-minus: point differential when the player is on the court.

Summary Statistics

      PTS              AST              REB              FG%        
 Min.   : 0.000   Min.   : 0.000   Min.   : 0.000   Min.   :  0.00  
 1st Qu.: 4.000   1st Qu.: 0.800   1st Qu.: 1.800   1st Qu.: 40.60  
 Median : 7.200   Median : 1.500   Median : 3.200   Median : 44.80  
 Mean   : 8.887   Mean   : 2.085   Mean   : 3.596   Mean   : 44.61  
 3rd Qu.:12.100   3rd Qu.: 2.700   3rd Qu.: 4.800   3rd Qu.: 49.60  
 Max.   :32.700   Max.   :11.600   Max.   :13.900   Max.   :100.00  
      3P%             FT%              TOV              PF       
 Min.   :  0.0   Min.   :  0.00   Min.   :0.000   Min.   :0.000  
 1st Qu.: 26.7   1st Qu.: 67.90   1st Qu.:0.500   1st Qu.:1.000  
 Median : 33.8   Median : 76.30   Median :0.900   Median :1.600  
 Mean   : 29.9   Mean   : 72.02   Mean   :1.089   Mean   :1.544  
 3rd Qu.: 37.7   3rd Qu.: 82.90   3rd Qu.:1.500   3rd Qu.:2.100  
 Max.   :100.0   Max.   :100.00   Max.   :4.700   Max.   :3.500  

Impact of Age on Offensive Production (2023- 2025 PTS & AST)

This scatter plot, which examines the relationship between age and offensive production through average points (PTS) and assists (AST) for players aged 19 to 42, clearly shows that age significantly impacts offensive performance. Players in their early to mid-20s demonstrate the highest production. Most players average 2–4 assists and 5–15 points, but a noticeable decline begins after age 30, where players typically score below 10 points and 2 assists, and the oldest players (39–42) drop to around 5 points and 0–1 assists. This indicates that younger players, likely in their physical prime, consistently produce more offensively, while aging leads to a steady decrease in both points and assists, confirming that age does indeed make a significant difference in offensive production.

Relationship between FGA on +/- (2024/2025)

We see that most players cluster around moderate FGA and near-zero +/- values. However, there’s a mild trend suggesting that players with higher shot volumes tend to have higher plus/minus ratings, indicating that high-usage players often contribute positively to team performance — but it’s not a strong universal rule, with the data revealing that FGA alone doesn’t predict +/- consistently, as there are multiple points in both positive and negative ranges at various shot attempt levels.

Does high PPG translate to wins for that player’s team?

This boxplot shows a clear trend that players who score more points per game are generally on teams with more wins. As PPG increases, the median number of wins rises across each scoring tier, with the highest-scoring players (25+ PPG) typically associated with the most successful teams. While many factors influence team performance, the data suggests a positive relationship between individual scoring and team success. Disregard the NA result.

Position Impact on Overall Production? (PTS + REB + AST)

This graph indicates that point-guards have the most overall impact on their teams (in terms of avg. PTS + REB + AST), with centers second, followed by power-forwards, small forwards and then shooting-guards. Point-guards typically have the ball in their hands at the highest rate, with centers having more opportunities to get rebounds (due to height advantage and usual approximation to the backboard). These are two of the most important positions in the sport, and the positions that to succeed, it is important to have someone with high production in that spot.

Trade-off between FG% and shot volume (FGA) (2024-2025)

The trend line in the above point graph slopes downward, indicating a negative correlation between FGA and FG%. As the number of field goal attempts (FGA) increases, the field goal percentage (FG%) tends to decrease. This suggests a trade-off: players who take more shots (higher FGA) generally have a lower shooting accuracy (lower FG%). The spread of data points around the trend line shows some variability, but the overall pattern supports the existence of a trade-off between shot volume and shooting percentage.

Conclusion

Overall, the visual analyses reveal key patterns in player performance: younger players (especially in their 20s) drive higher offensive output, while aging reduces production; players who take more shots tend to help their teams more, but high shot volume doesn’t always guarantee a better plus/minus; higher individual scoring generally correlates with more team wins; point guards and centers contribute the most all-around value to their teams; and lastly, there’s a noticeable trade-off between shooting volume and efficiency, with higher field goal attempts linked to lower accuracy. These findings underscore how player role, age, and style of play influence both individual and team success.