Grouping NBA Players Utilizing Unsupervised Learning Algorithms

Evan Parker

2024-12-10

Agenda

Introduction

Variables

  • Rk: Rank, (player identification number)
  • Player: Player’s name
  • Pos: Position
  • Age: Player’s age
  • Tm: Team
  • G: Games played
  • GS: Games started
  • MP: Minutes played per game
  • FG: Field goals per game
  • FGA: Field goal attempts per game
  • FG%: Field goal percentage
  • 3P: 3-point field goals per game
  • 3PA: 3-point field goal attempts per game
  • 3P%: 3-point field goal percentage
  • 2P: 2-point field goals per game
  • 2PA: 2-point field goal attempts per game
  • 2P%: 2-point field goal percentage
  • eFG%: Effective field goal percentage
  • FT: Free throws per game
  • FTA: Free throw attempts per game
  • FT%: Free throw percentage
  • ORB: Offensive rebounds per game
  • DRB: Defensive rebounds per game
  • TRB: Total rebounds per game
  • AST: Assists per game
  • STL: Steals per game
  • BLK: Blocks per game
  • TOV: Turnovers per game
  • PF: Personal fouls per game
  • PTS: Points per game

Feature Engineering

Renaming of Variables
Original Name New Name
FG% FGP
3P% X3PP
2P% X2PP
eFG% eFGP
FT% FTP
3P X3P
3PA X3PA
2P X2P
2PA X2PA

Practical Questions to Address

K-Means Clustering Analysis

Clustering

PCA’s

LOF

Outlier Criteria
MP
PTS
FGP
X3PP
TRB
AST
STL
BLK
TOV

LOF Cutoff

Conclusions

Limitations