Player Efficiency Prediction Model

This project applies data mining and machine learning to predict Player Efficiency Rating (PER) using advanced NBA statistics. Built using Python, the model draws from 11 seasons of NBA data (2014–2024) and was designed with a focus on predictive accuracy, feature analysis, and real-world usability.

Overview

Goal: Predict a player's PER based on 18 statistical features
Method: Linear Regression (with cross-validation and model comparison)
Dataset: 5,792 NBA player-seasons, filtered to 4,490 after preprocessing
Outcome: Reliable model (R² = 0.9542) with a real-time prediction tool

File Execution Order

The code is meant to be run in the following order:

split.py – Prepares the normalized dataset
modelTraining.py – Trains the Linear Regression model
crossValidation.py – Evaluates performance using 5-fold validation
featureImportance.py – Displays most influential stats
modelTesting.py – Tests the model on holdout data
modelComparison.py – Compares performance of multiple models
predictiveTool.py – Interactive prediction tool using player inputs

Model Performance

R²: 0.9542
MAE: 0.0283
RMSE: 0.0363
Avg Difference: ±1.81 PER points (based on test players)

Key Features Used

Minutes Played (MP), Field Goals (FG, FGA), Free Throws (FT, FTA)
Rebounds (TRB), Assists (AST), Points (PTS)
Turnovers (PTOV), Fouls Drawn (SFD), Assists Generated (PGA), And1s
Advanced metrics like TS%, USG%, WS, BPM, VORP, ORtg

All features are scaled using min-max normalization for model compatibility.

Dependencies

pandas, numpy, scikit-learn, matplotlib, seaborn, joblib

How to Use

Download the transformed CSV dataset
Copy its file path and paste it into split.py
Run split.py and modelTraining.py first (this step saves the model)
Then run predictiveTool.py to generate PER predictions from user input

The predictive tool includes a built-in test set of 28 players (1997–2024) to verify model accuracy.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Player Efficiency Prediction Model

Overview

File Execution Order

Model Performance

Key Features Used

Dependencies

How to Use

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
ProjectReport.pdf		ProjectReport.pdf
README.md		README.md
Transformed.csv		Transformed.csv
crossValidation.py		crossValidation.py
featureImportance.py		featureImportance.py
modelComparison.py		modelComparison.py
modelTesting.py		modelTesting.py
modelTraining.py		modelTraining.py
predictiveTool.py		predictiveTool.py
split.py		split.py

Folders and files

Latest commit

History

Repository files navigation

Player Efficiency Prediction Model

Overview

File Execution Order

Model Performance

Key Features Used

Dependencies

How to Use

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages