Machine Learning 1 Final Project: NBA General Manager Trade Simulation

Members: Eddie, Chase, Adam, Neel, Timothy, Jack, Harrison

Overview

Using the models covered this semester, we analyze NBA player data from the 2024–2025 seasons to answer a set of research questions. We clean and transform the data, explore it with descriptive statistics and visualizations, and build a predictive model suited to each task. Finally, we deploy a Streamlit app that presents our findings interactively.

Research Questions & Objectives

Can we accurately predict player salary, all-star nominations, and other accomplishment features?
Can we classify whether a player will be an all-star using season statistics?
Can we cluster players based on performance metrics and valuation to identify archetypes or undervalued players?
Can we classify players into different salary tiers using per-game performance metrics?
Build a trade analysis model based on projected evaluated salaries and other evaluative metrics.
Predict next season’s win/loss record based on current roster and player statistics.
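The salary-tier question needs a tier label for each player before any classifier can be trained. The sketch below shows one possible way to derive those labels by cutting salaries at quantiles. The function name, the choice of four tiers, and the example salaries are all hypothetical and are not taken from the project code.

```python
from bisect import bisect_right
from statistics import quantiles


def salary_tier_labels(salaries, n_tiers=4):
    """Return a tier index (0 = lowest paid) for each salary.

    Assumption: tiers are equal-sized quantile buckets. The project may
    define its tiers differently.
    """
    # quantiles() returns n_tiers - 1 cut points that split the data
    # into n_tiers groups of roughly equal size.
    cut_points = quantiles(salaries, n=n_tiers)
    # A salary's tier is the number of cut points at or below it.
    return [bisect_right(cut_points, salary) for salary in salaries]


# Hypothetical salaries in millions of dollars (not real data).
example_salaries = [1, 2, 3, 4, 5, 6, 7, 8]
example_tiers = salary_tier_labels(example_salaries)
print(list(zip(example_salaries, example_tiers)))
```

The resulting labels could then serve as the target column for a classifier such as KNN or logistic regression, with per-game statistics as the features.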

Models

Multiple Linear Regression (Polynomial extensions optional)
Logistic Regression
K-Nearest Neighbors (KNN)
K-Means Clustering
- Clustering players into performance archetypes
- Clustering by valuation to identify overvalued/undervalued players
Principal Component Analysis (PCA)
MLP Neural Network — Trade Analysis
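As an illustration of the archetype clustering listed above, here is a minimal sketch using scikit-learn's KMeans on standardized features. It assumes scikit-learn and NumPy are installed. The three feature columns, the number of clusters, and the synthetic data are assumptions made for the example and do not reflect the contents of k_means_model.py.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def cluster_players(features, n_clusters=3, seed=0):
    """Standardize the features, then assign each row to a K-Means cluster."""
    model = make_pipeline(
        # Scaling keeps high-magnitude stats (e.g. points) from dominating
        # the Euclidean distances that K-Means relies on.
        StandardScaler(),
        KMeans(n_clusters=n_clusters, n_init=10, random_state=seed),
    )
    return model.fit_predict(features)


# Synthetic per-game stats (points, rebounds, assists); not real player data.
rng = np.random.default_rng(0)
scorers = rng.normal([25, 5, 5], 1.0, size=(20, 3))
bigs = rng.normal([10, 12, 2], 1.0, size=(20, 3))
guards = rng.normal([12, 3, 9], 1.0, size=(20, 3))
X = np.vstack([scorers, bigs, guards])

labels = cluster_players(X)
print(labels)
```

Cluster indices are arbitrary, so the archetype behind each cluster has to be read from its centroid or its member players rather than from the label number.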

App Structure

Page 1: README
Page 2: Interactive data table
Page 3: Exploratory Data Analysis (EDA)
Page 4: Statistical model pages

Instructions for Viewers — How to Run

Create the Conda environment: conda env create -f environment.yml
Activate the environment: conda activate nba_ml_project
Run the data processing scripts: python scrape_salaries.py, then python get_clean_data.py
Run the Streamlit app: streamlit run nba_model_app.py
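The steps above can also be scripted. The sketch below is a hypothetical helper, not part of the repository. It uses conda run in place of conda activate, because an activation performed in one subprocess does not carry over to the next. The script and environment names are the ones given in the steps above.

```python
import subprocess

# Environment name taken from the activation step above.
ENV_NAME = "nba_ml_project"


def build_commands(env_name=ENV_NAME):
    """Return the setup and launch commands in the order they should run."""
    run_in_env = ["conda", "run", "-n", env_name]
    return [
        ["conda", "env", "create", "-f", "environment.yml"],
        run_in_env + ["python", "scrape_salaries.py"],
        run_in_env + ["python", "get_clean_data.py"],
        # --no-capture-output lets Streamlit's log stream to the terminal.
        run_in_env + ["--no-capture-output", "streamlit", "run", "nba_model_app.py"],
    ]


def run_all(commands):
    """Run each command in turn, stopping at the first failure."""
    for command in commands:
        print("Running:", " ".join(command))
        subprocess.run(command, check=True)
```

Calling run_all(build_commands()) from the repository root would execute the four steps in sequence. The environment creation step fails if the environment already exists, in which case it can be dropped from the list.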

Data Sources

NBA API: https://github.com/swar/nba_api
ESPN Salary Data: https://www.espn.com/nba/salaries
2012–2023 NBA Stats.csv

Link to app dashboard: https://ml1project-nba.streamlit.app/

Link to presentation slides: https://docs.google.com/presentation/d/19ICyufQWbHA0C849dZXp_vIfJSb9GIW_mdxjlFuhz7A/edit?usp=sharing

