Customer Churn Prediction Using Machine Learning

This project aims to predict customers who are likely to churn using a Machine Learning approach. This project is suitable for demonstrating Data Scientist skills in processing customer data, building a classification model, evaluating model performance, and providing data-driven business recommendations.

Business Problem

The company needs to identify which customers are at risk of stopping the use of its services. By predicting churn, the business team can take early retention actions, such as personalized campaigns, loyalty promotions, service follow-ups, or customer experience improvements.

Dataset

The dataset in the data/ folder is synthetic sample data created for portfolio purposes. The data structure resembles customer analytics data and can be replaced with real company data if available.

Main features:

tenure_month
contract_type
internet_service
monthly_charges
total_charges
usage_gb
support_tickets
late_payments
churn

Methodology

Data understanding
Data cleaning
Exploratory Data Analysis
Feature engineering
Train-test split
Model training
Model evaluation
Business recommendation

Model

Main model used:

Random Forest Classifier

Comparison models that can be added:

Logistic Regression
Decision Tree
Gradient Boosting
XGBoost

Evaluation Result

Baseline result on synthetic data:

Metric	Score
Accuracy	0.66
Precision	0.62
Recall	0.59
F1-Score	0.60

Business Insight

Customers with monthly contracts, short tenure, a high number of complaints, and late payments have a higher risk of churn. The company can apply retention strategies for this segment through loyalty programs, personalized discounts, and service quality improvements.

Project Structure

customer-churn-prediction-machine-learning/
├── data/
│   └── customer_churn.csv
├── notebook/
│   └── churn_prediction_analysis.ipynb
├── src/
│   └── train_model.py
├── model/
│   └── churn_model.joblib
├── images/
│   ├── churn_distribution.png
│   └── feature_importance.png
├── README.md
├── requirements.txt
├── .gitignore
└── LICENSE

How to Run

Install the required dependencies:

pip install -r requirements.txt

Run the training script:

python src/train_model.py

Or open the notebook:

jupyter notebook notebook/churn_prediction_analysis.ipynb

Portfolio Summary for CV

Built a machine learning classification model to predict customer churn using customer behavior and transaction data. The project includes data cleaning, exploratory data analysis, feature engineering, model training, evaluation using accuracy, precision, recall, F1-score, and business recommendations for customer retention strategy.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Customer Churn Prediction Using Machine Learning

Business Problem

Dataset

Methodology

Model

Evaluation Result

Business Insight

Project Structure

How to Run

Portfolio Summary for CV

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
images		images
notebook		notebook
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_INA.md		README_INA.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Customer Churn Prediction Using Machine Learning

Business Problem

Dataset

Methodology

Model

Evaluation Result

Business Insight

Project Structure

How to Run

Portfolio Summary for CV

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages