Skip to content

manas-vamsi/Indian-liver-patient-ml

Repository files navigation

Indian Liver Patient Dataset – Machine Learning Analysis

πŸ“Œ Overview

This project explores the Indian Liver Patient Dataset using both unsupervised and supervised learning methods.
The goal is to analyze patient data, uncover patterns, and build predictive models for liver disease.

πŸ“Š Dataset

  • Source: Kaggle – Indian Liver Patient Dataset
  • 583 samples, 10 features
  • Features include Age, Gender, Bilirubin, Enzyme levels, Proteins, etc.
  • Target: Liver disease diagnosis (1 = Disease, 2 = No Disease)

πŸ›  Methods

  1. Exploratory Data Analysis (EDA)

    • Correlation heatmap
    • Gender distribution
    • Feature distributions
  2. Clustering (Unsupervised)

    • KMeans
    • Gaussian Mixture Models (GMM)
  3. Classification (Supervised)

    • Random Forest Classifier

πŸ“ˆ Results

Model Accuracy Notes
KMeans ~69% Captured partial structure
GMM ~57% Poor clustering
Random Forest ~75-80% Best performing supervised model

βœ… Conclusion

  • Supervised learning (Random Forest) outperformed clustering approaches.
  • Clustering showed some structure but not reliable for diagnosis.
  • This project highlights the importance of preprocessing and proper model choice in medical datasets.

πŸš€ How to Run

pip install -r requirements.txt
jupyter notebook liver_patient_analysis.ipynb

About

Machine Learning project analyzing the Indian Liver Patient dataset using clustering (KMeans, GMM) and classification (Random Forest).

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors