Skip to content

kavyarana/Healthcare

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ«€ Heart Disease Prediction & Analysis

πŸ“Œ Problem Statement Cardiovascular diseases are among the leading causes of death globally. This project aims to identify significant factors influencing heart disease and predict potential heart attacks using machine learning. The dataset contains 14 attributes and 4,000+ records, providing detailed information about patient health indicators.

πŸ“‚ Dataset Records: 4,000+

Attributes: 14 (age, sex, cholesterol, resting blood pressure, thalassemia, etc.)

Target Variable: Presence of cardiovascular disease (CVD)

πŸ›  Tech Stack Language: Python

Libraries: Pandas, NumPy, Matplotlib, Seaborn

Visualization: Tableau

Environment: Jupyter Notebook

πŸ“Š Project Workflow

  1. Data Import & Inspection Checked structure, missing values, duplicates

Treated missing values appropriately

Removed duplicate records

Generated statistical summaries (mean, median, standard deviation)

  1. Exploratory Data Analysis (EDA) Identified categorical variables (e.g., gender, chest pain type) and analyzed distributions using count plots

Studied CVD occurrence across different ages

Investigated impact of resting blood pressure on heart disease

Analyzed gender distribution of patients

  1. Factor Analysis & Insights Explored cholesterol levels vs. CVD

Examined peak exercise (oldpeak, exercise-induced angina) relationships

Evaluated the role of thalassemia in CVD occurrence

Used pair plots to visualize variable relationships

  1. Predictive Modeling Model: Logistic Regression

Trained on processed dataset

Evaluated using Confusion Matrix for accuracy, precision, recall, and F1-score

  1. Dashboarding in Tableau Created visual comparisons between Diseased vs. Healthy individuals

Linked variables to visualize relationships

Built interactive CVD risk factor dashboard

πŸ” Key Insights High cholesterol, abnormal resting blood pressure, and oldpeak are strong CVD predictors.

Men showed a slightly higher incidence of heart disease than women in this dataset.

Thalassemia and exercise-induced angina are closely related to increased CVD risk.

People aged 50+ had a significantly higher probability of heart disease.

About

Cardiovascular diseases are a major global cause of death. Using a dataset of 14 attributes and 4,000+ records, analyze factors influencing heart health and build a predictive model to identify heart attack risks, enabling early intervention and improved prevention strategies.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors