
This project implements three clustering algorithms (K-Means, Hierarchical, and Mean Shift) and evaluates them with the Silhouette Score, Calinski-Harabasz Index, and Davies-Bouldin Index.

Overview

The analysis applies these clustering techniques to an Iris-like dataset, focusing on how preprocessing and PCA affect clustering performance.

Dataset

The dataset contains the following features:

- sepal_length
- sepal_width
- petal_length
- petal_width
- species (for validation)

Clustering Techniques

K-Means Clustering: Assesses cluster counts (c = 3, 4, 5) and evaluates performance using Silhouette, Calinski-Harabasz, and Davies-Bouldin scores.
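The K-Means evaluation described above could be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: it assumes scikit-learn's bundled Iris data as a stand-in for the project's dataset, and the helper name `evaluate_kmeans`, the seed, and `n_init=10` are hypothetical choices.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.metrics import (
    calinski_harabasz_score,
    davies_bouldin_score,
    silhouette_score,
)

# Stand-in data (assumption): scikit-learn's Iris measurements, which share
# the four sepal/petal feature columns listed in this README.
X = load_iris().data


def evaluate_kmeans(X, cluster_counts=(3, 4, 5), seed=0):
    """Fit K-Means once per cluster count and collect the three scores."""
    results = {}
    for c in cluster_counts:
        labels = KMeans(n_clusters=c, n_init=10, random_state=seed).fit_predict(X)
        results[c] = {
            "silhouette": silhouette_score(X, labels),
            "calinski_harabasz": calinski_harabasz_score(X, labels),
            "davies_bouldin": davies_bouldin_score(X, labels),
        }
    return results


scores = evaluate_kmeans(X)
for c, row in scores.items():
    print(c, {name: round(value, 3) for name, value in row.items()})
```

When reading the output, higher Silhouette and Calinski-Harabasz values indicate better-separated clusters, while a lower Davies-Bouldin value is better.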

Hierarchical Clustering: Analyzes the effects of normalization and PCA on clustering performance.
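One way to compare preprocessing conditions for hierarchical clustering is sketched below. The details are assumptions rather than the notebook's confirmed setup: min-max scaling stands in for "normalization", PCA keeps two components, Ward linkage with three clusters is used, scikit-learn's Iris data replaces the project's dataset, and `preprocess_variants` is a hypothetical helper.

```python
from sklearn.cluster import AgglomerativeClustering
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import MinMaxScaler

# Stand-in data (assumption): scikit-learn's Iris measurements.
X = load_iris().data


def preprocess_variants(X, n_components=2):
    """Return the feature matrix under four preprocessing conditions."""
    X_norm = MinMaxScaler().fit_transform(X)
    return {
        "raw": X,
        "normalized": X_norm,
        "pca": PCA(n_components=n_components).fit_transform(X),
        "normalized+pca": PCA(n_components=n_components).fit_transform(X_norm),
    }


hier_scores = {}
for name, X_variant in preprocess_variants(X).items():
    model = AgglomerativeClustering(n_clusters=3, linkage="ward")
    labels = model.fit_predict(X_variant)
    # Each score is computed in the same feature space the model was fit on.
    hier_scores[name] = silhouette_score(X_variant, labels)

for name, score in hier_scores.items():
    print(f"{name:<16}{score:.3f}")
```

Because each variant is scored in its own feature space, the numbers show how well separated the clusters are after each transformation rather than measuring all variants against the raw features.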

Mean Shift Clustering: Examines the algorithm's effectiveness under various preprocessing conditions.
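Unlike K-Means, Mean Shift does not take a cluster count; it infers one from a kernel bandwidth. The sketch below runs it on raw and min-max-scaled features. The `quantile=0.2` bandwidth setting, the scaler, and the use of scikit-learn's Iris data are illustrative assumptions, not values taken from the notebook.

```python
from sklearn.cluster import MeanShift, estimate_bandwidth
from sklearn.datasets import load_iris
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import MinMaxScaler

# Stand-in data (assumption): scikit-learn's Iris measurements.
X = load_iris().data

conditions = {
    "raw": X,
    "normalized": MinMaxScaler().fit_transform(X),
}

mean_shift_summary = {}
for name, X_variant in conditions.items():
    # The bandwidth is estimated from pairwise distances in this feature space.
    bandwidth = estimate_bandwidth(X_variant, quantile=0.2)
    labels = MeanShift(bandwidth=bandwidth).fit_predict(X_variant)
    n_clusters = len(set(labels))
    # The silhouette score is only defined when at least two clusters exist.
    score = silhouette_score(X_variant, labels) if n_clusters > 1 else None
    mean_shift_summary[name] = {"n_clusters": n_clusters, "silhouette": score}

for name, row in mean_shift_summary.items():
    print(name, row)
```

The number of clusters found can change with the preprocessing condition, since scaling changes the distances that drive the bandwidth estimate.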

Results

Results are summarized in comparison tables, illustrating the performance of each algorithm across different configurations.
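A comparison table of this kind could be assembled along the following lines. This sketch does not reproduce the project's results: it assumes scikit-learn's Iris data as a stand-in, default Mean Shift settings, and three clusters for the other two algorithms, and the column layout is a hypothetical choice.

```python
from sklearn.cluster import AgglomerativeClustering, KMeans, MeanShift
from sklearn.datasets import load_iris
from sklearn.metrics import (
    calinski_harabasz_score,
    davies_bouldin_score,
    silhouette_score,
)

# Stand-in data (assumption): scikit-learn's Iris measurements.
X = load_iris().data

models = {
    "K-Means (c=3)": KMeans(n_clusters=3, n_init=10, random_state=0),
    "Hierarchical (c=3)": AgglomerativeClustering(n_clusters=3),
    "Mean Shift": MeanShift(),
}

rows = []
for name, model in models.items():
    labels = model.fit_predict(X)
    if len(set(labels)) > 1:
        metrics = (
            silhouette_score(X, labels),
            calinski_harabasz_score(X, labels),
            davies_bouldin_score(X, labels),
        )
    else:
        # All three metrics are undefined for a single cluster.
        metrics = (float("nan"),) * 3
    rows.append((name, *metrics))

header = f"{'Configuration':<20}{'Silhouette':>12}{'Calinski-Harabasz':>20}{'Davies-Bouldin':>16}"
table_lines = [header] + [
    f"{name:<20}{sil:>12.3f}{ch:>20.1f}{db:>16.3f}" for name, sil, ch, db in rows
]
print("\n".join(table_lines))
```

Extra rows for other cluster counts or preprocessing conditions can be added by extending the `models` dictionary or by looping over transformed copies of `X`.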

jsharma9992/Clustering1
