Skip to content

AishwaryaParaspatki/Predicting-Diabetes-using-multilayer-perceptrons

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Predicting Diabetes using Multilayer Perceptrons

Technology stack:
Python
Keras - TensorFlow
Matplotlib
Numpy
Seaborn
sklearn

I have used the Diabetes dataset hosted by Kaggle.
(https://www.kaggle.com/uciml/pima-indians-diabetes-database)

The datasets consist of several medical predictor (independent) variables and one target (dependent) variable, Outcome. Independent variables include the number of pregnancies the patient has had, their BMI, insulin level, age, and so on.
Dataset

Notebooks:

Visualization.ipynb: This notebook consists of a study of the dataset using matplotlib and seabron libraries.
Histogram


Density graph


utils.ipynb: This notebook consists of the study of data before preprocessing and the preprocessing and feature engineering itself.

main.ipynb: This notebook consists of the multilayer perceptron model with specifications as follows: Sequential model with 3 layers: 1st layer - 32 hidden neurons with relu activation, 2nd layer - 16 hidden neurons with relu activation, output layer - 1 neuron with sigmoid activation. Adam optimizer with binary_crossentropy loss.


Evaluation metrics:


1. Accuracy:
![Accuracy](/images/Accuracy.PNG)

  1. Confusion Matrix:
    Confusion Matrix

  1. ROC Curve:
    ROC Curve

Releases

No releases published

Packages

No packages published