Skip to content

Machine Learning projects using Random Forests to classify wine

Notifications You must be signed in to change notification settings

simbakowo/WineClassification-ML-

Repository files navigation

Multiclass classification usng random forests

This project compares the 2 ways of implementing Random Forrests on the
White Wine Dataset using classification. (10 classes, quality). Could also
be a regression problem.

Denstity and Residual Sugar are collinear. To decide which one to drop, we
can look at which one is less correlated to the output and which one ranks higher 
on the ANOVA F scores and FeatureImportance scores.

Alsp drop fixed acidity based on Feature importance scores.

Because RFs are treebased, they do not require feature scaling

Based on the confusion matrix and the PR curve, we have succesfully prioritised
fraud class (1) >> recall becuase FN are more important

About

Machine Learning projects using Random Forests to classify wine

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages