Initial Kindly Model performing poorly because of imbalanced data #76

@AyiteyDjaba

Description

The dataset in the datasets folder needs to be tidied up. Some messages are wrongly labeled as offensive. The dataset is also imbalanced (more than 80% of the data belongs to one class), which could be causing the model to perform poorly.
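
To confirm the imbalance before changing anything, a quick check of the class shares could look like the sketch below. The label names (`offensive`, `not_offensive`), the sample counts, and the 0.8 threshold are assumptions for illustration only; they are not taken from the actual dataset file.

```python
from collections import Counter


def class_shares(labels):
    """Return the fraction of examples that carry each label."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: count / total for label, count in counts.items()}


def is_imbalanced(labels, threshold=0.8):
    """True if a single class makes up more than `threshold` of the data."""
    if not labels:
        return False
    return max(class_shares(labels).values()) > threshold


# Hypothetical labels standing in for the real dataset column.
labels = ["not_offensive"] * 85 + ["offensive"] * 15
shares = class_shares(labels)
imbalanced = is_imbalanced(labels)
```

Running the same check again after relabeling shows whether fixing the wrong labels alone shifts the distribution.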

The first step is to clean up the dataset. The next is to use data augmentation to generate more data and balance out the classes.
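
As a baseline to compare against, here is a minimal sketch of balancing by random oversampling. This is a simpler stand-in for real text augmentation: it only duplicates existing minority examples and generates no new text. The `(text, label)` row format, the function name `oversample`, and the sample rows are hypothetical.

```python
import random
from collections import defaultdict


def oversample(rows, seed=0):
    """Duplicate randomly chosen rows of smaller classes until every class
    has as many rows as the largest one. `rows` is a list of (text, label)."""
    if not rows:
        return []
    rng = random.Random(seed)

    # Group rows by their label.
    by_label = defaultdict(list)
    for text, label in rows:
        by_label[label].append((text, label))

    target = max(len(group) for group in by_label.values())

    balanced = []
    for group in by_label.values():
        balanced.extend(group)
        # Sample with replacement to fill the gap up to the target size.
        balanced.extend(rng.choices(group, k=target - len(group)))

    rng.shuffle(balanced)
    return balanced


# Hypothetical rows standing in for the cleaned dataset.
rows = [(f"message {i}", "not_offensive") for i in range(8)]
rows += [(f"rude message {i}", "offensive") for i in range(2)]
balanced = oversample(rows)
```

Oversampling should be applied to the training split only, after the train/test split, so that duplicated messages do not leak into evaluation.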
