Initial Kindly Model performing poorly because of imbalanced data

The dataset in the datasets folder needs to be tidied up. Some messages are wrongly labeled as offensive. The dataset is also imbalanced (more than 80% of the data belongs to one class), which could be causing the model to perform poorly.
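
To confirm the imbalance before changing anything, a quick check along these lines might help. This is only a sketch: the function names, the label strings, and the 0.8 threshold (taken from the "more than 80%" figure above) are assumptions for illustration, not the project's actual code or schema.

```python
from collections import Counter


def class_distribution(labels):
    """Return each label's share of the dataset as a fraction between 0 and 1."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: count / total for label, count in counts.items()}


def is_imbalanced(labels, threshold=0.8):
    """Return True if any single class makes up more than `threshold` of the data."""
    return any(share > threshold for share in class_distribution(labels).values())


# Hypothetical labels, for illustration only (not taken from the real dataset).
example_labels = ["not_offensive"] * 9 + ["offensive"] * 1

if __name__ == "__main__":
    print(class_distribution(example_labels))
    print(is_imbalanced(example_labels))
```

Running the same check again after cleaning and augmenting would show whether the class shares have actually moved closer together.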

The first step is to clean up the dataset, then use data augmentation to generate more examples of the underrepresented class until the dataset is balanced.
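
One possible shape for the balancing step is sketched below. It assumes the cleaned data is a list of `(text, label)` pairs, and it uses an adjacent-word swap as a deliberately crude stand-in for a real augmentation technique, which this issue does not specify. The names `augment` and `balance` are hypothetical.

```python
import random


def augment(text, rng):
    """Create a variant of `text` by swapping one pair of adjacent words."""
    words = text.split()
    if len(words) < 2:
        return text  # nothing to swap, so return the text unchanged
    i = rng.randrange(len(words) - 1)
    words[i], words[i + 1] = words[i + 1], words[i]
    return " ".join(words)


def balance(rows, seed=0):
    """Append augmented minority-class rows until every class matches the largest one.

    `rows` is assumed to be a list of (text, label) pairs; originals are kept as-is.
    """
    if not rows:
        return []
    rng = random.Random(seed)  # fixed seed so the output is reproducible
    by_label = {}
    for text, label in rows:
        by_label.setdefault(label, []).append(text)
    target = max(len(texts) for texts in by_label.values())
    balanced = list(rows)
    for label, texts in by_label.items():
        for _ in range(target - len(texts)):
            balanced.append((augment(rng.choice(texts), rng), label))
    return balanced
```

Augmentation should run only after the mislabeled messages have been corrected, since otherwise the wrong labels get copied into the new examples. It is also safer to augment only the training split so that evaluation data stays untouched.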