There are many ways of handling the 'sequence' column. Right now OrdinalEncoder is applied on the dummy notebook. It doesn't seem to be helpful as reflected by the confusion matrix. Suggest alternative approches. One I can think of is applying tfidf or countVectorizer in some way. Discuss the same here. Any pull requests are appreciated.