The project is divided into two parts:
- Named Entity Recognition using LSTM performed on conll2003 and OntoNotes5.0 datasets
- Titanic survival classification using KNN and Logistic regression
Package is not officially registered but you can download it using following command:
(@v1.8) pkg> add https://github.com/B0B36JUL-FinalProjects-2022/Projekt_lamossta
src/nn.jl
=> implemented classification LSTM netsrc/prepare_ner_data.jl
=> dataset loading, embedding conversion and feature vector construction
src/knn.jl
=> implementation of KNN classification algorithmsrc/logreg.jl
=> implementation of Logistic regressionsrc/prepare_titanic_data.jl
=> CSV loading, missing data augmentation and feature vector construction
examples/example.jl
=> example classification using all implemented algorithms mentioned aboveexamples/ner_ntb.ipynb
=> notebook with showcase of NER classificationexamples/titanic_ntb.ipynb
=> notebook with showcase of Titanic classificationtests/runtests.jl
=> Unit tests for various functions