Releases: kumparan/nlp-id
v.0.1.20.0
What's Changed
Full Changelog: v0.1.19.0...v0.1.20.0
v.0.1.19.0
Fix Vulnerability Library
- Update NLTK version to 3.9.1
Support Python 3.9
- Update sklearn version to support Python 3.9
Update postagger model
- Update postagger model
- Update postag dataset
Update Tokenizer Data
- Update non_clitics data for tokenizer
Update Tokenizer
- Fix tokenization of tokens like sepenuhnya, setelahnya, sebelum-sebelumnya, etc.
Update Lemmatizer and Tokenizer
- Fix tokenization of words ending with nya, ku, mu, etc.
- Fix lemmatization of words with -in it
Adjust tokenizer
- Adjust tokenizer so that tokens that are personal pronouns or particles are split into separate tokens
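The clitic-related tokenizer changes above can be illustrated with a minimal sketch. This is not the nlp-id implementation: the function names, the suffix list, and the contents of the exception set are hypothetical. It also assumes that words such as sepenuhnya are meant to stay whole while a pronoun suffix such as nya in bukunya is split off.

```python
# Hypothetical illustration only; this is NOT the nlp-id tokenizer.
# Assumption: clitics such as "nya", "ku", "mu" are split off a word,
# unless the word appears in an exception list of non-clitic words.

CLITIC_SUFFIXES = ("nya", "ku", "mu")

# Hypothetical exception list standing in for "non_clitics" data.
NON_CLITICS = {"sepenuhnya", "setelahnya", "aku", "kamu", "ilmu"}


def split_clitic(token):
    """Split a trailing clitic from a token unless it is an exception."""
    lowered = token.lower()
    if lowered in NON_CLITICS:
        return [token]
    for suffix in CLITIC_SUFFIXES:
        # Require a stem of at least three characters before the clitic.
        if lowered.endswith(suffix) and len(lowered) > len(suffix) + 2:
            return [token[: -len(suffix)], token[-len(suffix):]]
    return [token]


def tokenize(text):
    """Whitespace-tokenize text, then split clitics from each word."""
    tokens = []
    for word in text.split():
        tokens.extend(split_clitic(word))
    return tokens


print(tokenize("bukunya hilang sepenuhnya"))
```

Using an explicit exception set keeps the suffix rule simple. The trade-off is that coverage depends entirely on how complete that list is, which would explain why such data needs periodic updates.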
Add training and evaluation
- Add training and evaluation methods to the README file