
Releases: malaysia-ai/malaya

Version 4.0

16 Nov 04:27


  1. Added quantized versions of all Malaya models, reducing inference time by 2x and model size by 4x.
  2. Retrained constituency parsing, slightly improving accuracy by ~1-2%.
  3. Added a sentence-level and word-level vectorization interface for all classification models.
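The vectorization interface exposes embeddings per sentence or per word. As a concept-only illustration (not Malaya's actual API), a sentence-level vector can be sketched as a mean pool over word vectors:

```python
# Concept sketch only: Malaya's vectorization interface returns contextual
# embeddings from its transformer models; this toy version mean-pools
# fixed word vectors to show what a sentence-level vector is.

def mean_pool(word_vectors):
    """Average a list of equal-length word vectors into one sentence vector."""
    if not word_vectors:
        raise ValueError("need at least one word vector")
    dim = len(word_vectors[0])
    n = len(word_vectors)
    return [sum(vec[i] for vec in word_vectors) / n for i in range(dim)]

# Made-up 3-dimensional vectors for the Malay sentence "saya suka makan".
vectors = {
    "saya": [0.1, 0.2, 0.3],
    "suka": [0.4, 0.0, 0.2],
    "makan": [0.1, 0.4, 0.1],
}
sentence_vector = mean_pool([vectors[w] for w in "saya suka makan".split()])
print([round(v, 2) for v in sentence_vector])  # → [0.2, 0.2, 0.2]
```

Word-level vectorization would instead return one vector per token; a classifier head typically consumes the pooled sentence vector.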

Version 3.8.1

16 Aug 16:07


  1. Released constituency parsing.

Version 3.8

05 Aug 18:13


  1. Improved spelling correction.
  2. Improved normalizer.
  3. Improved EN-MS translation; it now supports longer texts and US-style texts.

Version 3.7

10 Jul 05:02


  1. Added EN-to-MS and MS-to-EN translation modules.
  2. Added paraphrase module.
  3. Added keyword extraction module.
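To illustrate what a keyword extraction module does, here is a naive frequency-based stand-in; it is not Malaya's implementation, and the stopword list is made up for the example:

```python
# Toy keyword extraction: count non-stopword terms and return the most
# frequent ones. Real extractors use graph- or phrase-based scoring; this
# only illustrates the input/output shape of the task.
from collections import Counter

# Illustrative stopword list (a few Malay and English function words).
STOPWORDS = {"dan", "yang", "di", "ke", "and", "the", "a", "of", "is"}

def extract_keywords(text, top_k=3):
    words = [w.strip(".,!?").lower() for w in text.split()]
    counts = Counter(w for w in words if w and w not in STOPWORDS)
    return [word for word, _ in counts.most_common(top_k)]

print(extract_keywords("Malaya adalah perpustakaan NLP dan Malaya menyokong bahasa Melayu", top_k=2))
```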

Version 3.4

27 Apr 13:53


release 3.4

Version 2.7

07 Aug 17:36


  1. BERT-Bahasa interface available.
  2. Added BERT-Multilanguage, BERT-Base and BERT-small for emotion analysis.
  3. Added BERT-Multilanguage, BERT-Base and BERT-small for Named Entity Recognition.
  4. Added BERT-Multilanguage, BERT-Base and BERT-small for Part-of-Speech tagging.
  5. Added BERT-Multilanguage and BERT-Base for relevancy analysis.
  6. Added BERT-Multilanguage, BERT-Base and BERT-small for sentiment analysis.
  7. Added an encoder interface for text similarity; skip-thought, BERT, or XLNET can be used as the encoder model.
  8. Added tree plot visualization for text similarity.
  9. Added BERT-Multilanguage, BERT-Base and BERT-small for subjectivity analysis.
  10. Added an encoder interface for text summarization; skip-thought, BERT, or XLNET can be used as the encoder model.
  11. Added BERT / XLNET interface for topic modeling.
  12. Added BERT-Multilanguage, BERT-Base and BERT-small for toxicity analysis.
  13. Removed siamese models for text similarity.
  14. Removed fast-text-char models, replaced by BERT models.
  15. Malaya no longer supports a training interface.
  16. XLNET-Bahasa interface available.
  17. Sequence models are no longer improved by Malaya; we have moved on to attention-based models.

Version 2.6

25 Jun 03:56


  1. Added deep siamese network, https://malaya.readthedocs.io/en/latest/Similarity.html#deep-siamese-network.
  2. Added BERT deep siamese network, https://malaya.readthedocs.io/en/latest/Similarity.html#bert-model
  3. Added Doc2Vec to calculate semantic similarity, https://malaya.readthedocs.io/en/latest/Similarity.html#calculate-similarity-using-doc2vec
  4. All extractive summarization now uses the TextRank algorithm for scoring.
  5. Added Doc2Vec for extractive summarization, https://malaya.readthedocs.io/en/latest/Summarization.html#load-doc2vec-summarization
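TextRank scoring ranks sentences by running PageRank over a sentence-similarity graph. A minimal sketch under simplified assumptions (word-overlap similarity and plain power iteration; not Malaya's actual implementation):

```python
# Minimal TextRank-style sentence scoring. Similarity is Jaccard word
# overlap; scores come from damped power iteration, as in PageRank.

def overlap_similarity(a, b):
    """Jaccard overlap between two token lists."""
    sa, sb = set(a), set(b)
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)

def textrank_scores(sentences, damping=0.85, iterations=50):
    tokens = [s.lower().split() for s in sentences]
    n = len(sentences)
    # Symmetric similarity graph with no self-loops.
    sim = [[overlap_similarity(tokens[i], tokens[j]) if i != j else 0.0
            for j in range(n)] for i in range(n)]
    row_sums = [sum(row) for row in sim]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        scores = [
            (1 - damping) / n
            + damping * sum(
                scores[j] * sim[j][i] / row_sums[j]
                for j in range(n) if row_sums[j] > 0 and sim[j][i] > 0
            )
            for i in range(n)
        ]
    return scores

sentences = [
    "Malaya menyokong ringkasan teks",
    "Malaya menyokong terjemahan teks",
    "cuaca hari ini panas",
]
scores = textrank_scores(sentences)
# The two overlapping sentences score higher than the unrelated one.
```

An extractive summarizer then keeps the top-scoring sentences in their original order.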

Version 2.4

01 Jun 05:40


  1. Added relevancy analysis, to determine whether an article or piece of text is relevant or tends toward fake news. https://malaya.readthedocs.io/en/latest/Relevancy.html
  2. Added a visualization dashboard for emotion analysis, relevancy analysis, sentiment analysis, subjectivity analysis and toxicity analysis. Very easy to use: call the predict_words function and the dashboard will pop up.
  3. Added neutral class for relevancy analysis, sentiment analysis and subjectivity analysis.
  4. All deep learning classification models now use Malaya preprocessing.

Version 1.9

27 Feb 14:34


  1. Fixed some English loading bugs.
  2. Added clustering visualization, https://malaya.readthedocs.io/en/latest/Cluster.html
  3. Added text augmentation, https://malaya.readthedocs.io/en/latest/Generator.html
  4. The normalizer and spelling correction are now able to detect English words.

Version 1.7

15 Feb 12:37


  1. Added text similarity and released partial topic-related features, https://malaya.readthedocs.io/en/latest/Similarity.html
  2. Added word-mover distance interface, https://malaya.readthedocs.io/en/latest/Mover.html
  3. Added pretrained fast-text trained on Wikipedia, https://malaya.readthedocs.io/en/latest/Fasttext.html
  4. Improved sentiment analysis: trained on more than 800k sentences and now more sensitive to social-media texts.
  5. Removed n-grams for all fast-text models to reduce the curse of dimensionality.
  6. Removed the sparse limit for all fast-text-char models to improve n-gram sensitivity.