Skip to content

Commit 28113ab

Browse files
committed
finalize 4.9.0
1 parent 9bbd235 commit 28113ab

File tree

19 files changed

+9188
-442
lines changed

19 files changed

+9188
-442
lines changed

README-pypi.rst

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,7 @@ Features
4646
- **Emotion Analysis**, detect and recognize 6 different emotions of texts using finetuned Transformer-Bahasa.
4747
- **Entities Recognition**, seeks to locate and classify named entities mentioned in text using finetuned Transformer-Bahasa.
4848
- **Generator**, generate any texts given a context using T5-Bahasa, GPT2-Bahasa or Transformer-Bahasa.
49-
- **Jawi-to-Rumi**, convert from Jawi to Rumi using LSTM Seq2Seq with attention state-of-art.
49+
- **Jawi-to-Rumi**, convert from Jawi to Rumi using Transformer.
5050
- **Keyword Extraction**, provide RAKE, TextRank and Attention Mechanism hybrid with Transformer-Bahasa.
5151
- **Knowledge Graph**, generate Knowledge Graph using T5-Bahasa or parse from Dependency Parsing models.
5252
- **Language Detection**, using Fast-text and Sparse Deep learning Model to classify Malay (formal and social media), Indonesia (formal and social media), Rojak language and Manglish.
@@ -57,7 +57,7 @@ Features
5757
- **Part-of-Speech Recognition**, grammatical tagging is the process of marking up a word in a text using finetuned Transformer-Bahasa.
5858
- **Question Answer**, reading comprehension using finetuned Transformer-Bahasa.
5959
- **Relevancy Analysis**, detect and recognize relevancy of texts using finetuned Transformer-Bahasa.
60-
- **Rumi-to-Jawi**, convert from Rumi to Jawi using LSTM Seq2Seq with attention state-of-art.
60+
- **Rumi-to-Jawi**, convert from Rumi to Jawi using Transformer.
6161
- **Sentiment Analysis**, detect and recognize polarity of texts using finetuned Transformer-Bahasa.
6262
- **Text Similarity**, provide interface for lexical similarity deep semantic similarity using finetuned Transformer-Bahasa.
6363
- **Spell Correction**, using local Malaysia NLP researches hybrid with Transformer-Bahasa to auto-correct any bahasa words and NeuSpell using T5-Bahasa.

README.rst

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -65,7 +65,7 @@ Features
6565
- **Emotion Analysis**, detect and recognize 6 different emotions of texts using finetuned Transformer-Bahasa.
6666
- **Entities Recognition**, seeks to locate and classify named entities mentioned in text using finetuned Transformer-Bahasa.
6767
- **Generator**, generate any texts given a context using T5-Bahasa, GPT2-Bahasa or Transformer-Bahasa.
68-
- **Jawi-to-Rumi**, convert from Jawi to Rumi using LSTM Seq2Seq with attention state-of-art.
68+
- **Jawi-to-Rumi**, convert from Jawi to Rumi using Transformer.
6969
- **Keyword Extraction**, provide RAKE, TextRank and Attention Mechanism hybrid with Transformer-Bahasa.
7070
- **Knowledge Graph**, generate Knowledge Graph using T5-Bahasa or parse from Dependency Parsing models.
7171
- **Language Detection**, using Fast-text and Sparse Deep learning Model to classify Malay (formal and social media), Indonesia (formal and social media), Rojak language and Manglish.
@@ -76,7 +76,7 @@ Features
7676
- **Part-of-Speech Recognition**, grammatical tagging is the process of marking up a word in a text using finetuned Transformer-Bahasa.
7777
- **Question Answer**, reading comprehension using finetuned Transformer-Bahasa.
7878
- **Relevancy Analysis**, detect and recognize relevancy of texts using finetuned Transformer-Bahasa.
79-
- **Rumi-to-Jawi**, convert from Rumi to Jawi using LSTM Seq2Seq with attention state-of-art.
79+
- **Rumi-to-Jawi**, convert from Rumi to Jawi using Transformer.
8080
- **Sentiment Analysis**, detect and recognize polarity of texts using finetuned Transformer-Bahasa.
8181
- **Text Similarity**, provide interface for lexical similarity deep semantic similarity using finetuned Transformer-Bahasa.
8282
- **Spell Correction**, using local Malaysia NLP researches hybrid with Transformer-Bahasa to auto-correct any bahasa words and NeuSpell using T5-Bahasa.

docs/Api.rst

Lines changed: 18 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -138,10 +138,10 @@ malaya.preprocessing
138138
.. automodule:: malaya.preprocessing
139139
:members:
140140

141-
.. autoclass:: malaya.preprocessing.Tokenizer()
141+
.. autoclass:: malaya.preprocessing.Preprocessing()
142142
:members:
143143

144-
.. autoclass:: malaya.preprocessing.Preprocessing()
144+
.. autoclass:: malaya.preprocessing.Demoji()
145145
:members:
146146

147147
malaya.qa
@@ -219,6 +219,18 @@ malaya.tatabahasa
219219
.. automodule:: malaya.tatabahasa
220220
:members:
221221

222+
malaya.tokenizer
223+
--------------------
224+
225+
.. automodule:: malaya.tokenizer
226+
:members:
227+
228+
.. autoclass:: malaya.tokenizer.Tokenizer()
229+
:members:
230+
231+
.. autoclass:: malaya.tokenizer.SentenceTokenizer()
232+
:members:
233+
222234
malaya.summarization.abstractive
223235
---------------------------------
224236

@@ -452,7 +464,10 @@ malaya.model.tf
452464
.. autoclass:: malaya.model.tf.Seq2SeqLSTM()
453465
:members:
454466

455-
.. autoclass:: malaya.model.tf.Seq2SeqLSTM_Split()
467+
.. autoclass:: malaya.model.tf.JawiRumi()
468+
:members:
469+
470+
.. autoclass:: malaya.model.tf.RumiJawi()
456471
:members:
457472

458473
malaya.model.xlnet

docs/README.rst

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -65,7 +65,7 @@ Features
6565
- **Emotion Analysis**, detect and recognize 6 different emotions of texts using finetuned Transformer-Bahasa.
6666
- **Entities Recognition**, seeks to locate and classify named entities mentioned in text using finetuned Transformer-Bahasa.
6767
- **Generator**, generate any texts given a context using T5-Bahasa, GPT2-Bahasa or Transformer-Bahasa.
68-
- **Jawi-to-Rumi**, convert from Jawi to Rumi using LSTM Seq2Seq with attention state-of-art.
68+
- **Jawi-to-Rumi**, convert from Jawi to Rumi using Transformer.
6969
- **Keyword Extraction**, provide RAKE, TextRank and Attention Mechanism hybrid with Transformer-Bahasa.
7070
- **Knowledge Graph**, generate Knowledge Graph using T5-Bahasa or parse from Dependency Parsing models.
7171
- **Language Detection**, using Fast-text and Sparse Deep learning Model to classify Malay (formal and social media), Indonesia (formal and social media), Rojak language and Manglish.
@@ -76,7 +76,7 @@ Features
7676
- **Part-of-Speech Recognition**, grammatical tagging is the process of marking up a word in a text using finetuned Transformer-Bahasa.
7777
- **Question Answer**, reading comprehension using finetuned Transformer-Bahasa.
7878
- **Relevancy Analysis**, detect and recognize relevancy of texts using finetuned Transformer-Bahasa.
79-
- **Rumi-to-Jawi**, convert from Rumi to Jawi using LSTM Seq2Seq with attention state-of-art.
79+
- **Rumi-to-Jawi**, convert from Rumi to Jawi using Transformer.
8080
- **Sentiment Analysis**, detect and recognize polarity of texts using finetuned Transformer-Bahasa.
8181
- **Text Similarity**, provide interface for lexical similarity deep semantic similarity using finetuned Transformer-Bahasa.
8282
- **Spell Correction**, using local Malaysia NLP researches hybrid with Transformer-Bahasa to auto-correct any bahasa words and NeuSpell using T5-Bahasa.

0 commit comments

Comments
 (0)