Skip to content

Commit 781fa58

Browse files
authored
Merge pull request #5 from JohnSnowLabs/0.2rc1
0.2rc1
2 parents dd93eda + 9508cf4 commit 781fa58

122 files changed

Lines changed: 1976 additions & 5687 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

README.md

Lines changed: 11 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ Take a look at our official Spark NLU page: [https://nlu.johnsnowlabs.com/](http
99

1010

1111
## NLU in action
12-
<img src="http://ckl-it.de/wp-content/uploads/2020/08/My-Video6.gif" width="1800" height="500"/>
12+
<img src="http://ckl-it.de/wp-content/uploads/2020/09/git_final.gif" width="1800" height="500"/>
1313

1414

1515

@@ -36,13 +36,14 @@ nlu.load('bert elmo albert xlnet glove use').predict('I love NLU! <3')
3636
## What kind of models does NLU provide?
3737
NLU provides everything a data scientist might want to wish for in one line of code!
3838
- The 100+ of the latest NLP word embeddings ( BERT, ELMO, ALBERT, XLNET, GLOVE, BIOBERT, ELECTRA, COVIDBERT) and different variations of them
39-
- The 10+ of the latest NLP sentence embeddings ( BERT, ELECTRA, USE) and different variations of them
39+
- The 45+ of the latest NLP sentence embeddings ( BERT, ELECTRA, USE) and different variations of them
40+
- 50 + Classifiers
4041
- Generation of Sentence, Chunk and Document from these embeddings
4142
- Language Classification of 20 languages
4243
- 36 pretrained NER models
4344
- 34 Part of Speech (POS) models
4445
- 34 Lemmatizer models
45-
- Emotion models for 5 categories
46+
- Emotion (joy, fear, suprise, sadness)
4647
- Labeled and Unlabeled Dependency parsing
4748
- Spell Checking
4849
- Stopword removers for 41 languages
@@ -80,7 +81,7 @@ Working with text data can sometimes be quite a dirty Job. NLU helps you keep yo
8081
- Stopword Cleaners
8182
- Pattern Cleaners
8283
- Slang Cleaner
83-
84+
- Tokenization
8485
## Where can I see NLUs entire offer?
8586
Checkout the [NLU Namespace](https://nlu.johnsnowlabs.com/docs/en/namespace) for everything that NLU has to offer!
8687

@@ -166,3 +167,9 @@ The following are Collab examples which showcase each NLU component and some app
166167

167168
# Simple NLU Demos
168169
- [NLU different output levels Demo](https://colab.research.google.com/drive/1C4N3wpC17YzZf9fXHDNAJ5JvSmfbq7zT?usp=sharing)
170+
171+
172+
# NLU on Medium
173+
- [Introduction to NLU](https://medium.com/@christian.kasim.loan/1-line-of-code-350-nlp-models-with-john-snow-labs-nlu-in-python-2f1c55bba619?sk=1379426a00ca970a6e101a8f74f66bd2)
174+
- [One line BERT Word Embeddings and t-SNE plotting with NLU](https://medium.com/@christian.kasim.loan/1-line-to-bert-word-embeddings-with-nlu-f50d2b08cddc?sk=ec38a4ee6c87984f2b3cc4328b27135a)
175+
- [BERT, ALBERT, ELECTRA, ELMO, XLNET, GLOVE Word Embeddings in one line and plotting with t-SNE](https://medium.com/@christian.kasim.loan/1-line-of-code-for-bert-albert-elmo-electra-xlnet-glove-part-of-speech-with-nlu-and-t-sne-9ebcd5379cd?sk=58137630075d8f8b23b66b17c3aaa72d)

docs/Gemfile.lock

Lines changed: 2 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -67,7 +67,7 @@ GEM
6767
jekyll-theme-time-machine (= 0.1.1)
6868
jekyll-titles-from-headings (= 0.5.1)
6969
jemoji (= 0.10.1)
70-
kramdown (>= 2.3.0)
70+
kramdown (= 1.17.0)
7171
liquid (= 4.0.0)
7272
listen (= 3.1.5)
7373
mercenary (~> 0.3)
@@ -185,9 +185,6 @@ GEM
185185
jekyll-seo-tag (~> 2.0)
186186
jekyll-titles-from-headings (0.5.1)
187187
jekyll (~> 3.3)
188-
jekyll-toc (0.13.1)
189-
jekyll (>= 3.7)
190-
nokogiri (~> 1.9)
191188
jekyll-watch (2.2.1)
192189
listen (~> 3.0)
193190
jemoji (0.10.1)
@@ -206,7 +203,7 @@ GEM
206203
jekyll (~> 3.5)
207204
jekyll-feed (~> 0.9)
208205
jekyll-seo-tag (~> 2.1)
209-
minitest (5.14.1)
206+
minitest (5.14.2)
210207
multipart-post (2.1.1)
211208
nokogiri (1.10.10)
212209
mini_portile2 (~> 2.4.0)
@@ -252,7 +249,6 @@ PLATFORMS
252249

253250
DEPENDENCIES
254251
github-pages (= 192)
255-
jekyll-toc
256252

257253
BUNDLED WITH
258254
2.1.4

docs/_data/navigation.yml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -21,17 +21,17 @@ docs-en:
2121
children:
2222
- title: Quick Start
2323
url: /docs/en/install
24+
- title: General Concepts
25+
url: /docs/en/concepts
2426
- title: Simple Examples
2527
url: /docs/en/examples
26-
- title: Visualization Examples
27-
url: /docs/en/viz_examples
2828
- title: Notebooks
2929
url: /docs/en/notebooks
30+
- title: Visualization Examples
31+
url: /docs/en/viz_examples
3032

3133
- title: Documentation
3234
children:
33-
- title: General Concepts
34-
url: /docs/en/concepts
3535
- title: NLU Load Function
3636
url: /docs/en/load_api
3737
- title: NLU Predict Function

docs/_layouts/article.html

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,17 @@
55
<style>
66

77

8+
table.standard {
9+
border: 1px solid ;
10+
text-align: left;
11+
border-collapse: collapse;
12+
overflow: auto;
13+
max-width: max-content;
14+
margin-left: auto;
15+
margin-right: auto;
16+
}
17+
18+
819

920
table.steelBlueCols {
1021
border: 1px solid #1C6EA4;

docs/_layouts/landing.html

Lines changed: 21 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -208,20 +208,31 @@ <h1>{{ _section.title }}</h1>
208208
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#sarcasm-classifier">Sarcasm Classifer</a>
209209
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#cyberbullying-classifier">Cyberbullying Classifier</a>
210210
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#spam-classifier">Spam Classifier</a>
211+
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#toxic-classifier">Toxic Classifier</a>
212+
211213
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#fake-news-classifier">Fake News Classifier</a>
212214
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#question-classifier-6-class">Classify Question (6 class)</a>
213215
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#question-classifier-50-class">Classify Question (50 class)</a>
214216
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#emotion-classifier">Multi Class Emotion detection</a>
217+
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#e2e-classifier">E2E Classifier</a>
218+
215219
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#sentiment-classifier">Sentiment detection</a>
216220
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#spell-checking">Spell checking</a>
217221

218222
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#language-classifier">Language Classification</a>
219-
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#word-embeddings-bert">Bert Word Embeddings</a>
220-
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#word-embeddings-elmo">Elmo Word Embeddings</a>
221-
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#word-embeddings-albert">Albert Word Embeddings</a>
222-
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#word-embeddings-xlnet">Xlnet Word Embeddings</a>
223-
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#word-embeddings-glove">Glove Word Embeddings</a>
223+
224+
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#word-embeddings-bert">BERT Word Embeddings</a>
225+
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#word-embeddings-elmo">ELMO Word Embeddings</a>
226+
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#word-embeddings-electra">Electra Word Embeddings</a>
227+
228+
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#word-embeddings-albert">ALBERT Word Embeddings</a>
229+
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#word-embeddings-xlnet">XLNET Word Embeddings</a>
230+
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#word-embeddings-glove">GLOVE Word Embeddings</a>
231+
224232
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#sentence-embeddings-use">USE Sentence Embeddings</a>
233+
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#sentence-embeddings-bert">BERT Sentence Embeddings</a>
234+
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#sentence-embeddings-electra">ELECTRA Sentence Embeddings</a>
235+
225236
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#dependency-parsing-unlabeled">Dependency Parsing Unlabeled</a>
226237
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#dependency-parsing-labeled">Dependency Parsing Labeled</a>
227238
</b></b></li><li><b><b><a class="button button--outline-info button--pill button--lg" href="#lemmatization">Lemmatization</a>
@@ -269,15 +280,15 @@ <h3 style="margin-left:-10px">NLU Features</h3>
269280
<ul>
270281
<li> - State-of-the-art <strong>Deep Learning</strong> algorithms</li>
271282
<li> - Achieve high accuracy with a <strong>one line of code</strong></li>
272-
<li> - <strong>244 +</strong> unique NLU components</li>
283+
<li> - <strong>350 +</strong> NLP Models</li>
273284
<li> - <strong>176 +</strong> unique NLP models and algorithms</li>
274285
<li> - <strong>68 +</strong> unique NLP pipelines consisting of composed NLU components</li>
275-
<li> - <strong>60 +</strong> languages supported</li>
276-
<li> - <strong>14 +</strong> embeddings <strong> BERT, ELMO, ALBERT, XLNET, GLOVE, USE </strong></li>
277-
<li> - <strong>12 +</strong> Pre-trained Classifiers : <strong> Emotion, Sarcasm, Language, Question</strong> </li>
286+
<li> - <strong>50 +</strong> languages supported</li>
287+
<li> - <strong>14 +</strong> embeddings <strong> BERT, ELMO, ALBERT, XLNET, GLOVE, USE, ELECTRA </strong></li>
288+
<li> - <strong>50 +</strong> Pre-trained Classifiers : <strong> Emotion, Sarcasm, Language, Question, E2E, Toxic</strong> </li>
278289
<li> - <strong>36 +</strong> Pre-Trained <strong> NER (Named Entity Recognition) </strong> models </li>
279290
<li> - <strong>34 +</strong> Pre-Trained <strong> POS (Part of Speech) </strong> models </li>
280-
<li> - <strong>34 +</strong> Pre-Trained <strong> Lemmatizer </strong> models </li>
291+
<li> - <strong>3 +</strong> Pre-Trained <strong> Lemmatizer </strong> models </li>
281292
<li> - Dependency parsing untyped and typed</li>
282293
<li> - Spell Checking</li>
283294
<li> - Multi-lingual NER models in <strong>Dutch, English, French, German, Italian, Norwegian, Polish, Portuguese, Russian, Spanish</strong></li>

0 commit comments

Comments
 (0)