Commit 5d47ec4

Merge branch 'release-0.13.4.1'
2 parents e838391 + 26a17ac commit 5d47ec4

20 files changed: +2222 -1678 lines changed

CHANGELOG.md

+7 -6
@@ -7,21 +7,22 @@ None
 
 0.13.4, 2016-12-22
 
-* Evaluation of word2vec models against semantic similarity datasets like SimLex-999 (#1047) (@akutuzov, [#1047](https://github.com/RaRe-Technologies/gensim/pull/1047))
+* Added suggested lda model method and print methods to HDP class (@bhargavvader, [#1055](https://github.com/RaRe-Technologies/gensim/pull/1055))
+* New class KeyedVectors to store embedding separate from training code (@anmol01gulati and @droudy, [#980](https://github.com/RaRe-Technologies/gensim/pull/980))
+* Evaluation of word2vec models against semantic similarity datasets like SimLex-999 (@akutuzov, [#1047](https://github.com/RaRe-Technologies/gensim/pull/1047))
 * TensorBoard word embedding visualisation of Gensim Word2vec format (@loretoparisi, [#1051](https://github.com/RaRe-Technologies/gensim/pull/1051))
-* Throw exception if load() is called on instance rather than the class in word2vec and doc2vec (@dus0x,[(#889](https://github.com/RaRe-Technologies/gensim/pull/889))
-* Loading and Saving LDA Models across Python 2 and 3. Fix #853 (@anmolgulati, #913, [#1093](https://github.com/RaRe-Technologies/gensim/pull/1093))
+* Throw exception if load() is called on instance rather than the class in word2vec and doc2vec (@dust0x,[(#889](https://github.com/RaRe-Technologies/gensim/pull/889))
+* Loading and Saving LDA Models across Python 2 and 3. Fix #853 (@anmolgulati, [#913](https://github.com/RaRe-Technologies/gensim/pull/913), [#1093](https://github.com/RaRe-Technologies/gensim/pull/1093))
 * Fix automatic learning of eta (prior over words) in LDA (@olavurmortensen, [#1024](https://github.com/RaRe-Technologies/gensim/pull/1024)).
 * eta should have dimensionality V (size of vocab) not K (number of topics). eta with shape K x V is still allowed, as the user may want to impose specific prior information to each topic.
 * eta is no longer allowed the "asymmetric" option. Asymmetric priors over words in general are fine (learned or user defined).
 * As a result, the eta update (`update_eta`) was simplified some. It also no longer logs eta when updated, because it is too large for that.
 * Unit tests were updated accordingly. The unit tests expect a different shape than before; some unit tests were redundant after the change; `eta='asymmetric'` now should raise an error.
 * Optimise show_topics to only call get_lambda once. Fix #1006. (@bhargavvader, [#1028](https://github.com/RaRe-Technologies/gensim/pull/1028))
 * HdpModel doc improvement. Inference and print_topics (@dsquareindia, [#1029](https://github.com/RaRe-Technologies/gensim/pull/1029))
-* Removing Doc2Vec defaults so that it won't override Word2Vec defaults. Fix #795 (@markroxor, [#929](https://github.com/RaRe-Technologies/gensim/pull/929))
-Remove warning on gensim import "pattern not installed". Fix #1009 (@shashankg7, #1018)
+* Removing Doc2Vec defaults so that it won't override Word2Vec defaults. Fix #795. (@markroxor, [#929](https://github.com/RaRe-Technologies/gensim/pull/929))
+* Remove warning on gensim import "pattern not installed". Fix #1009 (@shashankg7, [#1018](https://github.com/RaRe-Technologies/gensim/pull/1018))
 * Add delete_temporary_training_data() function to word2vec and doc2vec models. (@deepmipt-VladZhukov, [#987](https://github.com/RaRe-Technologies/gensim/pull/987))
-* New class KeyedVectors to store embedding separate from training code (@anmol01gulati and @droudy, [#980](https://github.com/RaRe-Technologies/gensim/pull/980))
 * Documentation improvements (@IrinaGoloshchapova, [#1010](https://github.com/RaRe-Technologies/gensim/pull/1010), [#1011](https://github.com/RaRe-Technologies/gensim/pull/1011))
 * LDA tutorial by Olavur, tips and tricks (@olavurmortensen, [#779](https://github.com/RaRe-Technologies/gensim/pull/779))
 * Add double quote in commmand line to run on Windows (@akarazeev, [#1005](https://github.com/RaRe-Technologies/gensim/pull/1005))
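
Reviewer note on the eta entry above: the shape rule it describes can be illustrated with a small NumPy sketch. This is not gensim's code; `check_eta_shape`, the sizes, and the prior values are hypothetical names and numbers chosen only to show which shapes the changelog says are accepted (a length-V vector or a K x V matrix) and which is not (a length-K vector).

```python
import numpy as np

num_topics = 3   # K (hypothetical value)
vocab_size = 5   # V (hypothetical value)

# Prior over words shared by all topics: one entry per vocabulary word, shape (V,).
eta_vector = np.full(vocab_size, 0.1)

# Topic-specific prior over words: one row per topic, shape (K, V).
eta_matrix = np.full((num_topics, vocab_size), 0.1)
eta_matrix[0, 2] = 1.0  # e.g. push word 2 towards topic 0


def check_eta_shape(eta, num_topics, vocab_size):
    """Hypothetical helper: accept shape (V,) or (K, V), reject anything else."""
    eta = np.asarray(eta, dtype=float)
    if eta.shape in ((vocab_size,), (num_topics, vocab_size)):
        return eta.shape
    raise ValueError(
        "eta must have shape (%d,) or (%d, %d), got %s"
        % (vocab_size, num_topics, vocab_size, eta.shape)
    )


print(check_eta_shape(eta_vector, num_topics, vocab_size))
print(check_eta_shape(eta_matrix, num_topics, vocab_size))
```

A length-K vector such as `np.full(num_topics, 0.1)` fails this check, which mirrors the changelog's statement that eta is a prior over words rather than over topics.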

docs/notebooks/online_w2v_tutorial.ipynb

+6 -20
@@ -8,21 +8,7 @@
 "\n",
 "So far, word2vec cannot increase the size of vocabulary after initial training. To handle unknown words, not in word2vec vocaburary, you must retrain updated documents over again.\n",
 "\n",
-"In this tutorial, we introduce gensim new feature, online vocaburary update. This additional feature overcomes the unknown word problems. Despite after initial training, we can continuously add new vocaburary to the pre-trained word2vec model using this online feature.\n",
-"\n",
-"This implementation is still beta version at 16/09/04. You can download the beta version of online word2vec implementation in the following repository."
-]
-},
-{
-"cell_type": "code",
-"execution_count": null,
-"metadata": {
-"collapsed": true
-},
-"outputs": [],
-"source": [
-"%%bash\n",
-"git clone -b online-w2v [email protected]:isohyt/gensim.git"
+"In this tutorial, we introduce gensim new feature, online vocaburary update. This additional feature overcomes the unknown word problems. Despite after initial training, we can continuously add new vocaburary to the pre-trained word2vec model using this online feature.\n"
 ]
 },
 {
@@ -343,21 +329,21 @@
 ],
 "metadata": {
 "kernelspec": {
-"display_name": "Python 3",
+"display_name": "Python 2",
 "language": "python",
-"name": "python3"
+"name": "python2"
 },
 "language_info": {
 "codemirror_mode": {
 "name": "ipython",
-"version": 3
+"version": 2
 },
 "file_extension": ".py",
 "mimetype": "text/x-python",
 "name": "python",
 "nbconvert_exporter": "python",
-"pygments_lexer": "ipython3",
-"version": "3.5.1"
+"pygments_lexer": "ipython2",
+"version": "2.7.12"
 }
 },
 "nbformat": 4,
