GSoC 2018 project ideas
A list of ideas for Google Summer of Code 2018: new functionality and projects for Gensim, topic modelling for humans.
Potential mentors:
First of all, please have a look at the gensim roadmap 2018, which lays out our main targets for this year.
You can suggest any NLP-related project that, in your opinion, would be a successful addition to gensim, but please take our wishes into account.
Below you will find the directions that we would be very happy to see in gensim.
Difficulty: medium
Background: We already have a large number of models, so we want to pay more attention to quality, and documentation is the main thing here: if we have a great model but it lacks documentation, nobody will use it! For this reason, we want to significantly improve our documentation.
ToDo:
- [WIP] Docstrings for all stuff in gensim
- New "beginner tutorial chain" (persistent on site and in repository)
- User-guides for all stuff (sphinx-gallery)
- New documentation website
- New structure of documentation
Resources:
- https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt
- http://sphinxcontrib-napoleon.readthedocs.io/en/latest/example_numpy.html
- https://github.com/sphinx-gallery/sphinx-gallery
- https://github.com/RaRe-Technologies/gensim/projects/4
- https://www.youtube.com/watch?v=azf6yzuJt54&feature=youtu.be&list=PLBQrpodM6rL-LDUw_gTZKhpYOidQXJNQe
- https://www.divio.com/en/blog/documentation/
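To give a feel for the docstring item in the ToDo list, here is a minimal sketch of a function documented in the numpy style described in the first two resources. The function `count_tokens` and its `min_count` parameter are hypothetical and exist only for illustration; they are not part of gensim's API.

```python
from collections import Counter


def count_tokens(tokens, min_count=1):
    """Count how often each token occurs in a tokenized document.

    Hypothetical helper used only to illustrate the numpy docstring layout.

    Parameters
    ----------
    tokens : list of str
        Tokens of a single document.
    min_count : int, optional
        Tokens seen fewer than `min_count` times are dropped.

    Returns
    -------
    dict of (str, int)
        Mapping from token to its number of occurrences.

    Examples
    --------
    >>> count_tokens(["human", "computer", "human"])
    {'human': 2, 'computer': 1}

    """
    # Counter keeps tokens in first-seen order, so the result is deterministic.
    counts = Counter(tokens)
    return {token: n for token, n in counts.items() if n >= min_count}
```

The sections (`Parameters`, `Returns`, `Examples`) follow the numpy convention, which the napoleon extension for Sphinx can render into API reference pages.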
If you'd like to work on any of the topics below, or have your own ideas, get in touch at student-projects@rare-technologies.com.