Major Release v0.8
Mainly a visualization update to improve understanding of the topic model.
Features
- Additional visualizations:
- Topic Hierarchy:
topic_model.visualize_hierarchy()
- Topic Similarity Heatmap:
topic_model.visualize_heatmap()
- Topic Representation Barchart:
topic_model.visualize_barchart()
- Term Score Decline:
topic_model.visualize_term_rank()
- Topic Hierarchy:
Improvements
- Created
bertopic.plotting
library to easily extend visualizations - Improved automatic topic reduction by using HDBSCAN to detect similar topics
- Sort topic ids by their frequency. -1 is the outlier class and contains typically the most documents. After that 0 is the largest topic, 1 the second largest, etc.
- Update MKDOCS with new visualizations