Major Release v0.8

@MaartenGr released this 31 May 10:02
8b81eb8

Mainly a visualization update to improve understanding of the topic model.

Features

  • Additional visualizations:
    • Topic Hierarchy: topic_model.visualize_hierarchy()
    • Topic Similarity Heatmap: topic_model.visualize_heatmap()
    • Topic Representation Barchart: topic_model.visualize_barchart()
    • Term Score Decline: topic_model.visualize_term_rank()
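
As a rough sketch of how the four new methods might be used together, the helper below calls each one on an already fitted topic model and collects the returned figures. The helper name `build_v08_figures` and the dictionary keys are illustrative assumptions, not part of BERTopic; only the four `visualize_*` method names come from this release.

```python
def build_v08_figures(topic_model):
    """Collect the four visualizations added in v0.8 from a fitted model.

    `topic_model` is assumed to be a fitted BERTopic instance; this helper
    is hypothetical and only calls the methods named in the release notes.
    """
    return {
        "hierarchy": topic_model.visualize_hierarchy(),
        "heatmap": topic_model.visualize_heatmap(),
        "barchart": topic_model.visualize_barchart(),
        "term_rank": topic_model.visualize_term_rank(),
    }


# Possible usage, assuming `docs` is a list of strings:
#   from bertopic import BERTopic
#   topic_model = BERTopic()
#   topics, probs = topic_model.fit_transform(docs)
#   figures = build_v08_figures(topic_model)
```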

Improvements

  • Created the bertopic.plotting module to make visualizations easier to extend
  • Improved automatic topic reduction by using HDBSCAN to detect similar topics
  • Topic ids are now sorted by frequency. -1 is the outlier class and typically contains the most documents. After that, 0 is the largest topic, 1 the second largest, etc.
  • Updated the MkDocs documentation with the new visualizations
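
To illustrate the new topic id ordering, here is a minimal sketch of relabeling by frequency. It is not BERTopic's own implementation; the function name `relabel_by_frequency` is hypothetical and the input is assumed to be a plain list of integer topic ids with -1 marking outliers.

```python
from collections import Counter


def relabel_by_frequency(topics):
    """Renumber topic ids so 0 is the largest topic, 1 the next, and so on.

    The outlier id -1 is left unchanged. Illustrative only.
    """
    # Count every topic except the outlier class.
    counts = Counter(t for t in topics if t != -1)
    # most_common() orders by descending count; ties keep first-seen order.
    mapping = {old: new for new, (old, _) in enumerate(counts.most_common())}
    mapping[-1] = -1
    return [mapping[t] for t in topics]


old_topics = [3, 3, -1, 7, 3, 7, 5, -1, -1, -1]
new_topics = relabel_by_frequency(old_topics)
# Old id 3 (three documents) becomes 0, 7 becomes 1, 5 becomes 2.
```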

Fixes