I have a lot of outliers in my project, I want to get the probability of each sentence belonging to each topic, If I use topics,probs= topic_model.fit_transform(sentences,embeddings=embeddings) the probs obtained by this method will contain every sentence (including outliers). Is there any way to extract these outliers, and do these outliers also have topic probability proportions #2220
superseanyoung
started this conversation in
General
Replies: 1 comment 3 replies
-
In the obtained probs, I do not want to get sentences belonging to outliers, how should I post them |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
topic_model = BERTopic(embedding_model=transformer_model,
min_topic_size=3,
verbose=True,
umap_model=umap_model,
hdbscan_model=hdbscan_model,
ctfidf_model=ctfidf_model,
representation_model=representation_model,
#top_n_words=10,
#min_topic_size=10,
#nr_topics=None,
#low_memory=False,
calculate_probabilities=True
)
topics,probs= topic_model.fit_transform(sentences,embeddings=embeddings)
Beta Was this translation helpful? Give feedback.
All reactions