Update _bertopic.py to fix question/ github issue #1696 #1721

jonaslandsgesell · 2024-01-03T08:36:33Z

As discussed in #1696, I provide an updated doc string to reflect that topic_model.transform(docs)[0][i] is sometimes different from topic_model.transform(docs[i])[0][0]

MaartenGr · 2024-02-08T14:23:23Z

Thanks for this PR! Could you rephrase the following a bit:

(especially when using the HDBSCAN algorithm)

This makes it seems that this behavior is across many different algorithms when in reality this is HDBSCAN-specific behavior.

jonaslandsgesell · 2024-02-08T15:40:53Z

Sure! Do you have a suggestion for a specific wording?

I am currently lacking the fantasy for other ways to express the fact that HDBSCAN is responsible here while we could also have a pipeline without HDBSCAN (but another component which may or may not behave similarly)

MaartenGr · 2024-02-10T19:33:57Z

Sure! Do you have a suggestion for a specific wording?

I am currently lacking the fantasy for other ways to express the fact that HDBSCAN is responsible here while we could also have a pipeline without HDBSCAN (but another component which may or may not behave similarly)

You could do something like this: "A single document or a list of documents to predict the topic(s) for. NOTE: When using
HDBSCAN, the prediction might differ depending on whether a single document or a list of documents is passed
since it leverages the data points of other documents".

I think it's best to stay close to the original documentation and inner workings of HDBSCAN. I believe this and this resource are relevant from the top of my head.

Also, a small tip. ChatGPT works wonders for helping with these kinds of issues ;)

Update _bertopic.py to fix question/ github issue MaartenGr#1696

c92b495

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update _bertopic.py to fix question/ github issue #1696 #1721

Update _bertopic.py to fix question/ github issue #1696 #1721

Uh oh!

jonaslandsgesell commented Jan 3, 2024 •

edited

Loading

Uh oh!

MaartenGr commented Feb 8, 2024

Uh oh!

jonaslandsgesell commented Feb 8, 2024 •

edited

Loading

Uh oh!

MaartenGr commented Feb 10, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Update _bertopic.py to fix question/ github issue #1696 #1721

Are you sure you want to change the base?

Update _bertopic.py to fix question/ github issue #1696 #1721

Uh oh!

Conversation

jonaslandsgesell commented Jan 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MaartenGr commented Feb 8, 2024

Uh oh!

jonaslandsgesell commented Feb 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MaartenGr commented Feb 10, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jonaslandsgesell commented Jan 3, 2024 •

edited

Loading

jonaslandsgesell commented Feb 8, 2024 •

edited

Loading