When the LLM is interrupted while it is generating, the extension behaves erratically: the model's partial output becomes part of the user's prompt and generation spirals out of control. The model then keeps producing large amounts of text and usually only stops after many generations.
At a glance, the problem looks like multiple streams being triggered and writing to the same conversation simultaneously.
May want to cancel the request when a new prompt comes in?
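If cancelling is the right direction, here is a minimal sketch of what that could look like, assuming the extension streams completions over `fetch`. The class name, endpoint, and request body shape are hypothetical and not taken from the actual codebase; the point is just that each new prompt aborts the previous in-flight stream before starting its own.

```ts
// Hypothetical sketch: not the extension's real API, just the cancellation pattern.
class ChatSession {
  private controller: AbortController | null = null;

  async send(prompt: string): Promise<string> {
    // Abort any generation still in flight so a second prompt never
    // interleaves with (or re-feeds) the previous stream's output.
    this.controller?.abort();
    this.controller = new AbortController();

    let output = "";
    try {
      const response = await fetch("https://example.invalid/v1/completions", {
        method: "POST",
        body: JSON.stringify({ prompt }),
        signal: this.controller.signal,
      });
      const reader = response.body!.getReader();
      const decoder = new TextDecoder();
      while (true) {
        const { done, value } = await reader.read();
        if (done) break;
        output += decoder.decode(value, { stream: true });
      }
    } catch (err) {
      // An AbortError just means the user interrupted or sent a new prompt.
      if ((err as Error).name !== "AbortError") throw err;
    }
    return output;
  }
}
```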
To recreate:
- Enter a "complex" prompt ("Write me a Python implementation of Dijkstra's algorithm" here)
- Interrupt the model during generation ("stop" here)


