Outlet Langfuse Logging with chat_id Retention #430

resqnet · 2025-02-19T07:00:30Z

According to Issue #415, in the latest version, the pipe function is executed before the inlet function, resulting in the following problems:

Problems:
The inlet is expected to extract the chat_id from the request body and add or modify it in the body. However, since the pipe executes first, this process is not completed, leaving the chat_id absent.
The pipe function attempts to extract the chat_id from the body for processing, but without the correct chat_id, inconsistencies occur in log retrieval and various processes. Additionally, there is an issue where the inlet is not being called properly in the first place.
Proposal: To address the problem where the pipe function initiates before the chat_id extraction process in the inlet, we propose changing the log generation method to the outlet side. With this change, the chat_id will be correctly preserved on the outlet side, ensuring that Langfuse logs are reliably retrieved. Although future adjustments to the execution order and configurations should be considered, this approach of retrieving logs on the outlet side is proposed as a way to bypass the current issue.
Change Details:
Changed the implementation to generate Langfuse logs on the outlet side, removing dependency on the chat_id extraction process in the inlet.
Added processing to reliably obtain the chat_id from request information during log generation on the outlet and attach it to the log.
Verification:
Verified that the changes do not affect existing pipeline processing and that logs are retrieved correctly.

JTHesse · 2025-02-19T09:45:04Z

Thank you, this is working for me.
However, even after enabling "Usage" int the model, the usage information won't show in the outlet, but I guess that is another issue and not related to the pipeline.

ADD-Carlos-Zamora · 2025-02-19T10:56:54Z

examples/filters/langfuse_filter_pipeline_outlet.py

+
+            trace = self.langfuse.trace(
+                name=f"filter:{__name__}",
+                input=body,


Should last message be excluded from the input? If it is executed on the outlet, the body will contain every previous message plus the last generated one, which should not be part of the input. Same logic applies in the generation.

Thank you for the important feedback!

For now, I’ve addressed the issue by removing only the last message. I believe this approach is generally sufficient, but as you pointed out, it may not cover cases like retries or supplemental outputs where multiple assistant messages appear consecutively.

While it is possible to go back further, I chose this simpler approach to keep the example code straightforward. Depending on the application, the handling of messages may vary, but I believe this is sufficient as an example. If you have other approaches in mind, I’d be glad to discuss them as well.

I think it fits the general behaviour, thanks for your amazing work <3

ADD-Carlos-Zamora · 2025-02-19T11:05:11Z

examples/filters/langfuse_filter_pipeline_outlet.py

+
+        if (
+            body["chat_id"] not in self.chat_generations
+            or body["chat_id"] not in self.chat_traces


Is this cache needed? If the generation and trace objects are stored in the dictionary, retrieved and removed in the same method, can it just reuse the objects without the dictionaries?

I realized that I was using the cache out of habit from the original code, but as you pointed out, using variables is sufficient for this implementation. I’ve simplified the code accordingly. Thanks for helping me improve it.

… from input to maintain context integrity

resqnet · 2025-02-20T04:20:00Z

@JTHesse @ADD-Carlos-Zamora
Thank you for the feedback.

I’m glad to hear that it's working for you. Regarding the usage information not showing up, it seems to be a separate issue unrelated to this pipeline.
I believe this is a simple implementation example that aligns with the recent changes in openwebui specifications. If there are no issues, I’d appreciate it if you could merge it

add langfuse pipline outlet

ee52551

JTHesse mentioned this pull request Feb 19, 2025

Langfuse: Unable to send trace to selfhost server #431

Open

ADD-Carlos-Zamora reviewed Feb 19, 2025

View reviewed changes

Refactor: Remove unnecessary cache and exclude last assistant message…

eff7d5b

… from input to maintain context integrity

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Outlet Langfuse Logging with chat_id Retention #430

Outlet Langfuse Logging with chat_id Retention #430

Uh oh!

resqnet commented Feb 19, 2025

Uh oh!

JTHesse commented Feb 19, 2025

Uh oh!

ADD-Carlos-Zamora Feb 19, 2025

Uh oh!

resqnet Feb 20, 2025

Uh oh!

ADD-Carlos-Zamora Feb 20, 2025

Uh oh!

ADD-Carlos-Zamora Feb 19, 2025

Uh oh!

resqnet Feb 20, 2025

Uh oh!

resqnet commented Feb 20, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Outlet Langfuse Logging with chat_id Retention #430

Are you sure you want to change the base?

Outlet Langfuse Logging with chat_id Retention #430

Uh oh!

Conversation

resqnet commented Feb 19, 2025

Uh oh!

JTHesse commented Feb 19, 2025

Uh oh!

ADD-Carlos-Zamora Feb 19, 2025

Choose a reason for hiding this comment

Uh oh!

resqnet Feb 20, 2025

Choose a reason for hiding this comment

Uh oh!

ADD-Carlos-Zamora Feb 20, 2025

Choose a reason for hiding this comment

Uh oh!

ADD-Carlos-Zamora Feb 19, 2025

Choose a reason for hiding this comment

Uh oh!

resqnet Feb 20, 2025

Choose a reason for hiding this comment

Uh oh!

resqnet commented Feb 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

resqnet commented Feb 20, 2025 •

edited

Loading