feat: add per-model-request token usage tracking for LLM cost visibility by Kingsuperyzy · Pull Request #13726 · infiniflow/ragflow

Kingsuperyzy · 2026-03-20T09:15:34Z

What problem does this PR solve?

This PR adds per-request token usage tracking for model calls in RAGFlow. It intercepts model calls at the middleware layer and persists token consumption data to the database, giving request-level visibility into LLM usage costs.

Type of change

New Feature (non-breaking change which adds functionality)
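
To illustrate the idea in the description (intercept a model call, then persist its token counts), here is a minimal sketch. It is not the PR's implementation: the `token_usage` table, the `track_usage` decorator, and `fake_chat` are hypothetical names, and SQLite stands in for whatever database RAGFlow is configured to use.

```python
import sqlite3
import time
from functools import wraps


def init_db(conn):
    # Hypothetical schema; the PR's real table layout is not shown in this thread.
    conn.execute(
        """CREATE TABLE IF NOT EXISTS token_usage (
               id INTEGER PRIMARY KEY AUTOINCREMENT,
               model TEXT NOT NULL,
               prompt_tokens INTEGER NOT NULL,
               completion_tokens INTEGER NOT NULL,
               total_tokens INTEGER NOT NULL,
               created_at REAL NOT NULL
           )"""
    )


def track_usage(conn, model):
    """Wrap a model call and store one usage row per request."""
    def decorator(call):
        @wraps(call)
        def wrapper(*args, **kwargs):
            response = call(*args, **kwargs)
            usage = response.get("usage") or {}
            prompt = int(usage.get("prompt_tokens", 0))
            completion = int(usage.get("completion_tokens", 0))
            conn.execute(
                "INSERT INTO token_usage "
                "(model, prompt_tokens, completion_tokens, total_tokens, created_at) "
                "VALUES (?, ?, ?, ?, ?)",
                (model, prompt, completion, prompt + completion, time.time()),
            )
            conn.commit()
            return response
        return wrapper
    return decorator


conn = sqlite3.connect(":memory:")
init_db(conn)


@track_usage(conn, model="demo-model")
def fake_chat(prompt):
    # Stand-in for a real provider call; counts words instead of real tokens.
    return {
        "content": prompt.upper(),
        "usage": {"prompt_tokens": len(prompt.split()), "completion_tokens": 3},
    }


fake_chat("hello token tracking")
fake_chat("second request")
rows = conn.execute(
    "SELECT model, prompt_tokens, completion_tokens, total_tokens "
    "FROM token_usage ORDER BY id"
).fetchall()
print(rows)
```

Because the wrapper writes one row per call, cost can later be aggregated per request, per model, or over a time window with ordinary SQL.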


codecov · 2026-03-24T16:09:01Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 96.52%. Comparing base (384fa6f) to head (f82dd96).

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #13726      +/-   ##
==========================================
- Coverage   98.11%   96.52%   -1.60%     
==========================================
  Files          10       10              
  Lines         690      690              
  Branches      108      108              
==========================================
- Hits          677      666      -11     
- Misses          4        8       +4     
- Partials        9       16       +7


Lynn-Inf

I think it would be better to pass the biz_type and biz_id parameters when instantiating LLMBundle. Also, please make sure to use English for comments.

Kingsuperyzy · 2026-04-02T17:59:43Z

@Lynn-Inf
I’ve made the following updates based on your suggestions:

Pass biz_type, biz_id, and session_id parameters when instantiating LLMBundle across all production code paths.
Ensure all comments in modified files are written in English.
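
The review suggestion above can be pictured with a small sketch in which the business context is supplied once, at construction time, so that every usage record carries it. `TrackedBundle` and `UsageRecord` are hypothetical stand-ins, not RAGFlow's actual `LLMBundle`; only the parameter names `biz_type`, `biz_id`, and `session_id` come from this thread, and the word-count "tokens" are a placeholder.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class UsageRecord:
    biz_type: str
    biz_id: str
    session_id: Optional[str]
    model: str
    total_tokens: int


@dataclass
class TrackedBundle:
    """Hypothetical stand-in for a model wrapper that knows its business context."""
    model: str
    biz_type: str
    biz_id: str
    session_id: Optional[str] = None
    records: List[UsageRecord] = field(default_factory=list)

    def chat(self, prompt: str) -> str:
        # Placeholder model call: echoes the prompt and counts words as tokens.
        answer = f"echo: {prompt}"
        total = len(prompt.split()) + len(answer.split())
        self.records.append(
            UsageRecord(self.biz_type, self.biz_id, self.session_id, self.model, total)
        )
        return answer


bundle = TrackedBundle(
    model="demo-model", biz_type="dialog", biz_id="dialog-42", session_id="sess-1"
)
bundle.chat("how many tokens")
print(bundle.records[0])
```

Binding the identifiers in the constructor means call sites do not need to thread them through every `chat` invocation, which is presumably why the reviewer preferred that placement.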

Kingsuperyzy and others added 3 commits March 19, 2026 17:17

Fix: include usage tokens in streaming LLM calls to avoid inaccurate …

7dc331f

…estimation

Merge remote-tracking branch 'origin/main'

820eca4

feat: per-conversation token usage tracking for model requests

e3ba810

dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. 🌈 python Pull requests that update Python code 💞 feature Feature request, pull request that fulfills a new feature. labels Mar 20, 2026

feat: per-conversation token usage tracking for model requests

cfa8965

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. and removed size:XL This PR changes 500-999 lines, ignoring generated files. labels Mar 20, 2026

Kingsuperyzy marked this pull request as draft March 20, 2026 09:28

Kingsuperyzy marked this pull request as ready for review March 20, 2026 09:30

Kingsuperyzy added 2 commits March 23, 2026 09:43

Merge branch 'main' into main

39997db

Fix: include usage tokens in streaming LLM calls to avoid inaccurate …

39399ee

…estimation

yingfeng requested a review from Lynn-Inf March 24, 2026 15:47

yingfeng added the ci Continuous Integration label Mar 24, 2026

yingfeng marked this pull request as draft March 24, 2026 15:47

yingfeng marked this pull request as ready for review March 24, 2026 15:47

Kingsuperyzy and others added 4 commits March 25, 2026 16:40

Merge branch 'main' into main

554f92d

Merge branch 'main' into main

7bf2a05

Merge branch 'main' into main

065cb10

Fix: include usage tokens in streaming LLM calls to avoid inaccurate …

ddd5898

…estimation

Lynn-Inf reviewed Mar 30, 2026


Kingsuperyzy added 6 commits March 30, 2026 18:03

Merge branch 'infiniflow:main' into main

aafb99c

Merge branch 'infiniflow:main' into main

e5d9840

Merge branch 'infiniflow:main' into main

8a60314

Merge branch 'infiniflow:main' into main

b416cd4

Merge branch 'infiniflow:main' into main

936564c

Passing biz_type and biz_id when instantiating LLMBundle

7f9e277

dosubot bot removed the size:L This PR changes 100-499 lines, ignoring generated files. label Apr 2, 2026

dosubot bot added the size:XL This PR changes 500-999 lines, ignoring generated files. label Apr 2, 2026

Kingsuperyzy added 2 commits April 3, 2026 01:25

Passing biz_type and biz_id when instantiating LLMBundle

610b8dc

Passing biz_type and biz_id when instantiating LLMBundle

0720425

Passing biz_type and biz_id when instantiating LLMBundle

f82dd96
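
Several commits in the timeline above are titled "Fix: include usage tokens in streaming LLM calls". The diff is not shown here, so the following is only a sketch of one common approach: OpenAI-compatible chat APIs can be asked to report usage in a stream via `stream_options={"include_usage": True}`, in which case the final chunk has an empty `choices` list and a populated `usage` field. The chunks below are simulated dictionaries, and `consume_stream` is a hypothetical helper.

```python
from typing import Iterable, Optional, Tuple


def consume_stream(chunks: Iterable[dict]) -> Tuple[str, Optional[dict]]:
    """Concatenate streamed text and keep the last usage payload seen, if any."""
    text_parts = []
    usage = None
    for chunk in chunks:
        for choice in chunk.get("choices", []):
            content = choice.get("delta", {}).get("content")
            if content:
                text_parts.append(content)
        # With usage reporting enabled, only the final chunk carries usage.
        if chunk.get("usage"):
            usage = chunk["usage"]
    return "".join(text_parts), usage


# Simulated stream shaped like an OpenAI-compatible response (assumption).
simulated = [
    {"choices": [{"delta": {"content": "Hel"}}], "usage": None},
    {"choices": [{"delta": {"content": "lo"}}], "usage": None},
    {
        "choices": [],
        "usage": {"prompt_tokens": 5, "completion_tokens": 2, "total_tokens": 7},
    },
]

text, usage = consume_stream(simulated)
print(text, usage)
```

When a provider does not return usage in the stream, `usage` stays `None` and the caller has to fall back to estimating token counts, which is the inaccuracy the commit title refers to.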

Kingsuperyzy wants to merge 19 commits into infiniflow:main from Kingsuperyzy:main

4 participants
