fix: handle lambda, postponed annotations, and subclass types in trim_messages token_counter #35640
Conversation
The annotation check in `trim_messages` used `is BaseMessage`, which only matched the exact `BaseMessage` class. This broke for:

- Lambdas (no annotation → `inspect.Parameter.empty`)
- Postponed annotations (`from __future__ import annotations` → string)
- Subclass annotations (e.g. `HumanMessage` is not `BaseMessage`)

Extract the detection logic into `_is_per_message_counter()`, which handles all three cases correctly.

Fixes langchain-ai#35629
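The three failure modes can be demonstrated in a self-contained sketch. The `BaseMessage`/`HumanMessage` classes below are stand-ins for the real `langchain_core.messages` classes, and `first_param_annotation` is a hypothetical helper for illustration:

```python
import inspect


class BaseMessage:
    """Stand-in for langchain_core.messages.BaseMessage (not the real class)."""


class HumanMessage(BaseMessage):
    """Stand-in subclass."""


def first_param_annotation(fn):
    """Return the annotation of fn's first parameter, as `inspect` reports it."""
    params = list(inspect.signature(fn).parameters.values())
    return params[0].annotation


# Case 1: a lambda has no annotation at all.
counter_lambda = lambda msg: 1
assert first_param_annotation(counter_lambda) is inspect.Parameter.empty

# Case 2: with postponed annotations the annotation is the *string* 'BaseMessage'.
def counter_postponed(msg: "BaseMessage") -> int:
    return 1

assert first_param_annotation(counter_postponed) == "BaseMessage"

# Case 3: a subclass annotation is a different class object than BaseMessage.
def counter_subclass(msg: HumanMessage) -> int:
    return 1

assert first_param_annotation(counter_subclass) is HumanMessage

# The buggy identity check is False in every case, so each counter was
# wrongly treated as a list-level counter.
for fn in (counter_lambda, counter_postponed, counter_subclass):
    assert not (first_param_annotation(fn) is BaseMessage)
```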
Merging this PR will improve performance by 39.56%
| Mode | Benchmark | BASE | HEAD | Efficiency |
|---|---|---|---|---|
| ⚡ WallTime | test_import_time[tool] | 587.4 ms | 474.1 ms | +23.9% |
| ⚡ WallTime | test_async_callbacks_in_sync | 25.6 ms | 18.3 ms | +39.56% |
| ⚡ WallTime | test_import_time[LangChainTracer] | 485.8 ms | 407.6 ms | +19.19% |
| ⚡ WallTime | test_import_time[RunnableLambda] | 537.2 ms | 441.4 ms | +21.71% |
| ⚡ WallTime | test_import_time[HumanMessage] | 279.1 ms | 235.4 ms | +18.55% |
| ⚡ WallTime | test_import_time[Runnable] | 538 ms | 443.9 ms | +21.21% |
| ⚡ WallTime | test_import_time[InMemoryVectorStore] | 650.8 ms | 536.3 ms | +21.36% |
| ⚡ WallTime | test_import_time[ChatPromptTemplate] | 682.8 ms | 542.4 ms | +25.89% |
| ⚡ WallTime | test_import_time[PydanticOutputParser] | 583 ms | 473.2 ms | +23.21% |
| ⚡ WallTime | test_import_time[Document] | 198 ms | 166.3 ms | +19.1% |
| ⚡ WallTime | test_import_time[CallbackManager] | 341.3 ms | 282.4 ms | +20.85% |
| ⚡ WallTime | test_import_time[InMemoryRateLimiter] | 186 ms | 154.5 ms | +20.38% |
| ⚡ WallTime | test_import_time[BaseChatModel] | 573.8 ms | 478.8 ms | +19.83% |
Comparing atian8179:fix/trim-messages-lambda-counter (e97b42f) with master (29134dc)[^2]

Footnotes

[^1]: 23 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, archive them to remove them from the performance reports.
[^2]: No successful run was found on master (27651d9) during the generation of this report, so 29134dc was used instead as the comparison base. There might be some changes unrelated to this pull request in this report.
Hi! Creator of the issue here :) Thanks for working on this; however, I already had a PR open to fix this when I opened the issue. Please try to avoid duplicates :)
ccurme (ccurme) left a comment
Duplicated with #35630.
Problem
`trim_messages` breaks when `token_counter` is a lambda, uses `from __future__ import annotations`, or has a `BaseMessage` subclass annotation (e.g. `HumanMessage`). The root cause is the annotation check:
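The code block with the check did not survive extraction; the following is a hedged reconstruction of the pattern described (the function name `is_per_message_buggy` and the stand-in classes are illustration-only assumptions, not the library source):

```python
import inspect


class BaseMessage:  # stand-in for langchain_core.messages.BaseMessage
    pass


def is_per_message_buggy(token_counter) -> bool:
    """Reconstruction of the old check: identity comparison with BaseMessage."""
    params = list(inspect.signature(token_counter).parameters.values())
    return bool(params) and params[0].annotation is BaseMessage


def exact(msg: BaseMessage) -> int:
    return 1


# Only an exact `BaseMessage` annotation passes; everything else falls
# through to the list-level branch, and calling the counter there later
# raises the TypeError described below.
assert is_per_message_buggy(exact)
assert not is_per_message_buggy(lambda msg: 1)
```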
This uses identity comparison (`is`), which only matches the exact `BaseMessage` class. It fails for:

- Lambdas: no annotation, so the annotation is `inspect.Parameter.empty`
- Postponed annotations (`from __future__ import annotations`): the annotation is the string `'BaseMessage'`
- Subclass annotations (e.g. `HumanMessage`): `HumanMessage is BaseMessage` is `False`

In all cases, the function incorrectly treats the callable as a list-level counter, causing a `TypeError`.

Solution
Extract the detection logic into a `_is_per_message_counter()` helper that handles all three cases:

- `inspect.Parameter.empty` → assume per-message (lambdas, bare defs)
- A `str` annotation containing "message" → postponed annotations
- A `type` that is a subclass of `BaseMessage` → subclass annotations

Fixes #35629