feat(core): introduce `ToolSchema` as root schema cache; replace TypedDict conversion with `TypeAdapter` by sydney-runkle · Pull Request #37103 · langchain-ai/langchain

Sydney Runkle (sydney-runkle) · 2026-04-30T15:39:26Z

Builds on #37101.

Two changes in one commit, both motivated by the same principle: a single, clean owner for everything schema-related on a tool.

`ToolSchema` — the root cache

Previously BaseTool had three independent cached_property slots (tool_call_schema, args, _approximate_schema_chars) that all computed overlapping data and each needed individual invalidation. This PR replaces them with a single ToolSchema dataclass and one tool_schema cached property that is the sole root:

@dataclass
class ToolSchema:
    name: str
    description: str
    validator: TypeAdapter      # validates tool call inputs
    json_schema: dict           # sent to LLMs
    pydantic_schema: Any        # model class or dict (backward compat)
    args: dict                  # properties from json_schema
    approximate_chars: int      # precomputed for token estimation

BaseTool.tool_call_schema, BaseTool.args, and BaseTool._approximate_schema_chars are now plain @property delegates to tool_schema. __setattr__ only needs to pop one key on mutation instead of four. The is-identity caching tests still pass because all delegates read from the same cached ToolSchema object.

ToolSchema is exported from langchain_core.tools and can be used directly by integrations that want to consume both the validator and the schema without going through BaseTool.

`TypeAdapter`-based TypedDict conversion

_convert_any_typed_dicts_to_pydantic was a ~70-line recursive function that converted TypedDicts to throwaway pydantic v1 model classes just to call .schema(). Replaced with:

adapter = TypeAdapter(typed_dict)
schema = adapter.json_schema()

Pydantic v2's TypeAdapter handles everything the old code did — nested TypedDicts, generic containers, Annotated metadata — and also correctly handles NotRequired and Required annotations, which the v1 path did not. A new test test__convert_typed_dict_not_required verifies this:

class Tool(TypedDict):
    required_field: str
    optional_field: NotRequired[int]

result = _convert_typed_dict_to_openai_function(Tool)
assert "required_field" in result["parameters"]["required"]
assert "optional_field" not in result["parameters"]["required"]

Field descriptions from Google-style docstrings and Annotated[T, ..., "description"] metadata are preserved by post-processing the schema after generation.

The old test__convert_typed_dict_to_openai_function_fail test expected a TypeError for MutableSet because pydantic v1 didn't support it. pydantic v2 does; the test is updated to verify successful conversion instead.

What stays unchanged

All public BaseTool API signatures — tool_call_schema, args, get_input_schema() all have the same signatures and return types as before.
pydantic.v1 acceptance for args_schema — tools with v1 model schemas continue to work.

AI-agent assisted contribution.

codspeed-hq · 2026-04-30T15:43:54Z

Merging this PR will not alter performance

✅ 13 untouched benchmarks
⏩ 2 skipped benchmarks¹

_{Comparing feat/tool-schema (b82e263) with perf/tool-schema-refactor (c832f7c)}

2 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports. ↩

…dDict v1 conversion with `TypeAdapter`

… tests `TypeAdapter` requires `typing_extensions.TypedDict` on Python < 3.12. Switch all test fixtures and parametrized cases to use `typing_extensions.TypedDict` so the suite passes on Python 3.10/3.11. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

Eugene Yurtsev (eyurtsev) · 2026-05-01T14:21:08Z

-            self.__dict__.pop("_approximate_schema_chars", None)
        super().__setattr__(name, value)

+    @functools.cached_property


Should we turn this into an LRU instead of a cached_property to reduce chance that user code has a memory leak? (or do we assume that the memory foot print is similar to the foot print of a tool instance?

Eugene Yurtsev (eyurtsev) · 2026-05-01T14:22:12Z

+            description=self.description or "",
+            validator=TypeAdapter(self.get_input_schema()),
+            json_schema=json_schema,
+            pydantic_schema=pydantic_schema,


we'll need to be able to distinguish input schema from output schema in general. It's not needed everywhere, but I think it's generally a good thing to do and be clear about

Eugene Yurtsev (eyurtsev) · 2026-05-01T14:25:48Z

+
+
+@dataclass
+class ToolSchema:


Could we distinguish inputs from outputs for schema? (OK not to introduce outputs yet if we don't support, but probably helpful to have the attributes named well so it's clear what is input vs. output

I think we need to be more explicit about injected arguments, so it's easy to determine which arguments are injected?

Sydney Runkle (sydney-runkle) requested a review from Eugene Yurtsev (eyurtsev) as a code owner April 30, 2026 15:39

github-actions Bot added core `langchain-core` package issues & PRs feature For PRs that implement a new feature; NOT A FEATURE REQUEST internal size: L 500-999 LOC labels Apr 30, 2026

Sydney Runkle (sydney-runkle) force-pushed the feat/tool-schema branch from a326816 to 16cc120 Compare April 30, 2026 19:42

Sydney Runkle (sydney-runkle) added 2 commits April 30, 2026 15:52

feat(core): introduce ToolSchema as root schema cache; replace Type…

77b4020

…dDict v1 conversion with `TypeAdapter`

chore(core): fix lint and mypy errors on feat/tool-schema

a62af07

Sydney Runkle (sydney-runkle) force-pushed the feat/tool-schema branch from 16cc120 to a62af07 Compare April 30, 2026 19:54

Sydney Runkle (sydney-runkle) merged commit dc7a009 into perf/tool-schema-refactor May 1, 2026
92 checks passed

Sydney Runkle (sydney-runkle) deleted the feat/tool-schema branch May 1, 2026 13:25

Eugene Yurtsev (eyurtsev) reviewed May 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(core): introduce `ToolSchema` as root schema cache; replace TypedDict conversion with `TypeAdapter`#37103

feat(core): introduce `ToolSchema` as root schema cache; replace TypedDict conversion with `TypeAdapter`#37103
Sydney Runkle (sydney-runkle) merged 3 commits into
perf/tool-schema-refactorfrom
feat/tool-schema

Sydney Runkle (sydney-runkle) commented Apr 30, 2026

Uh oh!

codspeed-hq Bot commented Apr 30, 2026 •

edited

Loading

Uh oh!

Uh oh!

Eugene Yurtsev (eyurtsev) May 1, 2026

Uh oh!

Eugene Yurtsev (eyurtsev) May 1, 2026

Uh oh!

Eugene Yurtsev (eyurtsev) May 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Sydney Runkle (sydney-runkle) commented Apr 30, 2026

ToolSchema — the root cache

TypeAdapter-based TypedDict conversion

What stays unchanged

Uh oh!

codspeed-hq Bot commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merging this PR will not alter performance

Footnotes

Uh oh!

Uh oh!

Eugene Yurtsev (eyurtsev) May 1, 2026

Choose a reason for hiding this comment

Uh oh!

Eugene Yurtsev (eyurtsev) May 1, 2026

Choose a reason for hiding this comment

Uh oh!

Eugene Yurtsev (eyurtsev) May 1, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

`ToolSchema` — the root cache

`TypeAdapter`-based TypedDict conversion

codspeed-hq Bot commented Apr 30, 2026 •

edited

Loading