Migrate to Vercel AI SDK and enable experimental multi-step tool execution by Kulikowski · Pull Request #28 · GoogleChromeLabs/webmcp-tools

Kulikowski · 2026-02-25T09:55:52Z

Replaced direct API bindings with the AI SDK by Vercel, unlocking native structured tool-calling support.

Updated the evaluation loop and JSON format to support sequentially validating arrays of expectedCalls.

Modularized Evaluator: Split the monolithic evaluator.ts into a clean src/evaluator/ directory:

models.ts: Instantiates LLM backends.
browser.ts: Extracts Puppeteer web integration.
mappers.ts: Normalizes raw extensions schemas into strict AI SDK interfaces.
prompts.ts: Extracts static system context.
index.ts: The core orchestrator.

* De-dupe `findChromePath` logic * Replace custom backends with AI SDK

Kulikowski · 2026-02-25T13:48:18Z

As discussed with @andreban we will add possibility of extending possible backends treating Vercel AI SDK as one of the possible backends, coming with next PRs :)

andreban

LGTM, with comment on reintroducing the Backend abstraction in the next PR, so it's easy to add LLMs not supported by the Vercel SDK.

Kulikowski and others added 13 commits February 23, 2026 22:38

Support for multiple expected calls in browser

5f7b328

Migrating evals to use arrays of calls

ddf9457

Handling array of expectedCall in reports and execution logs

02d2418

Replace custom backends with AI SDK (#26)

4fc1799

* De-dupe `findChromePath` logic * Replace custom backends with AI SDK

Avoid returning undefined in getModel

8e4d471

Keeping options between runs

4e6ef0e

Fixing execution log for sidecar ui

65becf2

Update packages

70d1461

Improved tool calling

e792cc5

Variable name change

a6e2a04

Refactoring of evaluator into separate concerned files

5fee48e

Rename mct to modelContext

2d9dc37

Updating README.md

e7c273b

Kulikowski requested review from andreban, beaufortfrancois and swissspidy February 25, 2026 11:15

Adding example evals

d72ab51

Kulikowski changed the title ~~Evals multiple expected calls~~ Migrate to Vercel AI SDK and enable experimental multi-step tool execution Feb 25, 2026

Progress bar should never exceed 100 percent

60a90f4

Kulikowski marked this pull request as ready for review February 25, 2026 11:29

andreban approved these changes Feb 25, 2026

View reviewed changes

Kulikowski merged commit 03d8319 into main Feb 25, 2026
2 checks passed

beaufortfrancois deleted the evals-multiple-expected-calls branch February 25, 2026 13:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Migrate to Vercel AI SDK and enable experimental multi-step tool execution#28

Migrate to Vercel AI SDK and enable experimental multi-step tool execution#28
Kulikowski merged 15 commits intomainfrom
evals-multiple-expected-calls

Kulikowski commented Feb 25, 2026 •

edited

Loading

Uh oh!

Kulikowski commented Feb 25, 2026

Uh oh!

andreban left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Kulikowski commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Kulikowski commented Feb 25, 2026

Uh oh!

andreban left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Kulikowski commented Feb 25, 2026 •

edited

Loading