Add multiple backend support #31

Merged
Kulikowski merged 1 commit into main from evals-supporting-different-backends
Feb 25, 2026

@Kulikowski
Contributor

  • Unified Backend Interface: Reintroduced a unified Backend interface and implemented dedicated backend handlers for gemini, ollama, and vercel, allowing flexible local and remote model execution.

  • Enhanced CLI Configuration: Updated the CLI commands (runevals and webmcpevals) and internal configuration objects to accept explicit --backend (e.g., gemini, ollama, vercel) and --provider (e.g., google, openai) flags.

  • Resolved Vercel AI SDK Multi-Turn Crashes: Fixed the AI_InvalidPromptError thrown during multi-turn evaluations by updating the message mappers (src/evaluator/mappers.ts).

  • Temporary Browser Execution Guardrails: Added runtime validation so that executeInBrowserEvals runs only with the vercel backend, since it relies heavily on the Vercel AI SDK's DOM bridging and ToolLoopAgent.
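
For readers who want a concrete picture of the first, second, and fourth points, here is a minimal sketch of how a unified backend interface, `--backend` flag resolution, and the browser guardrail could fit together. Every name in it (`Backend`, `ChatMessage`, `createBackend`, `assertBrowserEvalBackend`, and the stub handlers) is hypothetical and chosen for illustration; it is not the code from this PR, and the stub handlers echo input instead of calling any model.

```typescript
// Hypothetical sketch: all names and shapes here are assumptions, not this PR's code.
type BackendName = "gemini" | "ollama" | "vercel";

interface ChatMessage {
  role: "user" | "assistant";
  content: string;
}

// One interface that every backend handler implements.
interface Backend {
  readonly name: BackendName;
  generate(messages: ChatMessage[]): Promise<string>;
}

// Stub handler so the sketch runs offline; a real handler would call its SDK or API.
function makeStubBackend(name: BackendName): Backend {
  return {
    name,
    async generate(messages) {
      const last = messages[messages.length - 1];
      return `[${name}] echo: ${last ? last.content : ""}`;
    },
  };
}

const BACKEND_NAMES: readonly BackendName[] = ["gemini", "ollama", "vercel"];

function isBackendName(value: string): value is BackendName {
  return (BACKEND_NAMES as readonly string[]).indexOf(value) !== -1;
}

// Resolve the raw value of a --backend flag to a handler, rejecting unknown names.
function createBackend(flag: string): Backend {
  if (!isBackendName(flag)) {
    throw new Error(
      `Unknown backend "${flag}"; expected one of: ${BACKEND_NAMES.join(", ")}`
    );
  }
  return makeStubBackend(flag);
}

// Guardrail: in-browser evals are allowed only on the vercel backend.
function assertBrowserEvalBackend(backend: Backend): void {
  if (backend.name !== "vercel") {
    throw new Error(
      `In-browser evals require the vercel backend, got "${backend.name}"`
    );
  }
}
```

Under these assumptions, `assertBrowserEvalBackend(createBackend("vercel"))` passes silently, while passing a backend created from `"ollama"` or `"gemini"` throws before any browser work starts. Failing early keeps the error message close to the misconfigured flag.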

@Kulikowski requested a review from andreban February 25, 2026 17:30
@Kulikowski force-pushed the evals-supporting-different-backends branch 3 times, most recently from 8b7c43b to 3cdc1bf February 25, 2026 17:38
@Kulikowski force-pushed the evals-supporting-different-backends branch from 3cdc1bf to 97da77f February 25, 2026 17:55
Member

@andreban left a comment

LGTM

@Kulikowski merged commit 611a2b1 into main Feb 25, 2026
2 checks passed
@beaufortfrancois deleted the evals-supporting-different-backends branch February 26, 2026 09:34