Provider Coverage Matrix

This page records which dd-agents flows have been verified on which providers. dd-agents is model- and provider-agnostic through environment configuration (see Model Providers); the tables below list only what has actually been exercised live, so claims stay honest.

How routing works (recap)

Every LLM call is configured through a single seam (`dd_agents.llm.build_agent_options`) and run through `claude_agent_sdk`, which speaks the Anthropic Messages wire protocol. Native providers (Anthropic API, Bedrock, Vertex) run Claude; an Anthropic-compatible gateway (e.g. LiteLLM) can front any model. Verify your own setup with `dd-agents doctor --probe`.
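As a rough illustration of env-driven routing, the sketch below classifies where calls would go based on environment variables. It is not dd-agents' actual logic: `describe_routing` is a hypothetical helper, the `CLAUDE_CODE_USE_BEDROCK` / `CLAUDE_CODE_USE_VERTEX` flag names and the precedence order are assumptions, and only `ANTHROPIC_BASE_URL` is taken from this page. Treat `dd-agents doctor --probe` as the real check.

```python
from typing import Mapping


def describe_routing(env: Mapping[str, str]) -> str:
    """Guess where LLM calls would be routed, given environment variables.

    Hypothetical helper: flag names and precedence are assumptions,
    not taken from the dd-agents source.
    """

    def flag(name: str) -> bool:
        # Treat common truthy spellings as "enabled".
        return env.get(name, "").strip().lower() in {"1", "true", "yes"}

    if flag("CLAUDE_CODE_USE_BEDROCK"):  # assumed native-Bedrock flag
        return "native Bedrock (Claude)"
    if flag("CLAUDE_CODE_USE_VERTEX"):  # assumed native-Vertex flag
        return "native Vertex (Claude)"

    base_url = env.get("ANTHROPIC_BASE_URL", "").strip()
    if base_url:
        # Any Anthropic-compatible endpoint, e.g. a LiteLLM proxy.
        return f"Anthropic-compatible gateway at {base_url}"

    return "Anthropic API (SDK default transport)"
```

Calling `describe_routing(os.environ)` before a run gives a quick, offline guess at which column of the coverage table applies to your setup.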

Flows

The seven LLM-calling flows (one per build_agent_options call site):

Flow	CLI surface
Provider probe	`dd-agents doctor --probe`
Single-question query	`dd-agents query`
Contract search	`dd-agents search`
Full pipeline (9 specialists + synthesis + judge)	`dd-agents run`
Auto-config	`dd-agents auto-config`
Interactive chat	`dd-agents chat`
Vision/image extraction fallback	`dd-agents run` (scanned docs)

Verified live

Legend: ✅ verified live · ⚙️ exercised via the shared seam (same code path) · 🔑 not run here (no credentials) · ⚠️ runs, output quality model-dependent.

Flow	Native Bedrock (Claude)	Gateway → Bedrock-Claude	Gateway → DeepSeek (non-Claude)	Anthropic API / Vertex
`doctor --probe`	✅	✅	✅	🔑
`query`	✅	✅	✅	🔑
`search`	✅	✅	⚠️	🔑
full `run` (38 steps)	⚙️	✅	✅	🔑
`auto-config`	⚙️	⚙️	⚙️	🔑
`chat`	⚙️	⚙️	⚙️	🔑
vision extraction	⚙️	⚙️	⚙️	🔑

Notes:

Full run is proven end-to-end through a gateway on BOTH Claude and a non-Claude model (DeepSeek v3.2 on Bedrock): 38/38 steps, all gates, real multi-domain findings (40+), HTML + Excel reports, and the audit receipt records the provider/model. The same engine drives native Bedrock, so that column is ⚙️ (identical code path, not separately driven end-to-end here).
`search` on a weak or non-Claude model (⚠️): the command completes and degrades gracefully, but a model that ignores the JSON contract (e.g. DeepSeek emitting native tool-call markup) yields partial columns. dd-agents recovers JSON from prose and tool-call markup where possible; final fidelity is the model's responsibility. Prefer a strong tool-calling model, and validate it with `dd-agents doctor --probe` and a trial search first.
Anthropic API direct / Vertex (🔑): not run here for lack of credentials. Both ride the identical seam + wire protocol as the verified providers; Anthropic-direct is the SDK's default transport and Vertex is a native CLI routing flag.
`auto-config` / `chat` / vision (⚙️): not separately driven end-to-end in this pass; each builds its options through the same seam as the ✅ flows, so provider routing is identical. If you are bringing up a new provider, run `dd-agents doctor` before relying on these flows.
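The idea of recovering JSON from prose or tool-call markup, mentioned in the `search` note above, can be sketched as follows. This is a minimal illustration, not the extractor dd-agents ships: `recover_json_object` is a hypothetical name, and the strategy (try to decode at every `{` until one parses) is an assumption about how such a fallback might work.

```python
import json
from typing import Any, Dict, Optional


def recover_json_object(text: str) -> Optional[Dict[str, Any]]:
    """Return the first JSON object embedded in free-form model output.

    Hypothetical sketch: scans for '{' and lets the standard-library
    decoder decide whether a valid object starts at that position.
    """
    decoder = json.JSONDecoder()
    start = text.find("{")
    while start != -1:
        try:
            # raw_decode parses one JSON value starting at `start`
            # and ignores whatever trails it (prose, closing markup).
            value, _end = decoder.raw_decode(text, start)
        except json.JSONDecodeError:
            pass  # not valid JSON here; try the next brace
        else:
            if isinstance(value, dict):
                return value
        start = text.find("{", start + 1)
    return None
```

For example, given a reply such as `Here you go: <tool_call>{"columns": {"term": "5 years"}}</tool_call>`, the function returns the inner object and ignores the surrounding prose and markup. When nothing parses it returns `None`, which a caller could map to the partial-columns behaviour described above.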

Model capability tiers

Which backing model you put behind the seam matters: the pipeline leans on tool-use, structured (JSON-schema) output, and, for scanned documents, vision. This matrix records what has been validated, so a bring-your-own-model (BYO-model) buyer can choose with confidence instead of discovering gaps at runtime. Legend: ✅ validated · ⚠️ partial / model-dependent · ❓ untested here.

Model (family)	Tool-use	Structured output (JSON)	Vision	Full 38-step run
Claude (Anthropic API / Bedrock / Vertex / gateway)	✅	✅	✅	✅
DeepSeek v3.2 via gateway (non-Claude)	⚠️	⚠️	❓	✅
GPT / Gemini via gateway	❓	❓	❓	❓

Notes:

Claude is the reference tier — full fidelity on every flow, native or via a gateway.
DeepSeek (and weaker models): the full pipeline completes and produces substantive findings, but a model that ignores the JSON contract (e.g. emits native tool-call markup) yields partial search columns — handled gracefully, not a crash. The structured-output fallback (prompt-instructed JSON + a robust extractor) mitigates this; final fidelity is the model's responsibility.
GPT / Gemini via gateway: reachable by construction (same wire protocol) but not validated live here — verify with dd-agents doctor --probe + a trial search before a production run.
This matrix is a point-in-time validation record, not a guarantee. The authoritative, continuous checks are the gateway end-to-end test (selected with `-m gateway` and run on every push to main by the Gateway Provider Proof CI job) and your own `dd-agents doctor --probe`.
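One way to use the tier table before a run is to encode it as data and list the capabilities that are not fully validated for your model. The sketch below is illustrative only: the `TIERS` keys, the status strings, and `capability_gaps` are hypothetical names, and the values simply mirror the table above as of this writing.

```python
from typing import Dict, List

VALIDATED, PARTIAL, UNTESTED = "validated", "partial", "untested"

# Mirrors the capability table above; the keys are illustrative labels.
TIERS: Dict[str, Dict[str, str]] = {
    "claude": {
        "tool_use": VALIDATED,
        "structured_output": VALIDATED,
        "vision": VALIDATED,
    },
    "deepseek-v3.2": {
        "tool_use": PARTIAL,
        "structured_output": PARTIAL,
        "vision": UNTESTED,
    },
    "gpt-or-gemini": {
        "tool_use": UNTESTED,
        "structured_output": UNTESTED,
        "vision": UNTESTED,
    },
}


def capability_gaps(family: str, scanned_docs: bool) -> List[str]:
    """List capabilities the pipeline needs that are not fully validated."""
    needed = ["tool_use", "structured_output"]
    if scanned_docs:
        needed.append("vision")  # only scanned documents need vision
    row = TIERS.get(family, {})  # unknown families count as untested
    return [
        f"{cap}: {row.get(cap, UNTESTED)}"
        for cap in needed
        if row.get(cap, UNTESTED) != VALIDATED
    ]
```

A non-empty result is a prompt to run `dd-agents doctor --probe` and a trial search before committing to a production run; it says nothing about how the model will actually behave.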

Reproduce

Stand up a gateway and point dd-agents at it (the recipe in examples/litellm-gateway/), then:

export ANTHROPIC_BASE_URL=http://localhost:4011 ANTHROPIC_AUTH_TOKEN=sk-anything
dd-agents doctor --probe                 # confirm routing + a live round-trip
dd-agents run deal-config.json           # full pipeline through your provider

The gateway end-to-end test (`tests/e2e/test_gateway_provider.py -m gateway`) is the automated version of the `doctor --probe` check. It runs on every push to main: the Gateway Provider Proof CI job stands up a LiteLLM proxy in front of the Anthropic Messages API and round-trips a real query. To run it against your own gateway, set `DD_TEST_GATEWAY_URL`.
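If you prefer to launch the gateway test from a script, the sketch below builds the command and environment for it. It assumes the test is run with pytest (suggested by the `-m gateway` marker syntax, but not stated on this page) and from the repository root; `gateway_test_invocation` is a hypothetical helper, not part of dd-agents.

```python
import sys
from typing import Dict, List, Mapping, Tuple


def gateway_test_invocation(
    gateway_url: str, base_env: Mapping[str, str]
) -> Tuple[List[str], Dict[str, str]]:
    """Build (command, environment) for running the gateway test.

    Hypothetical helper; assumes pytest is the test runner.
    """
    if not gateway_url.startswith(("http://", "https://")):
        raise ValueError(f"expected an http(s) URL, got {gateway_url!r}")

    env = dict(base_env)  # copy so the caller's mapping is untouched
    env["DD_TEST_GATEWAY_URL"] = gateway_url

    cmd = [
        sys.executable, "-m", "pytest",
        "tests/e2e/test_gateway_provider.py",
        "-m", "gateway",  # select only the gateway-marked tests
    ]
    return cmd, env
```

The pair can then be passed to `subprocess.run(cmd, env=env, check=True)`, with `base_env` usually being `os.environ` so that credentials already exported in your shell carry through.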
