External Agent Adapters — Candidate (RFC)

Status: accepted for alpha MVP. Adapter discovery, Agent Chat, memory/SQLite persistence, long-lived ACP sessions, live streaming, cancellation, raw diagnostics, workspace diff capture, approval prompts, readiness probes, version warnings, and session guardrails are implemented. Each prompt has stable run metadata on the assistant message, and each chat records the native ACP session id reused across turns. API shape may still change before a stable release. Related: ACP bridge, Runtime API, Agent runtime, Agent event protocol. Owner: see AGENTS.md.

This RFC defines how Hecate should let an operator chat with external coding agents such as Codex CLI, Claude Code, Cursor Agent, and future agent CLIs without pretending those agents are model providers.

The core distinction:

Concept	Examples	What Hecate controls
Model provider	OpenAI, Anthropic, Ollama, LM Studio	Request routing, pricebook, provider health, model choice
Agent adapter	Codex ACP, Claude ACP, Cursor Agent ACP, future ACP-capable coding agents	Process lifecycle, workspace, prompt/session flow, output capture, diff capture
Protocol adapter	ACP, MCP, OpenAI-compatible HTTP, Anthropic Messages	How another system talks to or from Hecate

Providers answer LLM calls. Agent adapters drive coding-agent loops.

Problem

Hecate already has two strong surfaces:

Chats — model/provider conversations routed through the gateway.
Tasks — durable agent/runtime work with approvals, events, artifacts, and workspace state.

Using Hecate with Codex, Claude Code, or Cursor Agent needs a third shape that is conversation-first like Chats but runtime-aware like Tasks. A user wants to type in Hecate and get a response from Codex, Claude Code, or Cursor Agent, while Hecate still records what happened, captures output, and eventually shows diffs and later artifacts.

Putting Codex, Claude Code, or Cursor Agent in the provider/model dropdown would be wrong:

They are full agents, not models.
They carry their own tool loop and permission model.
They may own their own credentials and provider routing.
Their costs may be externally managed or opaque to Hecate.

Goals

Add a product and backend seam for Agent Chat alongside Model Chat.
Support Codex, Claude, and Cursor Agent through ACP-capable adapters first.
Keep provider/model routing unchanged.
Let Hecate supervise external agent sessions: start, stream, cancel, timeout, capture exit status.
Store enough run/session state that UI and future clients can replay the conversation.
Capture ACP updates as runtime output first, then richer structured events as the adapter surface matures.
Normalize ACP output into readable transcript text without discarding the raw diagnostic stream needed for future debugging.
Capture workspace diff after a run when the workspace is a Git repo.
Use ACP for outbound external-agent sessions when an adapter is available.

Non-goals

Do not make Codex, Claude Code, or Cursor Agent fake providers.
Do not add a second one-shot CLI compatibility layer while the project is still alpha.
Do not claim exact cost accounting for external agents until the adapter can report it.
Do not build a plugin marketplace or broad agent-runtime SDK yet.
Do not support remote multi-user agent sessions in this RFC.

Recommended Shape

Start with ACP session adapters.

Hecate Chats
  -> Target: External Agent
  -> Agent adapter: Codex / Claude Code / Cursor Agent
  -> Workspace
  -> Prompt
  -> Native ACP session
  -> Streamed output + captured diff

The implementation keeps one adapter process and one native ACP session alive per External Agent chat session. Each prompt becomes the next ACP turn in that session.

UI Model

Chats exposes External Agent as a top-level target next to Hecate Chat:

Target: Hecate Chat | External Agent

When Hecate Chat is selected, the provider/model controls remain and the tools toggle decides whether the prompt is direct model chat or Hecate Agent task execution.

When External Agent is selected, the primary controls become:

Agent: Codex | Claude Code | Cursor Agent
Workspace: /path/to/repo
Prompt: message

The conversation transcript should remain one surface. Runtime metadata should show that this response came from an agent adapter, not a provider/model route:

Codex · external agent
Workspace: /Users/.../hecate
Cost: external / unknown
Patch: 3 files changed

Backend Model

The adapter code lives behind internal/agentadapters/ without coupling to provider routing or internal/api request structs:

internal/agentadapters/
  acp_session.go
  approvals.go
  errors.go
  probe.go
  registry.go
  version.go

The current runtime shape is ACP-first:

registry.go declares built-in adapters, direct commands, managed launcher metadata, tested version ranges, and lightweight auth hints.
probe.go performs the explicit "can this adapter really start?" check by spawning the adapter, completing ACP initialize, opening a no-op session, and classifying the result.
acp_session.go owns the long-lived adapter process, native ACP session, prompt turns, streaming update normalization, cancellation, shutdown, usage updates, raw diagnostics, and Git diff capture.
approvals.go maps ACP RequestPermission into Hecate's external-agent approval rows, grants, REST/SSE surfaces, and OTel metrics.

API handlers translate HTTP shapes into adapter/session manager calls. The adapter package remains independent from provider routing: external coding agents are not model providers, and provider APIs are not threaded into this path unless a future adapter explicitly opts in.

API Options

Two options are plausible.

Option	Shape	Pros	Cons
Add agent mode to chat sessions	Extend `/hecate/v1/chat/sessions` with `target_type=model	agent`	One user-facing Chats surface; easier history
Add explicit agent-chat API	`/hecate/v1/agent-chat/sessions/*`	Clean boundary; easy to change during alpha	UI has to bridge two chat APIs

Recommendation for alpha: explicit agent-chat API first. Once behavior is stable, Chats UI can render both model-chat and agent-chat sessions behind one experience.

Implemented MVP endpoints:

GET  /hecate/v1/agent-adapters
POST /hecate/v1/agent-adapters/{id}/probe
GET  /hecate/v1/agent-adapters/{id}/health
POST /hecate/v1/agent-adapters/{id}/refresh-launcher
GET  /hecate/v1/agent-chat/sessions
POST /hecate/v1/agent-chat/sessions
GET  /hecate/v1/agent-chat/sessions/{id}
GET  /hecate/v1/agent-chat/sessions/{id}/stream
POST /hecate/v1/agent-chat/sessions/{id}/messages
GET  /hecate/v1/agent-chat/sessions/{id}/messages/{message_id}/files
GET  /hecate/v1/agent-chat/sessions/{id}/messages/{message_id}/files/{path}
POST /hecate/v1/agent-chat/sessions/{id}/messages/{message_id}/revert
POST /hecate/v1/agent-chat/sessions/{id}/cancel
DELETE /hecate/v1/agent-chat/sessions/{id}
GET  /hecate/v1/agent-chat/sessions/{id}/approvals
GET  /hecate/v1/agent-chat/sessions/{id}/approvals/{approval_id}
POST /hecate/v1/agent-chat/sessions/{id}/approvals/{approval_id}/resolve
POST /hecate/v1/agent-chat/sessions/{id}/approvals/{approval_id}/cancel
GET  /hecate/v1/agent-chat/grants
DELETE /hecate/v1/agent-chat/grants/{grant_id}

Message creation is still a blocking POST for the submitted prompt, but clients can subscribe to the session SSE stream first to receive partial output while the external process is running. History is memory-backed by default and SQLite backed when GATEWAY_CHAT_SESSIONS_BACKEND=sqlite. The store also keeps the native ACP session id. On the next prompt after a gateway restart, Hecate passes that id to the adapter through ACP session/load when the adapter advertises load-session support; otherwise it creates a fresh native session and keeps the Hecate transcript intact.

Adapter Session Behavior

For the first prompt in an Agent Chat session:

Resolve the adapter through a direct ACP command or a Hecate-managed launcher. Codex and Claude can use local npx managed launchers; Cursor currently comes from cursor-agent acp.
Validate and canonicalize the workspace path.
Build a sanitized process environment. Gateway/provider secrets are not forwarded by default.
Spawn the ACP adapter in the selected workspace.
Complete ACP initialize and session/new.
Send the prompt as the first ACP turn.
Normalize ACP updates into transcript text, structured activity records, raw diagnostics, usage telemetry, and approval requests.
Enforce timeout, cancellation, turn ceilings, wall-clock limits, and idle cleanup.
If the workspace is a Git repo, capture git diff --stat and git diff --binary onto the assistant message.

For later prompts in the same External Agent chat session, Hecate reuses the same adapter process and native ACP session. If the gateway restarts and SQLite chat storage is enabled, Hecate keeps the transcript and saved native session id. On the next prompt it asks the adapter to load that native session when the adapter advertises load-session support; otherwise it starts a fresh native ACP session and keeps the Hecate-side transcript intact.

Managed launchers are intentionally local and operator-controlled. Hecate writes small wrapper scripts into the user cache directory or HECATE_AGENT_ADAPTERS_DIR, refreshes one adapter on demand, and removes stale launcher scripts at startup when the built-in adapter list changes.

Relationship To ACP

ACP is useful in two directions:

Zed / JetBrains -> ACP -> Hecate
Hecate -> ACP -> Codex / Claude / Cursor Agent

The inbound bridge (cmd/hecate-acp) lets editor agent panels talk to Hecate. The outbound adapter layer lets Hecate talk to ACP-capable external coding agents. They share protocol vocabulary but stay separate processes and code paths.

Observability

Agent Chat currently has three observability surfaces:

The per-session SSE stream emits typed session_update, approval.requested, and approval.resolved events.
Assistant messages carry stable run metadata (run_id, timestamps, duration, trace ids, native session id), structured activity records, raw ACP diagnostics, usage updates, and captured diff data.
OpenTelemetry spans and metrics cover agent_chat.run, adapter probe outcomes, approval request/resolve paths, approval timeout/grant counters, cancellation reasons, output byte counts, and diff-capture state.

Important attributes include:

hecate.agent_adapter.id
hecate.agent_adapter.command
hecate.agent_adapter.driver.kind
hecate.agent_adapter.native_session.id
hecate.agent_chat.session.id
hecate.workspace.path
hecate.run.id
hecate.agent_adapter.output.bytes
hecate.agent_adapter.diff.captured

Do not log prompts by default outside existing debug/redaction rules.

Security And Policy

External agent adapters are high-risk because they run third-party CLIs that may themselves execute tools.

First-version safety rules:

Require an explicit workspace path.
Validate and canonicalize the workspace directory before storing a session.
Use sanitized env by default.
Do not pass provider API keys unless the adapter config explicitly opts in.
Enforce timeout and cancellation.
Capture output with the same output-size limits used by task tools.
Mark cost as external / unknown unless the adapter reports structured usage.
Make the UI visibly distinguish external-agent output from provider/model output.

Current limitation: external adapters run as trusted subprocesses in the selected workspace. They are not the same as Hecate task-runtime sandboxed tool calls. This is intentional for alpha: Codex, Claude Code, and Cursor are long-lived interactive processes with their own auth, caches, child processes, and ACP stdio/session lifecycle. Reusing the task-runtime per-call sandbox is not a drop-in fit.

Acceptance Criteria For First Implementation

Future Enhancements

Fuller patch review UX for captured diffs: side-by-side hunks, batch selection, and richer artifact history. The current Chats UI can inspect and revert already-applied Git paths and is sufficient for alpha stability.
Deeper adapter-specific structured mappers for ACP tool output. The current generic mapper plus raw diagnostics is sufficient for alpha stability.
Decide which task-runtime primitives Agent Chat should reuse without pretending Hecate owns the external agent runtime.

Open Questions

Should Agent Chat reuse task-runtime primitives for artifacts, event history, retention, and trace correlation while keeping Codex, Claude Code, and Cursor as opaque supervised runtimes?
How much of the external process environment should be configurable by the operator?
Should Hecate eventually offer optional process containment for external adapters? Not a near-term requirement. If it happens, it should be a separate design for long-lived ACP subprocesses, not reuse of the task-runtime per-call sandbox.
Which adapter-specific ACP update shapes deserve first-class UI mapping next?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

External Agent Adapters — Candidate (RFC)

Problem

Goals

Non-goals

Recommended Shape

UI Model

Backend Model

API Options

Adapter Session Behavior

Relationship To ACP

Observability

Security And Policy

Acceptance Criteria For First Implementation

Future Enhancements

Open Questions

FilesExpand file tree

external-agent-adapters.md

Latest commit

History

external-agent-adapters.md

File metadata and controls

External Agent Adapters — Candidate (RFC)

Problem

Goals

Non-goals

Recommended Shape

UI Model

Backend Model

API Options

Adapter Session Behavior

Relationship To ACP

Observability

Security And Policy

Acceptance Criteria For First Implementation

Future Enhancements

Open Questions