Agent Connect Protocol Specification

Getting Started

Explore the ACP specification by browsing the OpenAPI view https://agntcy.github.io/acp-spec/docs/openapi.html.

Learn how to use the API by looking at API Usage Flows

Learn about Agent ACP Descriptor and its usage here

Explore tools for ACP and Agent ACP Descriptors in the Agent Control SDK Repo

API Usage Flows

Agents Retrieval APIs

ACP offers an API to search for the agents served by the ACP server. Once a client has an agent identifier AgentID, it can use it to either retrieve the agent descriptor or to control agent runs.

Retrieve all agents supported by the server

In this case, the client is doing a search of all agents in the server without specifying any search filter. Result is the list of all agents.

sequenceDiagram
    participant C as ACP Client
    participant S as ACP Server
    C->>+S: POST /agents/search {}
    S->>-C: AgentList=[{id, metadata}, {id, metadata}, ...]

Loading

Retrieve an agent from its name and version

In this case, the client knows name and version of an agent (e.g. learnt from the record in the Agent Directory) and wants to retrieve its id to interact with the agent.

sequenceDiagram
    participant C as ACP Client
    participant S as ACP Server
    C->>+S: POST /agents/search <br/>{"name":"smart-agent", "version": "0.1.3"}
    S->>-C: AgentList = [{id, metadata}]

Loading

Retrieve agent descriptor from its identifier

In this case, the client knows the agent id and wants to retrieve its descriptor to learn about the capabilities supported and the data schemas to use.

sequenceDiagram
    participant C as ACP Client
    participant S as ACP Server
    C->>+S: GET /agents/agent/{agent_id}/descriptor
    S->>-C: AgentACPDescriptor={...}

Loading

Runs

A run is a single execution of an agent.

Start a Run of an Agent and poll for completion

In this case, the client starts a background run of an agent, keeps polling the server until the run is complete, finally it retrieves the run output.

sequenceDiagram
    participant C as ACP Client
    participant S as ACP Server
    C->>+S: POST /runs {agent_id, input, config, metadata}
    S->>-C: Run={run_id, status="pending"}
    loop Until run["status"] == "pending"
    C->>+S: GET /runs/{run_id}
    S->>-C: Run={run_id, status}
    end
    C->>+S: GET /runs/{run_id}/wait
    S->>-C: RunOutput={type="result", result}

Loading

In the sequence above:

The client requests to start a run on a specific agent, providing its agent_id, and specifying:
- Configuration: a run configuration is flavoring the behavior of this agent for this run.
- Input: run input provides the data the agent will operate on.
- Metadata: metadata is a free format object that can be used by the client to tag the run with arbitrary information.
The server returns a run object which includes the run identifier and a status, the status at the beginning will be pending.
The client retrieves the status of the run until completion.
The server returns the run object with the updated status.
The client request the output of the run with the wait endpoint, which returns immediately, since the run is done.
The server returns the final result of the run.

Note that the format of the input and the configuration are not specified by ACP, but they are defined in the agent descriptor.

Start a Run of an Agent and block until completion

In this case, the client starts a background run of an agent and immediately tries to retrieve the run output blocking on this call until completion or timeout.

sequenceDiagram
    participant C as ACP Client
    participant S as ACP Server
    C->>+S: POST /runs {agent_id, input, config, metadata}
    S->>-C: Run={run_id, status="pending"}
    C->>+S: GET /runs/{run_id}/wait 
    S->>-C: RunOutput={type="result", result}

Loading

In the sequence above:

The client requests to start a run on a specific agent.
The server returns a run object.
The client request the output of the run with the wait endpoint. In this case the request and blocks until run status changes.
The server returns the final result of the run. Note that in case the timeout had expired before, the server would have returned no content.

Start a Run of an Agent with a callback

Agents can support callbacks, i.e. asynchronously call back the client upon run status change. The support for interrupts is signaled in the agent descriptor.

In this case, the client starts a background run of an agent and provide a callback to be called upon completion.

sequenceDiagram
    participant C as ACP Client
    participant S as ACP Server
    C->>+S: POST /runs {agent_id, input, config, metadata, callback={POST /callme}}
    S->>-C: Run={run_id, status="pending"}
    S->>C: POST /callme Run={run_id, status="success"}
    C->>+S: GET /runs/{run_id}/wait
    S->>-C: RunOutput={type="result", result}

Loading

In the sequence above:

The client requests to start a run on a specific agent, providing an additional callback.
The server returns a run object.
Upon status change, the server calls the provided call back with the run object.
The client request the output of the run with the wait endpoint, which returns immediately, since the run is done.
The server return the final result of the run.

Run Interrupt and Resume

Agent can support interrupts, i.e. the run execution can interrupt to request additional input to the client. The support for interrupts is signaled in the agent ACP descriptor.

When an interrupt occurs, the server provides the client with an interrupt payload, which specifies the interrupt type that has occurred and all the information associated with that interrupt, i.e. a request for additional input.

The client can collect the needed input for the specific interrupt and resume the run by providing the resume payload, i.e. the additional input requested by the interrupt.

Note that the type of interrupts and the correspondent interrupt and resume payload are not specified by ACP, because they are agent dependent. They are instead specified in the agent ACP descriptor.

The interrupt is provided by the server when the client requests the output.

Start a run and resume it upon interruption

In this case, the client asks for the agent output and receives an interrupt instead of the final output. The client then resumes the run providing the needed input and finally when the run is completed, gets the result.

sequenceDiagram
    participant C as ACP Client
    participant S as ACP Server
    C->>+S: POST /runs {agent_id, input, config, metadata}
    S->>-C: Run={run_id, status="pending"}
    C->>+S: GET /runs/{run_id}/wait 
    S->>-C: RunOutput={type="interrupt", interrupt_type, interrupt_payload}
    note over C: collect needed input
    C->>+S: POST /runs/{run_id} {interrupt_type, resume_payload}
    S->>-C: Run={run_id, status="pending"}
    C->>+S: GET /runs/{run_id}/wait
    S->>-C: RunOutput={type="result", result}

Loading

In the sequence above:

The client start the run.
The server returns the run object.
The client requests the output.
The server returns an interrupt, specifying interrupt type and the associated payload.
The client resumes the run providing the needed input in the resume payload.
the client requests the output.
The server returns the final result.

Thread Runs

Agents can support thread run. Support for thread run is signaled in the agent ACP descriptor.

When an agent supports thread run, each run is associated to a thread, and at the end of the run a thread state is kept in the server.

Subsequent runs on the same thread use the previously created state, together with the run input provided.

The server offers ways to retrieve the current thread state, the history of the runs on a thread, and the evolution of the thread states over execution of runs.

Runs over the same thread can be executed on different agents, as long as the agents support the same thread state format.

Note that the format of the thread state is not specified by ACP, but it is (optionally) defined in the agent ACP descriptor. If specified, it can be retrieved by the client, if not it's not accessible to the client.

Start of multiple runs over the same thread

In this case the client starts a sequence of runs on the same thread accumulating a state in the server. In this specific example the input is a chat message, while the state kept in the server is the chat history.

sequenceDiagram
    participant C as ACP Client
    participant S as ACP Server
    rect rgb(240,240,240)
    C->>+S: POST /threads
    S->>-C: Thread={thread_id, status="idle"}
    C->>+S: POST /threads/{thread_id}/runs {agent_id, message="Hello, my name is John?", config, metadata}
    S->>-C: Run={run_id, status="pending"}
    C->>+S: GET /threads/{thread_id}/runs/{run_id}/wait 
    S->>-C: RunOutput={type="result", result={"message"="Hello John, how can I help?"}}
    end
    note right of S: state=[<br/>"Hello, my name is John?",<br/>"Hello John, how can I help?"<br/>]
    rect rgb(240,240,240)
    C->>+S: POST /threads/{thread_id}/runs {agent_id, message="Can you remind my name?", config, metadata}
    S->>-C: Run={run_id, status="pending"}
    C->>+S: GET /threads/{thread_id}/runs/{run_id}/wait 
    S->>-C: RunOutput={type="result", result={"message"="Yes, your name is John"}}
    end
    note right of S: state=[<br/>"Hello, my name is John?",<br/>"Hello John, how can I help?"<br/>"Can you remind my name?",<br/>"Yes, your name is John"<br/>]
    C->>+S: GET /threads/{thread_id} 
    S->>-C: Thread{thread_id, status="idle", values=[<br/>"Hello, my name is John?",<br/>"Hello John, how can I help?"<br/>"Can you remind my name?",<br/>"Yes, your name is John"<br/>]}

Loading

In the sequence above:

The client requests to create a thread on the server
The server returns a thread object that contains a thread ID
The client starts the first run on the created thread and provides the first message of the chat.
The server returns the run object.
The client requests the run output.
The server returns the run output which is the next chat message from the agent and leaves a state with the current chat history.
The client starts a new run on the same thread providing the input for the run, i.e. the next message in the chat (assuming the existence of the chat history on the server).
The server starts the runs using the existing chat history and returns the run object.
The client requests the run output.
The server updates the thread state and returns the run output.
Finally, the client requests the thread state (this is an optional operation).
The server returns the current thread object which collects the whole chat history.

Output Streaming

ACP supports output streaming. Agent can stream partial results of a Run to provide better response time and user experience.

ACP implements streaming using Server Sent Events specified here: https://html.spec.whatwg.org/multipage/server-sent-events.html.

In a nutshell, the client keeps the HTTP connection open and receives a stream of events from the server, where each event carries an update of the run result.

ACP supports 2 streaming modes:

values where each event contains a full instance of the agent output, which fully replace the previous update.
custom where the schema of the event is left unspecified by ACP, which it can be specified in the specific agent ACP descriptor under spec.custom_streaming_update

Start a Run and stream output until completion

sequenceDiagram
    participant C as ACP Client
    participant S as ACP Server
    C->>+S: POST /runs/stream {agent_id, input, config, metadata, stream_mode='values'}
    S->>-C: Run={run_id, status="pending"}
    rect rgb(240,240,240)
    S->>C: StreamEvent={id="1", event="agent_event", data={run_id, type="values", result={"message": "Hello"}}}
    S->>C: StreamEvent={id="2", event="agent_event", data={run_id, type="values", result={"message": "Hello, how"}}}
    S->>C: StreamEvent={id="2", event="agent_event", data={run_id, type="values", result={"message": "Hello, how can"}}}
    S->>C: StreamEvent={id="3", event="agent_event", data={run_id, type="values", result={"message": "Hello, how can I help"}}}
    S->>C: StreamEvent={id="4", event="agent_event", data={run_id, type="values", result={"message": "Hello, how can I help you"}}}
    S->>C: StreamEvent={id="5", event="agent_event", data={run_id, type="values", result={"message": "Hello, how can I help you today"}}}
    S->>C: Close Connection
    end

Loading

In the sequence above:

The client requests to start a run on a specific agent specifying stream_mode = 'values' and waits immediately for the streaming.
The client requests the output streaming and keeps the connection open.
The server returns an event with message="Hello".
The server returns an event with updated message "Hello, how".
The server returns an event with updated message "Hello, how can".
The server returns an event with updated message "Hello, how can I help".
The server returns an event with updated message "Hello, how can I help you".
The server returns an event with updated message "Hello, how can I help you today".
The server closes the connection because the output is complete.

Agent ACP descriptor

Agent ACP Descriptor is a descriptor that contains all the needed information to know how:

Identify an agent.
Know its capabilities.
Consume its capabilities.

The Agent ACP Descriptor can be obtained from the Agent Directory or can be obtained through an ACP call.

Agent descriptor sections and examples

We present the details of a sample agent ACP descriptor through the various descriptor sections.

Full sample Descriptor

filename

Agent Metadata

Agent Metadata section contains all the information about agent identification and a description of what the agent does. It contains unique name which together with a version constitutes the unique identifier of the agent. The uniqueness must be guaranteed within the server it is part of and more generally in the Agent Directory domain it belongs to.

Sample descriptor metadata section for the mailcomposer agent

{
  "metadata": {
    "ref": {
      "name": "org.agntcy.mailcomposer",
      "version": "0.0.1",
      "url": "https://github.com/agntcy/acp-spec/blob/main/docs/sample_acp_descriptors/mailcomposer.json"
    },
    "description": "This agent is able to collect user intent through a chat interface and compose wonderful emails based on that."
  }
  ...
}

Metadata for a mail composer agent named org.agntcy.mailcomposer version 0.0.1.

Agent Specs

Agent Specs section includes ACP invocation capabilities and the schema definitions for ACP interactions.

The ACP capabilities that the agent support, e.g. streaming, callbacks, interrupts etc.

The schemas of all the objects that this agent supports for:

Agent Configuration.
Run Input.
Run Output.
Interrupt and Resume Payloads.
Thread State.

Note that these schemas are needed in the agent ACP descriptor, since they are agent specific and are not defined by ACP, i.e. ACP defines a generic JSON object for the data structures listed above.

Sample metadata specs section for the mailcomposer agent

{
  ...
    "specs": {
      "capabilities": {
        "threads": true,
        "interrupts": true,
        "callbacks": true
      },
      "input": {
        "type": "object",
        "description": "Agent Input",
        "properties": {
            "message": {
                "type": "string",
                "description": "Last message of the chat from the user"
            }
        }
      },
      "thread_state": {
        "type": "object",
        "description": "The state of the agent",
        "properties": {
          "messages": {
            "type": "array",
            "description": "Full chat history",
            "items": {
                "type": "string",
                "description": "A message in the chat"
            }
          }
        }
      },
      "output": {
        "type": "object",
        "description": "Agent Input",
        "properties": {
            "message": {
                "type": "string",
                "description": "Last message of the chat from the user"
            }
        }
      },
      "config": {
        "type": "object",
        "description": "The configuration of the agent",
        "properties": {
          "style": {
            "type": "string",
            "enum": ["formal", "friendly"]
          }
        }
      },
      "interrupts": [
        {
          "interrupt_type": "mail_send_approval",
          "interrupt_payload": {
            "type": "object",
            "title": "Mail Approval Payload",
            "description": "Description of the email",
            "properties": {
              "subject": {
                "title": "Mail Subject",
                "description": "Subject of the email that is about to be sent",
                "type": "string"
              },
              "body": {
                "title": "Mail Body",
                "description": "Body of the email that is about to be sent",
                "type": "string"
              },
              "recipients": {
                "title": "Mail recipients",
                "description": "List of recipients of the email",
                "type": "array",
                "items": {
                    "type": "string",
                    "format": "email"
                }
              }
            },
            "required": [
              "subject",
              "body",
              "recipients"
            ]
          },
          "resume_payload": {
            "type": "object",
            "title": "Email Approval Input",
            "description": "User Approval for this email",
            "properties": {
              "reason": {
                "title": "Approval Reason",
                "description": "Reason to approve or decline",
                "type": "string"
              },
              "approved": {
                "title": "Approval Decision",
                "description": "True if approved, False if declined",
                "type": "boolean"
              }
            },
            "required": [
              "approved"
            ]
          }
        }
      ]
    }
  ...
}

The agent supports threads, interrupts, and callback.

It declares schemas for input, output, and config:

As input, it expects the next message of the chat from the user.
As output, it produces the next message of the chat from the agent.
As config it expects the style of the email to be written.

It supports one kind of interrupt, which is used to ask user for approval before sending the email. It provides subject, body, and recipients of the email as interrupt payload and expects approval as input to resume.

It supports a thread state which holds the chat history.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Agent Connect Protocol Specification

Getting Started

API Usage Flows

Agents Retrieval APIs

Retrieve all agents supported by the server

Retrieve an agent from its name and version

Retrieve agent descriptor from its identifier

Runs

Start a Run of an Agent and poll for completion

Start a Run of an Agent and block until completion

Start a Run of an Agent with a callback

Run Interrupt and Resume

Start a run and resume it upon interruption

Thread Runs

Start of multiple runs over the same thread

Output Streaming

Start a Run and stream output until completion

Agent ACP descriptor

Agent descriptor sections and examples

Agent Metadata

Agent Specs

Files

README.md

Latest commit

History

README.md

File metadata and controls

Agent Connect Protocol Specification

Getting Started

API Usage Flows

Agents Retrieval APIs

Retrieve all agents supported by the server

Retrieve an agent from its name and version

Retrieve agent descriptor from its identifier

Runs

Start a Run of an Agent and poll for completion

Start a Run of an Agent and block until completion

Start a Run of an Agent with a callback

Run Interrupt and Resume

Start a run and resume it upon interruption

Thread Runs

Start of multiple runs over the same thread

Output Streaming

Start a Run and stream output until completion

Agent ACP descriptor

Agent descriptor sections and examples

Agent Metadata

Agent Specs