Async Queries

Works with v2.0+

Note: Async queries require Spice v2.0 or later.

This recipe demonstrates how to use the async queries API to submit long-running SQL queries and retrieve results asynchronously. It shows how to:

Submit queries via the HTTP API and CLI
Poll for query completion
Retrieve paginated results
Cancel running queries
Use the interactive spice query REPL

Async queries build on top of distributed query mode and require cluster mode with a scheduler and at least one executor.

Prerequisites

Spice CLI installed (v2.0+)

Getting Started

Step 1: Prepare Working Directory

Clone the cookbook repository and navigate to the async-queries directory.

git clone https://github.com/spiceai/cookbook.git
cd cookbook/async-queries

Step 2: Generate Development mTLS Certificates

Generate mTLS certificates for the scheduler and executor:

spice cluster tls init
spice cluster tls add scheduler1
spice cluster tls add executor1

Step 3: Start the Spice Scheduler

Start the scheduler with cluster mode and the scheduler.state_location configured in the spicepod.yaml:

~/.spice/bin/spiced --role scheduler \
  --node-bind-address 127.0.0.1:50052 \
  --node-advertise-address 127.0.0.1 \
  --http 127.0.0.1:8090 \
  --flight 127.0.0.1:50051 \
  --node-mtls-ca-certificate-file ~/.spice/pki/ca.crt \
  --node-mtls-certificate-file ~/.spice/pki/scheduler1.crt \
  --node-mtls-key-file ~/.spice/pki/scheduler1.key

The scheduler starts and registers the data dataset:

2026-03-02T12:00:00.000000Z  INFO spiced: Starting runtime
2026-03-02T12:00:01.000000Z  INFO runtime::cluster: Starting Ballista scheduler on 0.0.0.0:50052
2026-03-02T12:00:01.000000Z  INFO runtime::init::dataset: Dataset data initializing...
2026-03-02T12:00:01.000000Z  INFO runtime::flight: Spice Runtime Flight listening on 127.0.0.1:50051
2026-03-02T12:00:01.000000Z  INFO runtime::http: Spice Runtime HTTP listening on 127.0.0.1:8090
2026-03-02T12:00:03.000000Z  INFO runtime::init::dataset: Dataset data registered (s3://spiceai-public-datasets/hive_partitioned_data/), results cache enabled.
2026-03-02T12:00:03.000000Z  INFO runtime: All components are loaded. Spice runtime is ready!

Step 4: Start the Spice Executor

In a new terminal, start the executor:

~/.spice/bin/spiced --role executor \
  --http 127.0.0.1:9090 \
  --scheduler-address 127.0.0.1:50052 \
  --node-mtls-ca-certificate-file ~/.spice/pki/ca.crt \
  --node-mtls-certificate-file ~/.spice/pki/executor1.crt \
  --node-mtls-key-file ~/.spice/pki/executor1.key \
  --node-bind-address 127.0.0.1:50062 \
  --node-advertise-address 127.0.0.1

2026-03-02T12:01:00.000000Z  INFO spiced: Starting runtime
2026-03-02T12:01:01.000000Z  INFO ballista_executor::execution_loop: Starting poll work loop with scheduler
2026-03-02T12:01:01.000000Z  INFO runtime: All components are loaded. Spice runtime is ready!

Step 5: Submit an Async Query via HTTP

In a new terminal, submit a query using curl:

curl -s http://127.0.0.1:8090/v1/queries \
  -H "Content-Type: application/json" \
  -d '{"sql": "SELECT * FROM data LIMIT 100"}' | jq .

The API returns immediately with a query ID and status URLs:

{
  "query_id": "01ABC-DEF-456-7890AB",
  "status": "PENDING",
  "error": null,
  "status_url": "/v1/queries/01ABC-DEF-456-7890AB/status",
  "results_url": "/v1/queries/01ABC-DEF-456-7890AB/results"
}

Step 6: Poll for Completion

Use the status_url to poll until the query completes:

curl -s http://127.0.0.1:8090/v1/queries/01ABC-DEF-456-7890AB/status | jq .

While still running:

{
  "status": "RUNNING",
  "error": null
}

Once completed:

{
  "status": "SUCCEEDED",
  "error": null
}

Step 7: Retrieve Results

Fetch the first chunk of results:

curl -s http://127.0.0.1:8090/v1/queries/01ABC-DEF-456-7890AB/results | jq .

{
  "chunk_index": 0,
  "row_offset": 0,
  "row_count": 100,
  "next_chunk_index": null,
  "next_chunk_url": null,
  "data_array": [
    { "id": 30, "value": "value_0" },
    { "id": 31, "value": "value_1" }
  ]
}

For queries with more than 10,000 rows, follow the next_chunk_url to paginate through results:

curl -s http://127.0.0.1:8090/v1/queries/01ABC-DEF-456-7890AB/results/chunks/1 | jq .

Using the CLI

The spice query command provides a convenient CLI wrapper around the async queries API.

Submit and Wait

spice query "SELECT * FROM data LIMIT 10;"

Submitted query: 01ABC-DEF-456-7890AB (PENDING)
Waiting for completion... (Ctrl+C to stop waiting)
✓ SUCCEEDED (3.2s)
+----+---------+
| id | value   |
+----+---------+
| 30 | value_0 |
| 31 | value_1 |
| 32 | value_2 |
| 33 | value_3 |
| 34 | value_4 |
| 35 | value_5 |
| 36 | value_6 |
| 37 | value_7 |
| 38 | value_8 |
| 39 | value_9 |
+----+---------+

Time: 3.20000000 seconds. 10 rows.

Submit Without Waiting

spice query "SELECT * FROM data;" --no-wait

Submitted query: 01ABC-DEF-456-7890AB (PENDING)
Check status with: spice query status 01ABC-DEF-456-7890AB
Get results with: spice query results 01ABC-DEF-456-7890AB

List Running Queries

spice query list --status running

QUERY ID                STATUS    CREATED                    SQL PREVIEW
01ABC-DEF-456-7890AB    RUNNING   2026-03-02T12:00:00+00:00  SELECT * FROM data

Total: 1 queries

Cancel a Query

spice query cancel 01ABC-DEF-456-7890AB

Query 01ABC-DEF-456-7890AB cancelled (status: CANCELLED)

Interactive REPL

Start the REPL for a session-based workflow:

spice query

Welcome to the Spice.ai async query REPL.
Type SQL to submit a query, or .help for commands.

query> SELECT COUNT(*) FROM data;
Submitted query: 01ABC-DEF-456-7890AB (PENDING)
Press Ctrl+C to stop waiting (query continues in background)
✓ SUCCEEDED (2.8s)
+----------+
| count(*) |
+----------+
| 100      |
+----------+

Time: 2.80000000 seconds. 1 rows.

query> .list
QUERY ID                STATUS      SUBMITTED  SQL
01ABC-DEF-456-7890AB    SUCCEEDED   3s ago     SELECT COUNT(*) FROM data;

query> .exit

REPL Commands

Command	Description
`.list`	List tracked queries from this session
`.status <id>`	Show query status
`.results <id>`	Fetch and display results
`.wait <id>`	Resume waiting for a query
`.cancel <id>`	Cancel a running query
`.help`	Show all commands
`.exit`	Exit the REPL

Partial query IDs are supported — 01ABC resolves to the full ID if it uniquely matches one tracked query.

Advanced: Parameterized Queries

Submit queries with bind parameters to safely include dynamic values:

curl -s http://127.0.0.1:8090/v1/queries \
  -H "Content-Type: application/json" \
  -d '{
    "sql": "SELECT * FROM data WHERE id > $1 LIMIT $2",
    "parameters": [50, 10]
  }' | jq .

Advanced: Timeouts and Size Limits

Set a per-query timeout (the query is automatically cancelled on expiry):

curl -s http://127.0.0.1:8090/v1/queries \
  -H "Content-Type: application/json" \
  -d '{
    "sql": "SELECT * FROM data",
    "timeout_seconds": 30
  }' | jq .

Set a maximum result size (the query fails if results exceed it):

curl -s http://127.0.0.1:8090/v1/queries \
  -H "Content-Type: application/json" \
  -d '{
    "sql": "SELECT * FROM data",
    "maximum_size": 10485760
  }' | jq .

How It Works

Submit: A POST /v1/queries request creates a job in the scheduler's object store (configured via scheduler.state_location) and spawns a background task.
Execute: The background task submits the query to the Ballista distributed scheduler, which distributes execution across connected executors.
Stream: As result batches arrive from the executors, they are written to the object store in chunks of 10,000 rows.
Complete: The job is marked as SUCCEEDED with result metadata (schema, row count, chunk count).
Retrieve: Clients fetch results by chunk index. Results are available for 12 hours after completion.
Cleanup: Expired job results are periodically cleaned up from the object store.

Learn More

Distributed Query — Async Queries API — Full HTTP and Flight API reference
Distributed Query — Distributed multi-node SQL execution overview
Distributed Query Recipe — Setting up a distributed Spice cluster
spice query CLI — CLI command and REPL documentation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Async Queries

Prerequisites

Getting Started

Step 1: Prepare Working Directory

Step 2: Generate Development mTLS Certificates

Step 3: Start the Spice Scheduler

Step 4: Start the Spice Executor

Step 5: Submit an Async Query via HTTP

Step 6: Poll for Completion

Step 7: Retrieve Results

Using the CLI

Submit and Wait

Submit Without Waiting

List Running Queries

Cancel a Query

Interactive REPL

REPL Commands

Advanced: Parameterized Queries

Advanced: Timeouts and Size Limits

How It Works

Learn More

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Async Queries

Prerequisites

Getting Started

Step 1: Prepare Working Directory

Step 2: Generate Development mTLS Certificates

Step 3: Start the Spice Scheduler

Step 4: Start the Spice Executor

Step 5: Submit an Async Query via HTTP

Step 6: Poll for Completion

Step 7: Retrieve Results

Using the CLI

Submit and Wait

Submit Without Waiting

List Running Queries

Cancel a Query

Interactive REPL

REPL Commands

Advanced: Parameterized Queries

Advanced: Timeouts and Size Limits

How It Works

Learn More