feat: Subscriptions #620

enisdenjo · 2025-12-13T05:59:56Z

Implement SSE, Incremental Delivery over HTTP and Apollo's Multipart HTTP specs for subscriptions when communicating with subgraphs or clients with entity resolution capabilities.

What changed?

Streaming execution result

lib/executor/src/execution/plan.rs@QueryPlanExecutionResult

The execution pipeline now returns QueryPlanExecutionResult - an enum that can be either a single response or a stream:

pub enum QueryPlanExecutionResult {
    Single(PlanExecutionOutput),
    Stream(PlanSubscriptionOutput),
}

This separation allows the query planner and executor to operate independently from transport concerns, making future improvements (like connection repair and silent retries) easier to implement.

Owned context for streaming

lib/executor/src/execution/plan.rs@OwnedQueryPlanExecutionContext

Subscriptions require long-lived contexts that outlive request lifetimes. The implementation introduces an owned context that:

Clones Arc-wrapped shared data (executors, schema metadata, projection/header plans)
Enables entity resolution to happen independently for each subscription event
Processes remaining plan nodes after the subscription fetch

Subscription handlers

The router respects the client's Accept header to determine response format:

text/event-stream → SSE responses
multipart/mixed → Incremental Delivery over HTTP
multipart/mixed;subscriptionSpec="1.0" → Apollo multipart HTTP
Returns 406 Not Acceptable if subscription is requested over unsupported transport
Handles errors by emitting an error event and completing the stream
Heartbeats every 10 seconds (except for incremental delivery, doesn't have it)
Of course protocols used between router and subgraphs and clients can be different

Same behavior is expected when communicating with subgraphs.

SSE

lib/executor/src/executors/sse.rs

Implements the GraphQL over SSE spec distinct connection mode.

Multipart protocol

lib/executor/src/executors/multipart_subscribe.rs

Implements Apollo's multipart HTTP spec and GraphQL's Incremental Delivery over HTTP RFC.

Entity resolution

lib/executor/src/executors/multipart_subscribe.rs@execute_plan_with_initial_data

When a subscription emits data that references entities from other subgraphs, the router:

Receives subscription event from primary subgraph
Executes remaining plan nodes (flatten, fetch, etc.) to populate missing fields
Projects the complete response
Streams to client

This is handled in the async stream generator in execute_query_plan():

while let Some(response) = response_stream.next().await {
    // Parse subgraph response
    // Execute entity resolution if needed (remaining plan nodes)
    // Project and yield to client
}

Subscription node in query plan

lib/query-planner/src/planner/query_plan.rs@wrap_subscription_fetch_nodes

SubscriptionNode is used and now wraps subscription fetch operations:

pub struct SubscriptionNode {
    // It will practically always be a FetchNode
    pub primary: FetchNode,
}

The query planner detects subscription operations and wraps them appropriately, enabling plans with entity resolution like:

Sequence [
  SubscriptionNode { primary: Fetch(reviews) },
  Flatten(products),
  Fetch(products)
]

Subgraph executor subscribe method

lib/executor/src/executors/common.rs@subscribe

The HTTP executor gains a subscribe() method that:

Negotiates content-type (prefers multipart, falls back to SSE)
Establishes long-lived connections to subgraphs
Returns a BoxStream<HttpExecutionResponse> for downstream processing

Configure to only enable/disable subscriptions

The supported subscription protocols in this PR are inherintly HTTP and do not need a "protocol" configuration option. Hive Router will send an accept header listing all supported protocols for subscriptions over HTTP and the subgraph is free to choose whichever one it supports.

Whether we really want to limit specific protocols is up to discussion but objectively there is no benefit since they're all streamed HTTP.

Hence, you can only:

supergraph:
    source: file
    path: supergraph.graphql
subscriptions:
    enabled: true

P.S. if the subscriptions are disabled, the router will respond with when receiving a subscription request:

HTTP/1.1 415 Unsupported Media Type
Content-Type: application/json

{"errors":[{"message":"Subscriptions are not supported","extensions":{"code":"SUBSCRIPTIONS_NOT_SUPPORT"}}]}

What didn't change?

No HTTP Callback Protocol

Intentionally excluded to keep the PR focused. This would require webhook infrastructure in the router, which adds significant complexity. The SSE and multipart protocols cover the vast majority of use cases.

No WebSockets

At the moment only HTTP protocols. Just to keep the PR smaller, it can be integrating easily due to the separation of modules.

Silent Retries & Upstream Connection Repair

Most GraphQL subgraph implementations are stateless for subscriptions and have no concept of "continuing" from where they left off after a connection loss. Implementing retry logic on the router side would create false expectations - users would assume all events are delivered, but some would be lost when the subgraph creates a fresh subscription.

This is fundamentally why the EDFS (Event-Driven Federated Subscriptions) and callback protocols exist. To avoid misleading behavior and keep the PR focused on the core functionality, connection repair is not implemented.

TODO

gemini-code-assist · 2025-12-13T06:00:38Z

Summary of Changes

Hello @enisdenjo, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces comprehensive support for GraphQL subscriptions within the router. It fundamentally changes how the router handles long-lived connections and streaming data, moving from a single-response model to one that can manage continuous data flows. This enables real-time data updates for clients and subgraphs, complete with dynamic protocol negotiation and on-the-fly entity resolution for each event in a subscription stream.

Highlights

GraphQL Subscriptions Support: The router now fully supports GraphQL subscriptions, implementing both Server-Sent Events (SSE) and Apollo's Multipart HTTP protocols for communication with subgraphs and clients.
Streaming Execution Result: The execution pipeline has been refactored to return a QueryPlanExecutionResult enum, which can be either a single response or a stream, enabling native handling of long-lived subscription connections.
Owned Context for Streaming: A new OwnedQueryPlanExecutionContext is introduced to manage long-lived contexts required for subscriptions, allowing entity resolution to occur independently for each event within a stream.
Dynamic Subscription Handlers: The router intelligently determines the appropriate subscription protocol (SSE or Multipart) based on the client's Accept header, returning a 406 Not Acceptable if unsupported transports are requested.
Entity Resolution for Subscription Events: The router can now perform entity resolution for subscription events, fetching missing fields from other subgraphs after receiving initial data from the primary subgraph, and then streaming the complete response to the client.
Subscription Node in Query Plan: A new SubscriptionNode has been added to the query planner, specifically designed to wrap primary fetch operations for subscriptions, enabling complex plans that include entity resolution.
Subgraph Executor subscribe() Method: The HTTP executor gains a subscribe() method, allowing the router to negotiate content-type (preferring multipart), establish long-lived connections to subgraphs, and return a BoxStream for downstream processing.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces significant new functionality for GraphQL subscriptions, including support for SSE and multipart protocols, entity resolution for subscription events, and updates to the query planner. The overall architecture is well-thought-out, especially the separation of concerns with QueryPlanExecutionResult and the use of an owned context for long-lived subscriptions.

My review has identified a critical safety issue regarding an unsafe block that could lead to a use-after-free bug. I've also found several areas where manual JSON construction is used, which is brittle and should be replaced with safer serialization. Additionally, there are opportunities to improve performance in the stream handling logic by reducing string allocations, and a potential panic in the benchmark code. Please see the detailed comments for suggestions on how to address these points.

gemini-code-assist · 2025-12-13T06:04:30Z

lib/executor/src/execution/plan.rs

+) -> Result<PlanExecutionOutput, PlanExecutionError> {
+    // Clone initial_data to make it 'static for ExecutionContext
+    // SAFETY: We're creating a new owned value that will be used within this function
+    let owned_data: Value<'exec> = unsafe { std::mem::transmute(initial_data.clone()) };


The use of unsafe and mem::transmute here to extend the lifetime of initial_data is dangerous and likely unsound. The initial_data is a Value<'_> that borrows from response_body, which is only valid for the current loop iteration. Because execute_plan_with_initial_data is async, the future it returns can be suspended and resumed after response_body is no longer valid, leading to a use-after-free.

To fix this, you should create a fully owned Value<'static>. You can do this by implementing a helper function that recursively clones the Value data, converting any borrowed Cows to owned ones. This avoids the need for unsafe code.

For example, you could create a function fn to_owned_value<'a>(v: &Value<'a>) -> Value<'static> and use it like let owned_data: Value<'exec> = to_owned_value(&initial_data);.

bench/subgraphs/reviews.rs

lib/executor/src/execution/plan.rs

lib/executor/src/executors/http.rs

bin/router/src/pipeline/multipart_subscribe.rs

bin/router/src/pipeline/sse.rs

github-actions · 2025-12-13T06:18:46Z

✅ `k6-benchmark` results

     ✓ response code was 200
     ✓ no graphql errors
     ✓ valid response structure

     █ setup

     checks.........................: 100.00% ✓ 202020      ✗ 0    
     data_received..................: 5.9 GB  196 MB/s
     data_sent......................: 79 MB   2.6 MB/s
     http_req_blocked...............: avg=3.26µs   min=711ns   med=1.91µs  max=5.63ms   p(90)=2.74µs  p(95)=3.14µs  
     http_req_connecting............: avg=373ns    min=0s      med=0s      max=1.62ms   p(90)=0s      p(95)=0s      
     http_req_duration..............: avg=21.78ms  min=2.23ms  med=20.82ms max=125.07ms p(90)=29.71ms p(95)=32.88ms 
       { expected_response:true }...: avg=21.78ms  min=2.23ms  med=20.82ms max=125.07ms p(90)=29.71ms p(95)=32.88ms 
     http_req_failed................: 0.00%   ✓ 0           ✗ 67360
     http_req_receiving.............: avg=137.78µs min=26.32µs med=42.17µs max=70.48ms  p(90)=86.3µs  p(95)=402.26µs
     http_req_sending...............: avg=24.51µs  min=5.54µs  med=11.23µs max=19.33ms  p(90)=16.65µs p(95)=26.77µs 
     http_req_tls_handshaking.......: avg=0s       min=0s      med=0s      max=0s       p(90)=0s      p(95)=0s      
     http_req_waiting...............: avg=21.62ms  min=2.17ms  med=20.69ms max=68.38ms  p(90)=29.45ms p(95)=32.55ms 
     http_reqs......................: 67360   2239.936953/s
     iteration_duration.............: avg=22.27ms  min=7.14ms  med=21.18ms max=208.79ms p(90)=30.13ms p(95)=33.39ms 
     iterations.....................: 67340   2239.271888/s
     vus............................: 50      min=50        max=50 
     vus_max........................: 50      min=50        max=50

github-actions · 2025-12-13T06:20:44Z

🐋 This PR was built and pushed to the following Docker images:

Image Names: ghcr.io/graphql-hive/router

Platforms: linux/amd64,linux/arm64

Image Tags: ghcr.io/graphql-hive/router:pr-620 ghcr.io/graphql-hive/router:sha-c80aa36

Docker metadata

{
"buildx.build.ref": "builder-d43cc119-50af-408e-8735-d33cf0064b10/builder-d43cc119-50af-408e-8735-d33cf0064b100/nb9ci3y07ou5g7xv5l5v1b0di",
"containerimage.descriptor": {
  "mediaType": "application/vnd.oci.image.index.v1+json",
  "digest": "sha256:3956b78c27374b42580546766756a51ee3a6b8709447eab32af28d1150ec2034",
  "size": 1609
},
"containerimage.digest": "sha256:3956b78c27374b42580546766756a51ee3a6b8709447eab32af28d1150ec2034",
"image.name": "ghcr.io/graphql-hive/router:pr-620,ghcr.io/graphql-hive/router:sha-c80aa36"
}

enisdenjo force-pushed the not-kamil-subs branch from cf0bb48 to 9d9e2df Compare December 13, 2025 06:00

enisdenjo requested review from ardatan, dotansimha and kamilkisiela December 13, 2025 06:03

gemini-code-assist bot reviewed Dec 13, 2025

View reviewed changes

ardatan mentioned this pull request Dec 13, 2025

RFC: GraphQL Subscriptions #313

Open

enisdenjo and others added 21 commits December 30, 2025 18:59

graphql with sse

c1af496

e2e tests lol tdd

3e6e8ed

some setup thanks

4bfd646

consider sub node

b7a8ee3

whatever sse parse that

d967ca6

holy moly??

f6426ff

cleaner errro handling

c160217

make sure chunked

9aeed64

subscirption node can only have fetch node

3441cff

ok executor supports right?

f544ee2

ok we reply! tests pass

b9fa63c

reviewAddedForProduct and more tests

2864aa1

deduplicate

764cdc9

deduplicate

1cc5435

reuase

f741e29

multipart subscribe?

d410721

ok ok ok tests

52bd958

support for multipart grape

7cec7e3

clients can use multipart too

fbf481c

format

dfd89bb

sure clippy

6c4a265

enisdenjo and others added 8 commits December 30, 2025 19:00

ok chill clippy

bfea758

optimize right

feb4887

multipart and sse

ad06854

big up interceptor

d82eda3

stream failed subgraph requests

69c0d6f

simpler

1904da1

huh failing

87a39da

ah yes, it does work

27cd5e5

enisdenjo force-pushed the not-kamil-subs branch from 2ee5c8c to 27cd5e5 Compare December 30, 2025 18:00

enisdenjo added 9 commits December 30, 2025 19:05

ntex header map for client details

6e52479

update snapshot

8176982

attempt to test cancellation

181ff1c

explain

1cb166f

header propagation for entity resolution

5a6645a

subscription header's not propagated

a86ee14

ok of course

612bd3d

abrupt termination but failing huh

225c57f

ok I got it

f605cda

enisdenjo mentioned this pull request Jan 7, 2026

Hive Router Subscriptions graphql-hive/console#7472

Draft

enisdenjo and others added 7 commits January 7, 2026 23:30

cleanupstart with incremental delivery support

d2fcb85

adjust support incremental delivery multipart http parsing

02aa3fc

better request accepts

2b32306

subscriptions enabled disabled

4954b24

subscriptions plural

817e4ca

typo

a285d0c

boundary is spaced

bdaf0b4

enisdenjo mentioned this pull request Jan 12, 2026

refactor: Execution pipeline without HTTP request dependency and better accept header parsing #665

Open

enisdenjo removed request for ardatan, dotansimha and kamilkisiela January 13, 2026 16:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Subscriptions #620

feat: Subscriptions #620

Uh oh!

enisdenjo commented Dec 13, 2025 •

edited

Loading

Uh oh!

gemini-code-assist bot commented Dec 13, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Dec 13, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Dec 13, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Dec 13, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: Subscriptions #620

Are you sure you want to change the base?

feat: Subscriptions #620

Uh oh!

Conversation

enisdenjo commented Dec 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changed?

Streaming execution result

Owned context for streaming

Subscription handlers

SSE

Multipart protocol

Entity resolution

Subscription node in query plan

Subgraph executor subscribe method

Configure to only enable/disable subscriptions

What didn't change?

No HTTP Callback Protocol

No WebSockets

Silent Retries & Upstream Connection Repair

TODO

Uh oh!

gemini-code-assist bot commented Dec 13, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Dec 13, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Dec 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ k6-benchmark results

Uh oh!

github-actions bot commented Dec 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

enisdenjo commented Dec 13, 2025 •

edited

Loading

github-actions bot commented Dec 13, 2025 •

edited

Loading

✅ `k6-benchmark` results

github-actions bot commented Dec 13, 2025 •

edited

Loading