Add support for stream responses #605

Merged
gtopper merged 32 commits into mlrun:development from gtopper:ML-11875 on Jan 26, 2026

Conversation

@gtopper gtopper commented Jan 20, 2026


Copilot AI left a comment


Pull request overview

This PR adds comprehensive streaming response support to the storey data processing library, enabling steps to yield multiple chunks of data incrementally rather than returning a single result. The implementation includes new primitive types for streaming, modifications to existing flow steps, a new Collector step for aggregating streams, and extensive test coverage.

Changes:

  • Added streaming primitives (StreamChunk, StreamCompletion, StreamingError) and modified Map, MapClass, Complete, Reduce, and ParallelExecution steps to support generator functions (see the sketch after this list)
  • Introduced Collector step to aggregate streaming chunks back into single events
  • Updated AwaitableResult and AsyncAwaitableResult to return generators for streaming responses
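
To make the new streaming flow concrete, here is a minimal sketch of a graph that streams chunks from a generator-based Map and aggregates them with the new Collector step. Only the class names come from this PR's summary; the Collector constructor arguments, the shape of the aggregated event, and the surrounding flow are assumptions for illustration.

```python
from storey import Collector, Map, Reduce, SyncEmitSource, build_flow


def tokenize(sentence):
    # Generator function: with this change, each yield is emitted downstream
    # as a separate streamed chunk instead of a single return value.
    for token in sentence.split():
        yield token


controller = build_flow(
    [
        SyncEmitSource(),
        Map(tokenize),  # streams one chunk per token
        Collector(),    # aggregates chunks back into a single event (constructor args assumed)
        Reduce([], lambda acc, event: acc + [event]),
    ]
).run()

controller.emit("stream me please")
controller.terminate()
print(controller.await_termination())  # aggregated result; exact shape depends on Collector's semantics
```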

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 10 comments.

Summary per file

| File | Description |
| --- | --- |
| `storey/dtypes.py` | Adds StreamChunk, StreamCompletion, and StreamingError classes for streaming support |
| `storey/flow.py` | Adds _StreamingStepMixin and updates Map, MapClass, Complete, Reduce, ParallelExecution, and Choice to handle streaming |
| `storey/sources.py` | Updates AwaitableResult and AsyncAwaitableResult to support streaming generators |
| `storey/steps/collector.py` | Implements the new Collector step for aggregating streaming chunks |
| `storey/steps/__init__.py` | Exports the new Collector step |
| `storey/__init__.py` | Exports StreamingError and Collector for the public API |
| `tests/test_streaming.py` | Comprehensive test suite covering streaming primitives, Map/MapClass streaming, Collector, Complete, error handling, graph splits, and cyclic graphs |
| `tests/test_flow.py` | Refactors cycle creation to use the cleaner `.to()` API instead of direct `_outlets` manipulation |
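
For request/response flows, the sources.py change above suggests a Complete-terminated flow can now hand back a stream. A hedged sketch, assuming `emit()` returns an `AwaitableResult` whose `await_result()` yields a generator when the upstream step streams (inferred from the summary, not confirmed by this page):

```python
from storey import Complete, Map, SyncEmitSource, build_flow


def stream_tokens(sentence):
    for token in sentence.split():
        yield token  # each yield becomes one response chunk


controller = build_flow([SyncEmitSource(), Map(stream_tokens), Complete()]).run()

awaitable = controller.emit("hello streaming world")
for chunk in awaitable.await_result():  # assumed: a generator of chunks for streaming responses
    print(chunk)

controller.terminate()
controller.await_termination()
```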



Copilot AI left a comment


Pull request overview

Copilot reviewed 8 out of 8 changed files in this pull request and generated no new comments.




@alxtkr77 alxtkr77 left a comment


tests/test_streaming.py - Test Duplication

The 56 tests have significant structural duplication - almost every test has both a sync and async version with ~90% identical code (27 pairs).

Consider using pytest parametrization to consolidate:

```python
@pytest.fixture(params=["sync", "async"])
def flow_context(request):
    if request.param == "sync":
        return SyncEmitSource, lambda f: f()
    else:
        return AsyncEmitSource, lambda f: asyncio.run(f())


def test_collector_basic(self, flow_context):
    source_cls, run = flow_context
    # Single implementation handles both
```

This would reduce tests from 56 to ~30 while maintaining the same coverage, and make future maintenance easier.


@alxtkr77 alxtkr77 left a comment


flow.py - Duplicate _is_generator method

The _is_generator() method is defined identically in two places:

  • _StreamingStepMixin (line ~201)
  • ParallelExecutionRunnable (line ~1448)

Consider reusing the mixin method or extracting to a shared utility to avoid duplication.
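
One possible shape for that shared utility, sketched here with an assumed name and location (not part of this PR):

```python
import inspect


def _is_generator_function(fn) -> bool:
    # A single module-level helper that both _StreamingStepMixin and
    # ParallelExecutionRunnable could call instead of defining their own copy.
    # Covers both sync and async generator functions.
    return inspect.isgeneratorfunction(fn) or inspect.isasyncgenfunction(fn)
```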


gtopper commented Jan 22, 2026

> tests/test_streaming.py - Test Duplication
>
> The 56 tests have significant structural duplication - almost every test has both a sync and async version with ~90% identical code (27 pairs).
>
> Consider using pytest parametrization to consolidate:
>
> ```python
> @pytest.fixture(params=["sync", "async"])
> def flow_context(request):
>     if request.param == "sync":
>         return SyncEmitSource, lambda f: f()
>     else:
>         return AsyncEmitSource, lambda f: asyncio.run(f())
>
>
> def test_collector_basic(self, flow_context):
>     source_cls, run = flow_context
>     # Single implementation handles both
> ```
>
> This would reduce tests from 56 to ~30 while maintaining the same coverage, and make future maintenance easier.

Yeah, this is sort of true. My AI also wanted to do this. The problem is, the APIs are too divergent, and the result of parameterization isn't very good. I.e. the above code snippet doesn't cover all the conditionals needed.

@gtopper gtopper requested a review from alxtkr77 January 22, 2026 12:24
@gtopper gtopper marked this pull request as ready for review January 22, 2026 12:42
@gtopper gtopper requested a review from royischoss January 26, 2026 10:32

@royischoss royischoss left a comment


Hey, LGTM, two minor comments.

@gtopper gtopper requested a review from royischoss January 26, 2026 12:11

@royischoss royischoss left a comment


LGTM 👍

@gtopper gtopper merged commit e7ded66 into mlrun:development Jan 26, 2026
5 checks passed
gtopper added a commit to gtopper/mlrun that referenced this pull request Jan 26, 2026
Using the new functionality introduced in storey 1.11.8 / mlrun/storey#605.

[ML-11876](https://iguazio.atlassian.net/browse/ML-11876)
gtopper added a commit to gtopper/storey that referenced this pull request Jan 26, 2026
mlrun#605 broke mlrun test `serving.test_async_flow.test_model_runner_with_selector`.
gtopper added a commit that referenced this pull request Jan 26, 2026
#605 broke mlrun test `serving.test_async_flow.test_model_runner_with_selector`.
gtopper added a commit to gtopper/mlrun that referenced this pull request Jan 26, 2026
To fix `serving.test_async_flow.test_model_runner_with_selector` following breakage introduced in storey 1.11.8 / mlrun/storey#605.
gtopper added a commit to mlrun/mlrun that referenced this pull request Jan 27, 2026
To fix `serving.test_async_flow.test_model_runner_with_selector` following breakage introduced in storey 1.11.8 / mlrun/storey#605.
gtopper added a commit to gtopper/mlrun that referenced this pull request Jan 27, 2026
Using the new functionality introduced in storey 1.11.8 / mlrun/storey#605.

[ML-11876](https://iguazio.atlassian.net/browse/ML-11876)
gtopper added a commit to mlrun/mlrun that referenced this pull request Jan 29, 2026
Adds support for streaming responses in serving graphs, enabling real-time chunk-by-chunk HTTP responses (e.g., for LLM token streaming).

Key changes:

* New `set_streaming(enabled=True)` API on serving functions
* Async streaming handler that yields results as they're produced by graph steps
* Graph steps can now use generators to stream multiple chunks
* Updated nuclio handler for generator return type support

Using the new functionality introduced in storey 1.11.8 / mlrun/storey#605.

[ML-11876](https://iguazio.atlassian.net/browse/ML-11876)

Depends on nuclio/nuclio-jupyter#197.

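
As an illustration of the serving-side API described in that commit message, a hypothetical usage sketch; the function name, file, and handler are placeholders, and only `set_streaming(enabled=True)` and generator-based handlers are taken from the description above:

```python
import mlrun

# Serving function whose graph step streams chunks (e.g. LLM tokens).
serving_fn = mlrun.code_to_function("llm-server", kind="serving", filename="server.py")
graph = serving_fn.set_topology("flow")
graph.to(name="generate", handler="stream_tokens").respond()  # handler may be a generator

serving_fn.set_streaming(enabled=True)  # enable chunk-by-chunk HTTP responses (exact call site assumed)
```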
