Conversation

@guzman-raphael (Contributor) commented Jun 29, 2025

Depends on #91, #94

Features

  • Add Pipeline model with DOT support.
    • graph_dot input is used to build a petgraph::DiGraph.
    • metadata is a lookup table of node types, e.g. Kernel.
    • input_spec is a map that defines keys required to feed in an input Packet to the Pipeline. Each key can be associated with one or more node(s)/key(s).
    • output_spec is a map that defines keys to create an output Packet. Each key is linked to exactly one node:key.
    • input_spec/output_spec are explicit and flexible but, most importantly, equivalent in structure to Pod, to facilitate composing pipelines of mixed pods and other pipelines.
    • pipeline.make_dot() is also available to make it easier to visualize the compute nodes.
  • Add PipelineJob model.
    • pipeline points to the reference pipeline.
    • input_packet is a map of packet keys to path sets. Notice that each key can have a collection of path sets. This allows batching several inputs in one go. When inputs are batched (length > 1) for keys, the cartesian product will be applied if they correspond to the same input node.
    • output_dir is the root output directory for all produced computations. A pod job's output_dir will be mounted to the following tree structure: {pipeline_job_output_dir}/{node_name}/{input_packet_hash}/. Currently the packet is not hashed and a simple random hash is used.
  • Add PipelineResult model.
    • status captures the final state of a pipeline run.
  • Add JoinOperator which performs cartesian product on parent streams (itertools crate makes this very straightforward).
  • Add MapOperator which allows renaming packet keys.
  • Add async PipelineJob execution algorithm.
    • It requires an orchestrator agent since the job is dispatched to the agent network for processing. An agent is required because it has direct access to the orchestrator and does not place a dependency on user resources to manage a pipeline run. The agent should be installed/started in whichever network/infrastructure topology makes sense for the user/system, e.g. remote or local.
    • All communication between nodes is conducted via an agent_client (facilitated by zenoh crate). This lends itself to allowing even the operator logic to benefit from distributed, coordinated compute (in the future). Currently, there is no coordination between agent nodes.
    • Pipeline job node coordination uses the following communication/topic structure: group/{group}/status/pipeline_job/{pipeline_hash}/{input|output}/{node_name}. Appropriate payloads (e.g. Payload::Stream(Packet)) are published to these topics. Nodes referenced in pipeline.input_spec listen on input while all others listen on output. To signal successful completion of a stream, Payload::End is sent.
    • Once all parent streams have ended, no packets are actively processing on a node, and no packets are queued, the node will publish its own Payload::End. This signals that the node has reached NodeState::Completed.
    • If any packet is unsuccessful (Payload as Cancelled or Failed(..)), the entire node fails immediately and any descendants are cancelled. The node will be marked Failed(..) along with its error message.
    • If a node is unsuccessful (pod_result.state that is not Completed for any packet), the pipeline does not fail immediately. It will continue on a best-effort basis since there is value in evaluating unrelated nodes, e.g. for memoization.
    • If a pipeline run has concluded and all nodes have reached a NodeState::Completed, then the pipeline run was successful. Otherwise, it failed.
    • Operators use mutexes since they may need to share state across packets, e.g. JoinOperator needs to remember prior packets.
    • If any node encounters a Rust error during processing, it crashes the agent. This was intentionally left as-is since it represents undefined behavior; while the feature is young, we should catch more of these cases to properly fix/address them.
  • Add PipelineRun.
    • Similar to pod_run, this provides a way to interact with a pipeline while it is running which can be useful to poll for state.
    • pipeline_run.attach() provides a way to listen for updates from the agent network to track state.
    • pipeline_run.summarize_dot() provides a minimal way to visualize the state at any point in time as a DOT. When combined with a loop and a DOT->SVG tool (like graphviz's dot CLI), this can provide a live animation of the pipeline run progress.
  • Expose agent_client.start_pipeline_run(..). Similar to its orchestrator counterpart, it will return immediately since it will be processed as detached.
  • Expose agent_client.get_pipeline_result(..). Similar to its orchestrator counterpart, it will wait to respond until the pipeline run has concluded.
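The PipelineJob output_dir layout described above amounts to a simple path join. A minimal sketch (pod_output_dir is a hypothetical helper name, not part of the PR):

```rust
use std::path::PathBuf;

/// Sketch of the pod job mount layout: each pod job's output_dir lives under
/// {pipeline_job_output_dir}/{node_name}/{input_packet_hash}/.
/// (Per the PR description, the hash is currently a random stand-in.)
fn pod_output_dir(
    pipeline_job_output_dir: &str,
    node_name: &str,
    input_packet_hash: &str,
) -> PathBuf {
    [pipeline_job_output_dir, node_name, input_packet_hash]
        .iter()
        .collect()
}
```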
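The cartesian-product behavior of JoinOperator above (the real implementation relies on the itertools crate) can be illustrated with a std-only sketch, where a Packet is simplified to a string map:

```rust
use std::collections::BTreeMap;

/// Hypothetical simplification of the real Packet type: key -> value.
type Packet = BTreeMap<String, String>;

/// Every packet from the left parent stream is paired with every packet
/// from the right parent stream, and each pair is merged into one
/// combined packet for the child node.
fn join(left: &[Packet], right: &[Packet]) -> Vec<Packet> {
    let mut out = Vec::with_capacity(left.len() * right.len());
    for l in left {
        for r in right {
            let mut merged = l.clone();
            merged.extend(r.iter().map(|(k, v)| (k.clone(), v.clone())));
            out.push(merged);
        }
    }
    out
}
```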
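MapOperator's key renaming can similarly be sketched (map_keys is an illustrative name, and the string-valued packet is a simplification of the real type):

```rust
use std::collections::HashMap;

/// Renames packet keys according to `renames`; keys without an entry
/// pass through unchanged.
fn map_keys(
    packet: HashMap<String, String>,
    renames: &HashMap<String, String>,
) -> HashMap<String, String> {
    packet
        .into_iter()
        .map(|(k, v)| (renames.get(&k).cloned().unwrap_or(k), v))
        .collect()
}
```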
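The communication/topic structure above reduces to a string template; a minimal sketch (node_topic is a hypothetical helper, with direction being "input" for nodes referenced in pipeline.input_spec and "output" for all others):

```rust
/// Builds the topic a node listens on for pipeline job coordination:
/// group/{group}/status/pipeline_job/{pipeline_hash}/{input|output}/{node_name}
fn node_topic(group: &str, pipeline_hash: &str, direction: &str, node_name: &str) -> String {
    format!("group/{group}/status/pipeline_job/{pipeline_hash}/{direction}/{node_name}")
}
```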
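The completion and success rules above can be condensed into two predicates (the function names and this simplified NodeState are illustrative, not the PR's actual signatures):

```rust
/// Simplified stand-in for the node states described above.
enum NodeState {
    Completed,
    Failed(String),
    Cancelled,
}

/// A node publishes its own Payload::End (reaching NodeState::Completed)
/// only when every parent stream has ended and the node has no packet
/// in flight or queued.
fn should_publish_end(
    parents_total: usize,
    parents_ended: usize,
    in_flight: usize,
    queued: usize,
) -> bool {
    parents_ended == parents_total && in_flight == 0 && queued == 0
}

/// A pipeline run is successful only if every node completed;
/// any Failed(..) or Cancelled node fails the run as a whole.
fn pipeline_succeeded(node_states: &[NodeState]) -> bool {
    node_states.iter().all(|s| matches!(s, NodeState::Completed))
}
```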

Small features and fixes

  • Add graph utilities. When generating a DOT, additional customizations can be supplied to control styling, e.g. title, caption, node colors, extra labels for nodes, etc. The petgraph crate helps in traversing the graph and generating a DOT. The layout crate helps parse a DOT to create a petgraph::DiGraph.
  • Add output_packet to PodResult, which evaluates the checksum on all expected, generated output. This was needed to convert output packets into input packets in a pipeline run. If it fails, then partial output is allowed. If it succeeds, partial output is not allowed. Fix Add output_packet to Pod Result #89
  • Convert pod.command from &str -> Vec<String> to allow more flexibility. Having this made it simpler to create pipeline test cases. Fix Change command from string to a vec of string #95
  • Add random hash generator (via hex and rand crates). Used for creating non-colliding pod job output directories for processing packets (temporary until we hash input packets) and a pipeline job hash (temporary until we can hash pipeline job consistently).
  • Expand RE_AGENT_KEY_EXPR regex to allow capturing of more metadata. Use it as the primary source for loading metadata.
  • Convert exposed enums over CFFI to have variants with named fields since the UX/help in Python is nicer.
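As a rough illustration of the DOT side of the graph utilities above (the actual code uses petgraph traversal and templating; make_dot here is a bare, std-only sketch over an edge list):

```rust
/// Emits a minimal DOT digraph with an optional title label
/// from a list of (source, destination) node-name pairs.
fn make_dot(title: &str, edges: &[(&str, &str)]) -> String {
    let mut dot = String::from("digraph pipeline {\n");
    dot.push_str(&format!("  label=\"{title}\";\n"));
    for (src, dst) in edges {
        dot.push_str(&format!("  \"{src}\" -> \"{dst}\";\n"));
    }
    dot.push_str("}\n");
    dot
}
```

Piping such output through graphviz's dot CLI (e.g. `dot -Tsvg`) is what enables the live-progress visualization mentioned for pipeline_run.summarize_dot().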

Housekeeping

  • Add pipeline_test.ipynb as a DEMO that illustrates how to use the pipeline feature.
  • Add test cargo feature to allow exposing more to integration tests while still keeping the default API private. Features default and test cannot be combined. This allows us to finally make all of core's submodules private by default.
  • Add to crate diagram.
  • Rename orchestrator::Status -> orchestrator::PodStatus to make it more distinct.
  • Simplify RE_MODEL_METADATA regex.
  • Rename agent_client.submit_pod_jobs(..) -> agent_client.start_pod_jobs(..) to be more consistent with orchestrator API design.

codecov bot commented Jun 29, 2025

@Synicix (Contributor) commented Jul 1, 2025

@guzman-raphael Is this supposed to be your version of the pipeline implementation?

@guzman-raphael (Contributor, Author) commented Jul 1, 2025

@Synicix Since the pipeline feature is a big one, I've been getting a head start really to help me review your PRs.

I'm not sure yet if I'll make this PR "ready for review" but at least wanted to make my brainstorming visible in case it helps with discussions.

@Synicix (Contributor) commented Jul 1, 2025

@guzman-raphael
A lot of the core logic and how I do things won't change much with the pending features I plan on adding later today. So it is like 90% ready for review, since I am still proofreading it, plus the missing verification feature for input_spec.


…b, add map operator, modify pod command to Vec<String> as opposed to String to allow more use cases, apply node shape based on kernel, convert enum variants from unnamed to named fields to improve Python UX, remove unused metadata from make_graph, make core crypto/model functions private by default, apply clippy implicit_hasher suggestion.
@guzman-raphael linked an issue Jul 5, 2025 that may be closed by this pull request
@guzman-raphael changed the title from "Add Pipeline model" to "Add Pipeline support" Jul 5, 2025
@guzman-raphael changed the title from "Add Pipeline support" to "Add Pipeline feature" Jul 5, 2025
…e flexible, add stopgap hash for pipeline job (unique each time), add placeholder for pipeline job scheduler, add to crate diagram, simplify store regex, capture more in available event metadata, and remove unused event info + classifier to simplify.
…le to pipeline scheduler, and increase clippy error size threshold.
…pose `PipelineRun` to keep track of pipeline job state, add `get_pipeline_result` to agent, update pipeline demo, create a packet tuple struct, rename `orchestrator::Status` to `PodStatus` to make more clear, and expose optional customization to generated graphs.
… DOT to petgraph, use jinja for DOT templating, remove SVG support in favor of graphviz CLI, remove layout dependency, and make get util more generic.
…flexible, allow pipeline.make_dot(..) to include/exclude style, and move function optional variables to end to be compatible with uniffi defaults.
…-contained groups, reduce error message size, optimize iteration workflow with llvm-cov, and increase resolution of pipeline execution visualization in demo.
…d to PipelineRun/PipelineResult, improve pipeline execution animation in demo, and add monitor note to pipeline demo.
…l utility to verify pipelines in tests, and stream prints during tests invoked using VSCode in-place GUI.
@Synicix (Contributor) commented Aug 28, 2025

Closing this draft to clear up the PR list (saving the PR logs for reference when needed).

@Synicix closed this Aug 28, 2025


Development

Successfully merging this pull request may close these issues: "Change command from string to a vec of string" and "Add output_packet to Pod Result".