refactor: separate statistic computation #411

tristan-f-r · 2025-10-10T06:33:29Z

We also make graph statistics lazy. Laziness isn't used in summary.py, but I assume that we'll have more computationally expensive graph statistics as SPRAS develops, especially when it can take long to compute for our larger graphs.

Most importantly, this separates graph statistics into a separate function, so we can reuse the code for graph heuristic pruning.

we also make it lazy

read-the-docs-community · 2025-10-10T06:34:25Z

Documentation build overview

📚 spras | 🛠️ Build #30218572 | 📁 Comparing c675ece against latest (c3b02cd)

🔍 Preview build

No files changed.

tristan-f-r · 2025-10-14T17:39:58Z

Building on top of this PR allows me to add graph heuristics.

Most likely, every tuning PR will be at least marked with P-medium unless it's an end result.

agitter · 2025-11-07T22:47:27Z

Before I can review the implementation of the change, I need to better understand what problem we are tying to solve with the change. Where will laziness be needed in the future?

we can reuse the code for graph heuristic pruning

Do we envision calling graph statistic computation twice per graph? After we compute these statistics on a graph once, shouldn't that be sufficient for an entire pass of a workflow?

tristan-f-r · 2025-11-07T23:53:18Z

I was going to ask @ntalluri about this, since I wasn't quite sure if we will have expensive graph heuristics or not.

Do we envision calling graph statistic computation twice per graph? After we compute these statistics on a graph once, shouldn't that be sufficient for an entire pass of a workflow?

I did decouple this from analysis: summary: enabled: true, and I imagined it like this. I didn't think about that, though: would it make sense to have graph summary statistics always enabled the moment any heuristics are enabled?

agitter · 2025-11-08T04:25:01Z

There could be more than one way to design this sensibly. One would be that if heuristics are enabled in the config file, that automatically generates the graph summary table. The produces more output than requested, which is slightly undesirable.

Another could be to move the heuristic calculations inside each --parameters> subdirectory, which may be where you are headed. If that is written as a file for that one pathway, it could be consumed for heuristics (or used for heuristics and then written to disk). Later, if the graph summary table is requested, it would grab the precomputed statistics from those files in the subdirectories.

tristan-f-r · 2025-11-08T08:06:01Z

I'll mark this as a draft for now and design something in line with your second proposal.

refactor: separate statistic computation

6ec4f62

we also make it lazy

tristan-f-r added tuning Workflow-spanning algorithm tuning refactor Changes that don't actually improve anything except for code quality. labels Oct 10, 2025

tristan-f-r added 2 commits October 10, 2025 06:48

fix: correct tuple assumption

9987189

fix: stably use graph statistic values

25eef5e

tristan-f-r requested a review from ntalluri October 14, 2025 17:38

tristan-f-r added the P-medium medium prirotity; this is needed for some external service or another PR label Oct 14, 2025

style: fmt

cb373c1

github-actions bot added the merge-conflict This PR has merge conflicts. label Oct 30, 2025

tristan-f-r mentioned this pull request Oct 30, 2025

feat: heuristics #431

Open

1 task

Merge branch 'main' into lazy-stats

47a9e26

github-actions bot removed the merge-conflict This PR has merge conflicts. label Oct 30, 2025

tristan-f-r and others added 2 commits October 29, 2025 18:15

style: specify zip strict

898d568

fix: make undirected for determining number of connected components

c675ece

tristan-f-r marked this pull request as draft November 8, 2025 08:06

ntalluri removed the P-medium medium prirotity; this is needed for some external service or another PR label Nov 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor: separate statistic computation #411

refactor: separate statistic computation #411

Uh oh!

tristan-f-r commented Oct 10, 2025 •

edited

Loading

Uh oh!

read-the-docs-community bot commented Oct 10, 2025 •

edited

Loading

Uh oh!

tristan-f-r commented Oct 14, 2025 •

edited

Loading

Uh oh!

agitter commented Nov 7, 2025

Uh oh!

tristan-f-r commented Nov 7, 2025

Uh oh!

agitter commented Nov 8, 2025

Uh oh!

tristan-f-r commented Nov 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

refactor: separate statistic computation #411

Are you sure you want to change the base?

refactor: separate statistic computation #411

Uh oh!

Conversation

tristan-f-r commented Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

read-the-docs-community bot commented Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Documentation build overview

Uh oh!

tristan-f-r commented Oct 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

agitter commented Nov 7, 2025

Uh oh!

tristan-f-r commented Nov 7, 2025

Uh oh!

agitter commented Nov 8, 2025

Uh oh!

tristan-f-r commented Nov 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tristan-f-r commented Oct 10, 2025 •

edited

Loading

read-the-docs-community bot commented Oct 10, 2025 •

edited

Loading

tristan-f-r commented Oct 14, 2025 •

edited

Loading