Replace irreducible loop analysis and transform #578

katrinafyi · 2025-09-29T03:08:21Z

Replaces every line of IrreducibleLoops.scala. This reimplements the algorithm in a way much closer to the paper. This new code does much less explicit bookkeeping work. It replaces some old code that went through the dfsp_pos with new code that does some kind of topologically-ordered dynamic programming. In doing so, it is my hope that it is more obviously correct.

Also:

fixes certain bugs in the transform relating to topological order
removes state machine
adds mutability in data structures to make it closer to the paper
fixes analysis and transform for nested irreducible loops

Closes #410

youre gonan need a final there

- topological sorting based on dfspos to build nodes set - construct edge sets?? find out what the transform needs. maybe just needs to identify internal vs external edges into a header node. - in transform, what of assume conditions?? if a header has assumes, do we need to "hoist" that to the new reducible header?

subsequent loops after transforming earlier loops? also check assume statements

i think we have to redirect them via N. but only for those which originate at nodes which are /not/ the header???????????????????????????????????????????????

outermost loop where they remain valid. that is, they should be hoisted to the outermost loop containing "to" where "from" is not internal to the loop. i think to do this we need some notion of "self nodes", the nodes which appear in /only/ a particular cycle and no other subcycles. is a "self node" simply defined by iloop_header?????? much to think about

This reverts commit ef4b8ec.

…ing a node?" This reverts commit 30ae7d6.

This reverts commit 3b608ae.

This reverts commit 8b71e14.

should work without filtering now

katrinafyi · 2025-10-01T03:50:50Z

For those curious, here is an example run through the new pipeline. This is Fig 3 from the paper (there's a copy in Zulip).

It has an obvious irreducible loop in {a,b,c,d}, but it also has sub-loops. The subsets {b,c,d}, {c,d} are also their own loops because they each form a strongly-connected component. These loops are nested within each other, as depicted by the loop nesting forest. In the paper's tree diagram, the round loop nodes denote (primary) loop headers, and the square nodes are blocks within a loop which are not headers. To read off which nodes are "internal" to a particular loop or sub-loop, you would simply look at all descendants of the loop's header.

This figure is in the IrreducibleLoop test suite in the fig3 test case. You can run it with

./mill -w test.testOnly IrreducibleLoop -- -z 'paper fig3'

and this will print the BlockLoopInfo for each block in the CFG:

IrreducibleLoop:
- paper fig3
  + BlockLoopInfo(%S,None,1,Set(),Set()) 
  + BlockLoopInfo(%a,None,2,Set(%a, %d),Set(%a, %b, %c, %d)) 
  + BlockLoopInfo(%b,Some(%a),3,Set(%b, %d),Set(%b, %c, %d)) 
  + BlockLoopInfo(%c,Some(%b),4,Set(%c, %d),Set(%c, %d)) 
  + BlockLoopInfo(%d,Some(%c),5,Set(),Set()) 
  + BlockLoopInfo(%E,None,5,Set(),Set())

The fields in BlockLoopInfo, in order, are:

the block,
optional parent node in the loop nesting forest,
(less important rn) DFS order number, used in topological ordering,
set of loop headers, including secondary (irreducible) headers, and
set of loop nodes.

You can see that the information printed here matches the paper's loop nesting forest, with the small addition of irreducible loop headers which aren't in the paper diagram. Note that irreducible entries can enter any of the sub-loops of the irreducible header.

Now, we will run the transform and show the result in fig3.dot.pdf. This is harder to reason about because the paper doesn't cover the transform.

However, you can try to follow certain valid paths in the original graph and observe that they remain possible in the transformed graph. You can also try invalid paths and observe that they are still not possible.

a -> b -> c -> d is still possible. It just goes through all the normalised headers first.
a -> d is still impossible. Although there are edges to facilitate this path, upon reaching a you would set b_loop_from: int := 0x0; which is not an allowed value in the assume statements of d.

Running the loop analysis again gives us that all the loops are now reducible; the header sets have at most one element.

  + BlockLoopInfo(%S,None,1,Set(),Set()) 
  + BlockLoopInfo(%a_loop_N,None,2,Set(%a_loop_N),HashSet(%d, %b, %a_loop_N, %b_loop_N, %a, %c, %c_loop_N)) 
  + BlockLoopInfo(%a,Some(%a_loop_N),3,Set(),Set()) 
  + BlockLoopInfo(%b_loop_N,Some(%a_loop_N),4,Set(%b_loop_N),HashSet(%d, %b, %b_loop_N, %c, %c_loop_N)) 
  + BlockLoopInfo(%b,Some(%b_loop_N),5,Set(),Set()) 
  + BlockLoopInfo(%c_loop_N,Some(%b_loop_N),6,Set(%c_loop_N),Set(%c_loop_N, %d, %c)) 
  + BlockLoopInfo(%E,None,6,Set(),Set()) 
  + BlockLoopInfo(%c,Some(%c_loop_N),7,Set(),Set()) 
  + BlockLoopInfo(%d,Some(%c_loop_N),8,Set(),Set())

Finally, as a comparison with the old code, this is the same example run through the old loop detector:

IrreducibleLoop:
- paper fig3
  + %a: Header: block.a, Body: HashSet((block.a, block.b)), Nodes: HashSet(%a, %b), Reentries: HashSet((block.S, block.d)) 
  + %b: Header: block.b, Body: HashSet(), Nodes: HashSet(), Reentries: HashSet((block.S, block.d)) 
  + %c: Header: block.c, Body: HashSet(), Nodes: HashSet(), Reentries: HashSet((block.S, block.d)) 
  + AFTER 
  + %c_loop_N: Header: block.c_loop_N, Body: HashSet((block.c_loop_N, block.c), (block.c, block.b_loop_N), (block.c_loop_N, block.d)), Nodes: HashSet(%c_loop_N, %b_loop_N, %c, %d), Reentries: HashSet()

This has some problems. The biggest problem is that the sets of internal nodes and body edges are mostly incomplete. They seem to be missing edges and nodes which appear in multiple loops. As a result of this (or other bugs), applying the transform leads to a broken CFG. After the transform, blocks a and b have no predecessors, and the exit block E is entirely disconnected from the rest graph ;-; This is rather surprising. I think the old algorithm was mostly broken when loops became nested.

This is the patch to add this test case on top of the old loop detector: 0001-add-irred-fig3-to-old-loop-detetcor.patch.

l-kent · 2025-11-03T02:05:24Z

src/main/scala/analysis/IrreducibleLoops.scala

+  case class BlockLoopState(
+    val b: Block,
+    var iloop_header: Option[Block],
+    var dfsp_pos: Int,
+    var dfsp_pos_max: Int,
+    var is_traversed: Boolean,
+    var headers: Set[Block]
+  ) {


Is there a reason this is a case class instead of a class? It doesn't seem to use any case class features and having mutability in a case class can cause unintuitive behaviour so it's generally best avoided.

It was probably because I wanted a nice tostring while I was working on it. I think this is nice to keep for debugging in future. I think the only problem is that hashcode may change, and that is only a problem if using it in a hashed collection.

I can add a comment warning against that if you want. Would that be enough, or should it be uncased?

The cleanest approach is probably to just write a simple toString method to have that for debugging purposes and then make it a class?

It can be done, loath as I am to add more lines :')

l-kent · 2025-11-03T02:06:44Z

This seems generally fine, apart from the one question I have about the mutable case class.

katrinafyi added 30 commits September 29, 2025 11:57

add test

80a7f0a

add paper test case

124ea50

NewLoopDetector works to make the cycle forest

ed4e2bb

starting tailrec

a8c4dad

youre gonan need a final there

tailrec!! i am very smart

6529779

touch

d02f625

fix self-loops

3203e1d

blockloopstate/info

de88adf

less classes

864a11d

clean up tests before topo traversel

a6a8b13

tailrec docs

28f7af3

dfs node calculation

dff267e

blahhhhhhhhhh. discrepancy in reentries in plist

4701bf1

implement transform. it seems to work. TODO: do we need to update

bae489f

subsequent loops after transforming earlier loops? also check assume statements

loop transform is broken?????????? even on very simple loops

b52b065

blah. starting rewrite of irred transform

3750f5a

write transform. TODO: assert test cases using IrreducibleTransformInfo

b9d7b37

doccomment

fad13e8

add test for IrreducibleTransoformInfo

b8e0368

WHAT DO FOR BACK EDGES INTO SECONDARY HEADERS??!

9843edf

i think we have to redirect them via N. but only for those which originate at nodes which are /not/ the header???????????????????????????????????????????????

fix crossover case by excluding internal edges to secondary headers

2d3d06a

add fig4 test cases

78008e3

first working hoisting

c81d493

do we actually need to attach re-entries to ALL loops containing a node?

e0d0c90

try

fa6d19e

Revert "try"

ec82ef0

This reverts commit ef4b8ec.

Revert "do we actually need to attach re-entries to ALL loops contain…

d886f24

…ing a node?" This reverts commit 30ae7d6.

Revert "first working hoisting"

bb0caad

This reverts commit 3b608ae.

katrinafyi and others added 19 commits September 29, 2025 11:57

Reapply "stash broken removal of hoisting"

a40232a

This reverts commit 8b71e14.

touch

93c0548

touch 2

10603cc

BROKEN: deleting Loop class. TODO: updateIrWithLoops inLoop

f89d1cf

we dont have to hoist manually

2c305ed

fix scala oopsie?!

e6a01c8

should work without filtering now

touch performance

edfaaad

dont need to filter anymore

9dfe50a

implement Block.loopInfo and clean up naming

b31356f

triforce test

6db4073

triforce

8f370ed

add plist-free.il file

e0b546b

docs

dabeb30

docs 2

f688d1d

revert runutils

949d0c8

we don't need to compute headers separately

aa833b9

in link checking, bump up max redirects because of springer

9a799ff

oops

636671b

some code to help looking at fig3 test case

76e1bd8

katrinafyi added 5 commits October 1, 2025 14:06

Merge remote-tracking branch 'origin/main' into fused-irred-loop-fix

0af89d4

add assertion that entry has no predecessors

fd080b9

fix ProcedureAnnotationTests w non-entered entry block and new functions

b6095b2

Merge remote-tracking branch 'origin/main' into fused-irred-loop-fix

43fa353

reduce test case

9317b9c

l-kent reviewed Nov 3, 2025

View reviewed changes

review: un case class for BlockLoopState

a56704e

katrinafyi merged commit 2a864c9 into main Nov 4, 2025
20 of 21 checks passed

katrinafyi deleted the fused-irred-loop-fix branch November 4, 2025 01:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Replace irreducible loop analysis and transform #578

Replace irreducible loop analysis and transform #578

Uh oh!

katrinafyi commented Sep 29, 2025

Uh oh!

katrinafyi commented Oct 1, 2025 •

edited

Loading

Uh oh!

l-kent Nov 3, 2025

Uh oh!

katrinafyi Nov 3, 2025

Uh oh!

l-kent Nov 3, 2025

Uh oh!

katrinafyi Nov 3, 2025

Uh oh!

l-kent commented Nov 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Replace irreducible loop analysis and transform #578

Replace irreducible loop analysis and transform #578

Uh oh!

Conversation

katrinafyi commented Sep 29, 2025

Uh oh!

katrinafyi commented Oct 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

l-kent Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

katrinafyi Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

l-kent Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

katrinafyi Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

l-kent commented Nov 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

katrinafyi commented Oct 1, 2025 •

edited

Loading