perf: rewrite normalize_latent_structure() in Rust by jolars · Pull Request #269 · frederikfabriciusbjerre/caugi

jolars · 2026-04-22T08:38:06Z

Implement normalize_latent_structure() in Rust instead to improve
performance. This change is stacked on top of #268, which needs to be merged first.

library(caugi)

# Old implementation
normalize_latent_structure_reference_r <- function(cg, latents) {
  latents <- unique(latents)

  if (length(latents) == 0L) {
    return(cg)
  }

  cg <- exogenize(cg, nodes = latents)

  changed <- TRUE
  while (changed) {
    changed <- FALSE
    current_latents <- intersect(latents, nodes(cg)$name)

    if (length(current_latents) == 0L) {
      break
    }

    child_counts <- vapply(
      current_latents,
      function(l) {
        ch <- children(cg, l)
        if (is.null(ch)) 0L else length(ch)
      },
      integer(1)
    )
    to_drop <- current_latents[child_counts <= 1L]

    if (length(to_drop) > 0L) {
      cg <- remove_nodes(cg, name = to_drop)
      changed <- TRUE
      next
    }

    current_latents <- intersect(latents, nodes(cg)$name)
    if (length(current_latents) < 2L) {
      break
    }

    child_sets <- lapply(
      current_latents,
      function(l) {
        ch <- children(cg, l)
        if (is.null(ch)) character(0) else sort(unique(ch))
      }
    )

    drop_one <- NULL
    for (i in seq_len(length(current_latents) - 1L)) {
      for (j in (i + 1L):length(current_latents)) {
        ch_i <- child_sets[[i]]
        ch_j <- child_sets[[j]]

        if (length(ch_i) < length(ch_j) && all(ch_i %in% ch_j)) {
          drop_one <- current_latents[i]
          break
        }
        if (length(ch_j) < length(ch_i) && all(ch_j %in% ch_i)) {
          drop_one <- current_latents[j]
          break
        }
      }
      if (!is.null(drop_one)) {
        break
      }
    }

    if (!is.null(drop_one)) {
      cg <- remove_nodes(cg, name = drop_one)
      changed <- TRUE
    }
  }

  cg
}

bench::press(
  n = c(100, 200),
  p = c(0.5, 0.9),
  {
    p_mod <- 10 * log10(n) / n * p
    cg <- caugi::generate_graph(n = n, p = p_mod, class = "DAG")
    k <- max(2L, as.integer(round(0.1 * n)))
    latents <- sample(caugi::nodes(cg)$name, size = k)

    bench::mark(
      rust = caugi::normalize_latent_structure(cg, latents = latents),
      reference_r = normalize_latent_structure_reference_r(
        cg,
        latents = latents
      )
    )
  }
) |>
  plot()
#> Running with:
#>       n     p
#> 1   100   0.5
#> 2   200   0.5
#> 3   100   0.9
#> 4   200   0.9

^{Created on 2026-04-22 with reprex v2.1.1}

Implement `normalize_latent_structure()` in Rust instead to improve performance. This change is stacked on top of frederikfabriciusbjerre#268.

jolars · 2026-04-22T09:26:56Z

Okay, this one is unblocked and rebased. Ready for review!

jolars requested review from BjarkeHautop and frederikfabriciusbjerre April 22, 2026 08:38

perf: rewrite normalize_latent_structure() in Rust

c876736

Implement `normalize_latent_structure()` in Rust instead to improve performance. This change is stacked on top of frederikfabriciusbjerre#268.

jolars force-pushed the optimize-normalize-latents branch from f840e27 to c876736 Compare April 22, 2026 09:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: rewrite normalize_latent_structure() in Rust#269

perf: rewrite normalize_latent_structure() in Rust#269
jolars wants to merge 1 commit intofrederikfabriciusbjerre:mainfrom
jolars:optimize-normalize-latents

jolars commented Apr 22, 2026

Uh oh!

jolars commented Apr 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jolars commented Apr 22, 2026

Uh oh!

jolars commented Apr 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant