feat: optimized simp routine for let telescopes #8968

Conversation
Did you mean "so zeta reducing only dependent lets" in the last sentence, referring to the behavior with
Mathlib CI status (docs):
This PR adds the following features to `simp`:

- A routine for simplifying `let` telescopes, like what leanprover#6220 did for `letFun` telescopes. Furthermore, simp converts `letFun`s into `have`s (nondependent lets), and we remove the leanprover#6220 routine.
- A `+letToHave` configuration option (enabled by default) that converts lets into haves when possible, when `-zeta` is set. Previously Lean would need to do a full typecheck of the bodies of `let`s, but the `letToHave` procedure can be faster, and it modifies the `let`s in an entire expression at once.
- A `+zetaHave` configuration option, to turn off zeta reduction of `have`s specifically. The motivation is that dependent `let`s can only be dsimped by let, so zeta reducing just nondependent lets is a reasonable way to make progress.
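As background for the `letToHave` option, here is a minimal sketch (the goal is made up for illustration, not taken from the PR's tests) of the fact it relies on: a `have` is a nondependent binding, and a nondependent `let` and the corresponding `have` are definitionally equal.

```lean
-- Illustrative only: the body's type (`Nat`) does not depend on the value of
-- `x`, so the `let` can be expressed as a `have`, and the two forms are
-- definitionally equal, which is why such a conversion is sound.
example : (let x := 2; x + x) = (have x := 2; x + x) := rfl
```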
and then after that we have the versions that `simpHaveTelescope` actually uses,
which avoid this issue.
-/
/-
@kmill I will add this issue to my todo list for next quarter. Could you please send me examples that expose the quadratic behavior? Have you checked whether the external checker written in Rust also has this performance issue?
The deep telescopes at the ends of `simpHave.lean` and `simpLetFunIssue.lean` illustrate the issue. They also have a separate issue arising from using `simp` to unfold a recursive function that introduces a `have` to the telescope. (This is what the `id id` hack is addressing.)
I'll make some more examples for you next quarter.
I haven't had a chance to compare this with external checkers yet. I would be surprised if they did not have a similar issue.
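(For context, a minimal sketch of the kind of deep telescope in question; this is an assumed shape, not the actual contents of `simpHave.lean`:)

```lean
-- Assumed shape (illustrative): a chain of nondependent bindings where each
-- one refers to the previous one. Making the chain thousands of bindings deep
-- is what exposes quadratic behavior in a naive simplification routine.
example : (have a := 1; have b := a + 1; have c := b + 1; c + 1) = 4 := by
  simp  -- with default settings, the bindings are substituted and evaluated
```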
theorem have_body_congr' {α : Sort u} {β : Sort v} (a : α) {f f' : α → β}
    (h : ∀ x, f x = f' x) : f a = f' a :=
  h a
@kmill As far as I understand, the theorems above are a workaround for the performance issue and are not stated the way you wanted to state them. If I understood correctly, could you please include the desired version in the comment?
The desired ones are in the comment. (I added the desired ones in a previous PR, and in this PR I commented them out to make sure I didn't accidentally use them.)
I haven't confirmed it yet, but I think there's still a similar performance issue with these `∀ x, f x = f' x` hypotheses causing beta reductions. We can talk about it next week.
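(To make the contrast concrete, here is a guess, with illustrative names, at the shape under discussion; the actual commented-out statements in the PR may differ.)

```lean
universe u v

-- Hypothetical "natural" statement, phrased on `have` directly. As explained
-- in the PR description further down, statements of this shape make the
-- kernel compare `have` bodies, which triggers zeta reduction in `is_def_eq`.
theorem have_body_congr_direct {α : Sort u} {β : Sort v} (a : α) {f f' : α → β}
    (h : ∀ x, f x = f' x) : (have x := a; f x) = (have x := a; f' x) :=
  h a

-- The workaround shape actually used keeps the `have` as an application of a
-- lambda to the value, as in `have_body_congr'` above: `f a = f' a`.
```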
      deps.insert idx
    else
      deps
  return { info with bodyDeps, bodyTypeDeps, body, bodyType, level }
We didn't have time to run a full SMT-LIB run on this yet, but testing some of the files that have long let chains and are solved only by rewriting, I can already see significant improvements, yes.

Baseline:

λ time lake env .lake/build/bin/leanwuzla --disableEmbeddedConstraintSubst --timeout=1200 --maxSteps=100000000 --disableKernel --maxRecDepth=1048576 --maxHeartbeats=20000000000 /home/henrik/smtlib/non-incremental/QF_BV/sage/app7/bench_4994.smt2
Normalizing
unsat
lake env .lake/build/bin/leanwuzla --disableEmbeddedConstraintSubst 51.34s user 0.23s system 99% cpu 51.691 total

With this PR:

λ time lake env .lake/build/bin/leanwuzla --disableEmbeddedConstraintSubst --timeout=1200 --maxSteps=100000000 --disableKernel --maxRecDepth=1048576 --maxHeartbeats=20000000000 /home/henrik/smtlib/non-incremental/QF_BV/sage/app7/bench_4994.smt2
Normalizing
unsat
lake env .lake/build/bin/leanwuzla --disableEmbeddedConstraintSubst 14.85s user 0.11s system 99% cpu 14.961 total

The fresh profile also looks quite sane to me (apart from the already known `check assignment quick` not being as quick as its name suggests); none of the previous deep `whnf` calls during discrimination tree lookups and the like show up anymore.
  by detecting `simpHaveTelescope` proofs and removing the type hint.
-/
def simpHaveTelescope (e : Expr) : SimpM Result := do
  Prod.fst <$> withTraceNode `Debug.Meta.Tactic.simp (fun
@kmill Have you observed a positive impact on performance?
Yes, and in particular:

- for the `letE` encoding of `have` this is a huge win
- it appears to be about as fast as (maybe marginally faster than) the `letFun` telescope code, while handling dependent types

Though, as we've discussed, the kernel checking is slower (about 2x the time); that should be addressable eventually.
This PR adds the following features to `simp`:

- A routine for simplifying `have` telescopes in a way that avoids quadratic complexity arising from locally nameless expression representations, like what "feat: proper `let_fun` support in `simp`" (#6220) did for `letFun` telescopes. Furthermore, simp converts `letFun`s into `have`s (nondependent lets), and we remove the #6220 routine since we are moving away from `letFun` encodings of nondependent lets.
- A `+letToHave` configuration option (enabled by default) that converts lets into haves when possible, when `-zeta` is set. Previously Lean would need to do a full typecheck of the bodies of `let`s, but the `letToHave` procedure can skip checking some subexpressions, and it modifies the `let`s in an entire expression at once rather than one at a time.
- A `+zetaHave` configuration option, to turn off zeta reduction of `have`s specifically. The motivation is that dependent `let`s can only be dsimped by let, so zeta reducing just the dependent lets is a reasonable way to make progress. The `+zetaHave` option is also added to the meta configuration.
- When `simp` is zeta reducing, it now uses an algorithm that avoids complexity quadratic in the depth of the let telescope.
- `simp`, `whnf`, and `isDefEq` are now all consistent in how they apply the `zeta`, `zetaHave`, and `zetaUnused` configurations.

The `letToHave` option addresses a TODO in `getSimpLetCase` ("handle a block of nested let decls in a single pass if this becomes a performance problem").

Performance should be compared to before #8804, which temporarily disabled the #6220 optimizations for `letFun` telescopes.

Good kernel performance depends on carefully handling the `have` encoding. Due to the way the kernel instantiates bvars (it does not beta reduce when instantiating), we cannot use congruence theorems of the form `(have x := v; f x) = (have x := v'; f' x)`, since the bodies of the `have`s will not be syntactically equal, which triggers zeta reduction in the kernel in `is_def_eq`. Instead, we work with `f v = f' v'`, where `f` and `f'` are lambda expressions. There is still zeta reduction, but only when converting between these two forms at the outset of the generated proof.
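To make the last point concrete, here is a minimal sketch, with illustrative names, of a congruence lemma in the `f v = f' v'` style; the actual lemmas used by `simpHaveTelescope` may be stated differently.

```lean
universe u w

-- Sketch (illustrative): the `have` telescope is packaged as a lambda `f`
-- applied to the value `v`, so the statement never mentions `have` at all and
-- the kernel is not forced to compare `have` bodies syntactically.
theorem have_congr_sketch {α : Sort u} {β : Sort w} {v v' : α} {f f' : α → β}
    (h₁ : v = v') (h₂ : ∀ x, f x = f' x) : f v = f' v' := by
  subst h₁
  exact h₂ v
```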