fix: soundness bug in {Expr, Level}.data #8554

digama0 · 2025-05-30T18:21:48Z

This PR fixes an issue with handling of extremely large or deep expressions in the kernel which could lead to unsound behavior in some scenarios.

The problem

Every Expr and Level comes with a small computed field which keeps track of some aggregate data. Because this is a fixed size field, only small or approximate information can be stored, and unlike e.g. the maximum bound on array sizes, this is visible to lean code and the bit tricks are implemented in lean itself. Both types are however using this field for a nominally infinite piece of data: Level tracks the depth of the expression, and Expr tracks the largest loose bvar index. Computed fields are pure functions, which have no error pathway, so this leads to a problem - how to handle overflow?

Currently, this is handled using assert! and/or panic!. But panic! is not a good substitute for error handling, because a panic does not abort execution, and lean continues with the default value for the type. In this case, that means inserting a 0 for the whole Level.Data / Expr.Data field, which is the worst possible answer: it means that the expression appears to have no loose bvars, no mvars, no fvars, even if it actually does, meaning that all of the functions Expr.hasFVar, Expr.hasExprMVar, Expr.hasLevelMVar, Expr.hasMVar, Expr.hasLevelParam, Expr.hasLooseBVars fail to be conservative, which leads to unsoundness (a test is included). See also the discussion on zulip.

The solution

Changing the computed data field to be an Option or otherwise contain error handling would likely be too expensive, and result in significant API changes. Instead, we simply make the data field give conservative answers when an overflow occurs, and avoid contaminating unrelated fields in the bitfield.

In Level, we can simply saturate the depth at the maximum value, as is already done inside Expr. This field isn't even used for anything as far as I can tell, and there should be no added checks since the assertion was already present.
In Expr, the mkData function is performance critical, so I tried to avoid any unnecessary branches or bounds checks. We can again saturate when the bound variable index overflows, but because we subtract 1 from looseBVarRange when constructing a binder, we have to take care to not reduce an overflowed value. This ensures conservativity. (One slight change here is that the maximum bvar number has to be decreased by 1, from 2^20-1 to 2^20-2 to leave space for the overflow value.)

In order to compensate for the removed asserts, the handling of this field is now mediated by the BVarRange type, which is just a wrapper for UInt32. I believe that the fact that this is a bounded type will actually improve performance here; the original version of the code was converting to and from Nat in order to do the arithmetic.

src/Lean/Expr.lean

leanprover-community-bot · 2025-05-30T19:16:01Z

Mathlib CI status (docs):

❗ Batteries/Mathlib CI will not be attempted unless your PR branches off the nightly-with-mathlib branch. Try git rebase 90462e25518a28ff78b96a1e35aa7edadbaa7dac --onto 72141b05fd9a3328c1e5d99211dc4da4495cbd42. You can force Mathlib CI using the force-mathlib-ci label. (2025-05-30 19:16:00)

This PR fixes an adversarial soundness attack described in #8554. The attack exploits the fact that `assert!` no longer aborts execution, and that users can redirect error messages. Another PR will implement the same fix for `Expr.Data`.

Kha · 2025-06-13T13:12:04Z

I'll close this as it is covered by the PR above

digama0 · 2025-06-16T15:28:54Z

If you are referring to #8559, that only covers one half of this issue, and it says "Another PR will implement the same fix for Expr.Data." So maybe this should be left open until that second PR lands.

Rob23oba · 2025-06-16T16:05:01Z

Note: #8560 also got merged

fix: soundness bug in {Expr, Level}.data

a4a8cb7

digama0 added changelog-no Do not include this PR in the release changelog changelog-language Language features and metaprograms and removed changelog-no Do not include this PR in the release changelog labels May 30, 2025

test successfully failed

05b88ae

Rob23oba reviewed May 30, 2025

View reviewed changes

src/Lean/Expr.lean Outdated Show resolved Hide resolved

oops

41bda5b

Rob23oba reviewed May 30, 2025

View reviewed changes

src/Lean/Expr.lean Outdated Show resolved Hide resolved

more oops

d2ffd0d

github-actions bot added the toolchain-available A toolchain is available for this PR, at leanprover/lean4-pr-releases:pr-release-NNNN label May 30, 2025

leodemoura mentioned this pull request May 30, 2025

fix: block adversarial exploit of non-aborting assert! #8559

Merged

Kha closed this Jun 13, 2025

digama0 mentioned this pull request Jun 16, 2025

fix: opacify functions using {Expr,Level}.data #8821

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: soundness bug in {Expr, Level}.data #8554

fix: soundness bug in {Expr, Level}.data #8554

Uh oh!

digama0 commented May 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

leanprover-community-bot commented May 30, 2025

Uh oh!

Kha commented Jun 13, 2025

Uh oh!

digama0 commented Jun 16, 2025

Uh oh!

Rob23oba commented Jun 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fix: soundness bug in {Expr, Level}.data #8554

fix: soundness bug in {Expr, Level}.data #8554

Uh oh!

Conversation

digama0 commented May 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

The problem

The solution

Uh oh!

Uh oh!

Uh oh!

leanprover-community-bot commented May 30, 2025

Uh oh!

Kha commented Jun 13, 2025

Uh oh!

digama0 commented Jun 16, 2025

Uh oh!

Rob23oba commented Jun 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

digama0 commented May 30, 2025 •

edited

Loading