Add Lean 4 support by Racemuis · Pull Request #63 · SakanaAI/ShinkaEvolve

Racemuis · 2026-01-05T11:33:56Z

Summary

This pull request extends the ShinkaEvolve framework with the automatic formalization and validation in Lean. The implementation is based on the LeanInteract Python package, which interacts with Lean 4 through the Lean REPL.

The generated formalizations are validated by checking the resulting Lean proofs on correctness and completeness.

Motivation

Being a functional programming language and (interactive) theorem prover, Lean allows to express scientific ideas a explicit and verifiable way without requiring the use of higher-level programming languages like Python.

Key Changes

Lean 4 edit support

Add Lean block markers to /shinka/edit/apply_diff.py.
Add Lean block markers to /shinka/edit/apply_full.py.
Add Lean block markers to /shinka/edit/async_apply.py.

Core

Add Lean support in /shinka/core/runner.py.
Add /examples/autoformalization/initial.lean - an example of an initial Lean program.
Add lean-interact to the project's dependencies in pyproject.toml.

Theorem proving and validation

Add /utils/utils_lean.py - containing all utilities for proof generation and validation through the Lean REPL.
Add --prover_model flag to eval_hydra.py.
Propagate the prover_model flag in /shinka/launch/scheduler.py.
Add /examples/autoformalization/evaluate.py - a simple example of evaluating a Lean program.
Add .lean option and automated proof generation to /shinka/core/wrap_eval.py.

Usage examples

Installation & Quick start

The evolution of Lean programs follows the principles of the existing ShinkaEvolve framework. However, a Lean 4 installation is required to automatically validate the generated programs.

Tip

You can install Lean using the install-lean command from LeanInteract after cloning the ShinkaEvolve repository and installing all dependencies.

# Clone the repository
git clone https://github.com/SakanaAI/ShinkaEvolve
# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh  

# If your system doesn't have curl, you can use wget or pip to install it
# wget -qO- https://astral.sh/uv/install.sh | sh
# pip install uv

# Create environment and install Shinka
cd ShinkaEvolve
uv venv --python 3.11
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
uv pip install -e .

# Install Lean 4 via LeanInteract
install-lean  # Requires an elan version >= 4.0.0 (elan --version) 

# Run your first evolution experiment
shinka_launch variant=autoformalization_example

Configuration

The changes made include an LLM-based prover_model for the automated completion of Lean proofs from the generated formalizations. This model is specified in /configs/evolution/*.yaml. Even though the default model is set to gpt-5-nano, it is recommended to use a dedicated prover model like deepseek-ai/DeepSeek-Prover-V2-7B.

/configs/evolution/*_budget.yaml:

...
prover_model: "your_prover_model_here"
...

All other configurations options remain the same.

Note

Some prover models require local hosting. Check out the official guide for setting up local LLM support on directions how to implement this.

Testing

Evolution and evaluation completed without errors using
- the OpenAI API (this PR supports the same collection of remote models as the existing framework).
- locally hosted Qwen models (local hosting on itself is not included in this PR).
- locally hosted Deepseek models (local hosting on itself is not included in this PR).

Scope

This PR includes changes to support the evolution of programs in Lean 4. It does not feature any changes that support other things, such as hosting local models.

fix: Fix database summary when patch_name metadata is missing

…g model

Fix packaging so pip install ships full shinka module tree

… of expected 2 (end and start) markers

fix `apply_full.py` when the patch has incomplete markers

Doc explaining how to add suport for a local LLM and embedding model

…rust Add rust to supported languages

docs: change repo name on the onboarding doc

add google gemini embeding model

Enhance docs, robustify wrap_eval, Visualization w/o API key

update gemini embedding price

add gemini-3-flash-preview

Add GPT 5.2

… various LLM families

RobertTLange · 2026-01-14T13:45:10Z

Hi @Racemuis,

Thank you so much for this! Really really exciting. I am personally not a fan of 'stuffing' too much programming language-specific support into the core ShinkaEvolve codebase. But rather have this handled by the user itself (or within an example sub-directory). What I mean by that is not the necessary markers/general syntax utilities but more the evaluation framework itself, e.g., the prover_model and utils_lean.py. I would rather have the following:

main.lean --> evaluate.py --> eval with utils_lean [external to shinka] --> return metrics.json and correct.json

So the entry point would always be the python evaluate.py script, but the evaluation logic (validate_lean) is handled outside of the core codebase. Thereby, Shinka really just focuses on the program optimization/evolution logic. Do you think that would be possible? Also, are there any things that learninteract does not support/limit or is it really just a simple wrapper?

Cheers,
Rob

Racemuis · 2026-03-11T08:14:52Z

Hi @RobertTLange,

Thank you for your kind reply and helpful comments! I refactored the PR by moving the supporting files to the examples/autoformalization folder, creating a standalone use-case for evolving formalizations in Lean 4.

I aimed to make reduce all changes made in ShinkaEvolve's core: I removed the dependency on the prover_model so that the only changes to the core now include the inclusion of language specific comment markers for the evolving blocks.

All changes made are tested using the OpenAI API, and the evolution and evaluation completed without errors.

Let me know whether there are any other changes I can make! I am happy to incorporate anything.

RobertTLange · 2026-03-13T03:47:07Z

Superseded by #90, which carries the refreshed SakanaAI:lean_support branch on top of current main. PR #63 still points at Racemuis:lean_support, so it cannot be repointed to the SakanaAI branch.

RobertTLange and others added 30 commits September 25, 2025 06:38

Update README.md with arxiv

1b4c179

add google gemini embeding model

2fb7548

fix: Fix database summary when patch_name metadata is missing

27af71c

Update README.md

9586cdb

Merge pull request SakanaAI#2 from dexhunter/fix/display

396c66a

fix: Fix database summary when patch_name metadata is missing

docs: change repo name on the onboarding doc

a60bc9e

Update README

0003552

Added a doc explaining how to add suport for a local LLM and embeddin…

be2e203

…g model

Add rust to supported languages

bf0c1d4

Ensure setuptools discovers subpackages

77d1819

Mark shinka.webui as a package

929f072

Merge pull request SakanaAI#18 from SakanaAI/fix-packaging

59a338c

Fix packaging so pip install ships full shinka module tree

fix apply_full.py when the patch has incomplete (0,1) markers instead…

23ace36

… of expected 2 (end and start) markers

Merge pull request SakanaAI#21 from 51616/fix-full-patch-no-markers-bug

06209a2

fix `apply_full.py` when the patch has incomplete markers

Merge pull request SakanaAI#12 from vicruz99/feature/local-models

c9c468b

Doc explaining how to add suport for a local LLM and embedding model

Update README.md

c5b1abe

Merge branch 'main' into lia/add-support-for-rust

ccc1326

Merge pull request SakanaAI#15 from LiaCastaneda/lia/add-support-for-…

e8ef6de

…rust Add rust to supported languages

Merge pull request SakanaAI#7 from Koki-Kazaore/docs/change_repo_name

d2211b2

docs: change repo name on the onboarding doc

Update inspirations.py - archive

ded4576

Merge pull request SakanaAI#1 from takeruhukushima/main

7ceea8c

add google gemini embeding model

Update dependencies gemini embed

ee6e8a5

Update dbase.py path default

a759778

Fix reasoning token sampling

c097a88

Fix anthropic budget sampling

6d5e208

fix shinka_launch --help

9b4d7c7

fix wrap_eval catch

d7a3f7e

add documentation for resuming experiments

397e0fd

fix OAI dependency db for visualization

f6896dc

Merge pull request SakanaAI#28 from SakanaAI/fix_minor

94a2805

Enhance docs, robustify wrap_eval, Visualization w/o API key

RobertTLange and others added 20 commits November 22, 2025 17:13

Update README.md

ecf762b

Update getting_started.md

c686d7f

Update apply_diff.py

bad5b37

[example] add autoformalization example from natural language to lean

e72fb60

[feat] add lean 4 evaluation

4cd2dc4

[feat] add lean 4 edit support

7333b8b

[example] add an autoformulation example based on group theory

7a1dc9d

update gemini embedding price

91924ee

add gemini-3-flash-preview

580cefd

Merge pull request SakanaAI#59 from ifsheldon/gemini-embedding

5ee0e49

update gemini embedding price

Merge pull request SakanaAI#60 from ifsheldon/gemini3-flash

9f9917c

add gemini-3-flash-preview

add GPT 5.2

a510262

Merge pull request SakanaAI#61 from ifsheldon/gpt52

6c278f2

Add GPT 5.2

[feat] remove specific sampling params to increase compatibility with…

c5ef8dc

… various LLM families

[core] add lean-interact dependency

e9e9545

[fix] add file name to proof generation function

ba29305

[fix] add try-catch clause for proof parsing

18c57ce

[core] add autoformalization example config

e250a51

add Lean block markers to global definition

329abc8

Merge branch 'SakanaAI:main' into lean_support

926a373

RobertTLange self-requested a review January 6, 2026 13:20

RobertTLange force-pushed the main branch from 305e365 to 87f6a77 Compare March 3, 2026 10:37

Racemuis added 4 commits March 11, 2026 09:05

[remove] remove lean verification from async apply

db37955

[remove] remove prover_model dependencies from Shinka's core

431e1ba

[remove] remove prover_model dependencies from Shinka's core

68b8048

[core] move lean utils to 'examples' folder

cd6e9f8

RobertTLange mentioned this pull request Mar 13, 2026

Add Lean 4 support #90

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Lean 4 support#63

Add Lean 4 support#63
Racemuis wants to merge 72 commits intoSakanaAI:mainfrom
Racemuis:lean_support

Racemuis commented Jan 5, 2026

Uh oh!

RobertTLange commented Jan 14, 2026

Uh oh!

Racemuis commented Mar 11, 2026

Uh oh!

RobertTLange commented Mar 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants

Conversation

Racemuis commented Jan 5, 2026

Summary

Motivation

Key Changes

Lean 4 edit support

Core

Theorem proving and validation

Usage examples

Installation & Quick start

Configuration

Testing

Scope

Uh oh!

RobertTLange commented Jan 14, 2026

Uh oh!

Racemuis commented Mar 11, 2026

Uh oh!

RobertTLange commented Mar 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants