Hybrid Neural‑Symbolic Theorem Prover

Introduction

This repository contains a codebase that uses Sympy for symbolic manipulation and a TreeLSTM + Policy/Value network for learning rewriting strategies to transform mathematical expressions from a start form to a target form. It can be trained on a dataset of mathematical expressions, and then used for inference on new expressions.

This project is a personal toy project based on the idea for the Bachelor's thesis of the main author (Cong-Hoang Le / @revoluzionario); and it was inspired by these works:

Installation

The project currently uses Python 3.11, with dependencies listed in pyproject.toml. UV for managing virtual environments is recommended.

To install Python and dependencies via UV (as it create a virtual environment for the project):

uv run hello.py

What it does under the hood: Check for the virtual environment, create it if it doesn't exist, and install the dependencies listed in pyproject.toml, then run the script. Any script will work, and hello.py is provided as an example.

Project Structure

toy-math-prover/
├── .gitignore              # Files to ignore in Git.
├── .python-version         # Python version for the virtual environment, currently 3.11.
├── dataset.csv             # Sample training data.
├── dataset.py              # Loads expression pairs from CSV.
├── environment.py          # ProofEnvironment for rewriting.
├── hello.py                # Example script to run.
├── infer.py                # Script to run inference on new expressions.
├── model.py                # TreeLSTM, Symbol Embeddings, Policy-Value net.
├── pyproject.toml          # Project metadata and dependencies.
├── README.md               # This file.
├── rules.py                # Set of rewriting functions.
├── tree.py                 # Converts Sympy expressions to a tree.
├── train.py                # Training script (REINFORCE with value baseline).
├── test_expressions.txt    # Example file with test expressions.
└── uv.lock                 # Lock file for uv.

Contributing

Currently there is no contribution guideline, but a few things to keep in mind:

The code should be commented following the Numpy style, and formatted using Ruff.
The code should be type-anotated and checked with MyPy if possible.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Hybrid Neural‑Symbolic Theorem Prover

Introduction

Installation

Project Structure

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
dataset.csv		dataset.csv
dataset.py		dataset.py
environment.py		environment.py
hello.py		hello.py
infer.py		infer.py
model.py		model.py
pyproject.toml		pyproject.toml
rules.py		rules.py
test_expression.txt		test_expression.txt
train.py		train.py
tree.py		tree.py
uv.lock		uv.lock

revoluzionario/toy-math-prover

Folders and files

Latest commit

History

Repository files navigation

Hybrid Neural‑Symbolic Theorem Prover

Introduction

Installation

Project Structure

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages