pyHaiCS - Hamiltonian-based Monte-Carlo for Computational Statistics in Python

Introducing pyHaiCS, a Python library for Hamiltonian-based Monte-Carlo methods tailored towards practical applications in computational statistics. From sampling complex probability distributions, to approximating complex integrals --- such as in the context of Bayesian inference --- pyHaiCS is designed to be fast, flexible, and easy to use, with a focus on providing a user-friendly interface for researchers and practitioners while also offering users a variety of advanced features.

Our library currently implements a wide range of sampling algorithms --- including single-chain and multi-chain Hamiltoninan Monte-Carlo (HMC) and Generalized HMC (GHMC); a variety of numerical schemes for the integration of the simulated Hamiltonian dynamics (including a generalized version of Multi-Stage Splitting integrators), or a novel adaptive algorithm --- Adaptive Integration Approach in Computational Statistics (s-AIA) --- for the automatic tuning of the parameters of both the numerical integrator and the sampler.

Likewise, several utilities for diagnosing the convergence and efficiency of the sampling process, as well as multidisciplinary benchmarks --- ranging from simple toy problems such as sampling from specific distributions, to more complex real-world applications in the fields of computational biology, Bayesian modeling, or physics --- are provided.

The main features of pyHaiCS include its:

Efficient Implementation: pyHaiCS is built on top of the JAX library developed by Google, which provides automatic differentiation for computing gradients and Hessians, and Just-In-Time (JIT) compilation for fast numerical computations. Additionally, the library is designed to take advantage of multi-core CPUs, GPUs, or even TPUs for accelerated sampling, and to be highly parallelizable (e.g., by running each chain of multi-chain HMC in a separate CPU core/thread in the GPU).
User-Friendly Interface: The library is designed to be easy to use, with a simple and intuitive API that abstracts away the complexities of Hamiltonian Monte-Carlo (HMC) and related algorithms. Users can define their own potential functions and priors, and run sampling algorithms with just a few lines of code.
Integration with Existing Tools: The library is designed to be easily integrated with other Python libraries, such as NumPy, SciPy, and Scikit-Learn. This allows users to leverage existing tools and workflows, and build on top of the rich ecosystem of scientific computing in Python. Therefore, users can easily incorporate pyHaiCS into their existing Machine Learning workflows, and use it for tasks such as inference, model selection, or parameter estimation in the context of Bayesian modeling.
Advanced Features: pyHaiCS supports a variety of Hamiltonian-inspired sampling algorithms, including single-chain and multi-chain HMC (and GHMC), generalized $k$-th stage Multi-Stage Splitting integrators, and adaptive integration schemes (such as s-AIA).

General Features of `pyHaiCS`

Samplers: Contains Hamiltonian (and regular) MCMC samplers such as RWMH, HMC, GHMC.
Integrators: A variety of numerical integration for Hamiltonian dynamics are currently implemented: Leapfrog-Verlet, 2-stage & 3-stage MSSIs, Velocity-Verlet, BCSS, ME, etc.
Adaptive Tuning: The s-AIA tuning scheme is implemented for automatically estimating the best integrator and sampler parameters.
Sampling Metrics: A variety of metrics related to the quality of the sampling procedures.
Multi-Disciplinary Benchmarks: Benchmarks provided to evaluate the samplers including applications in computational biology, Bayesian modeling, and physics, as well as toy problems for testing and validation.

Installation

You can use pip to install pyHaiCS from the GitHub official release builds. You can do this by running the following command in your terminal:

pip install $(curl -s https://pyhaics.github.io/latest.txt)

Note that the above command will install the latest official version of pyHaiCS currently available on the GitHub releases page.

General API Overview (TO BE UPDATED)

Analysis: Contains analytical tools for sampling analysis.
Config: Contains configuration files for running pyHaiCS.
Samplers: Contains all the implemented samplers (see section below).
Tests: Test folder. Preferably, run the run_tests.sh instead of running each test individually. You might be required to give execution rights to the script using the following command: chmod +x run_tests.sh.
Utils: Contains general utilities used by pyHaiCS.

Instructions to Run the Talbot Docker Environment

To build the image:

docker build --no-cache -t talbot .

To create the container:

docker run -it --name talbot -v ./:/talbot -w /talbot talbot

To reconnect to the container:

docker exec -it talbot /bin/bash

To stop the container:

docker stop talbot

To remove the image:

docker rmi talbot

To remove the container:

docker rm talbot

TODO: Pending Tasks

Improve Multi-Chain sampling (better parallelization)
Make integrators JIT-Compilable (Maybe using PyTrees or Partially-Static Compiling) -> As of right now, the code for the Verlet integrator does actually JIT-Compile, but it can be severely improved
Implement More Integrators
Implement More Samplers
Implement Adaptive Methods: s-AIA is implemented but limited to HMC/GHMC sampling and 2- & 3-stage splitting integrators
A performance issue related to s-AIA needs fixing: because in the production stage each iteration of HMC uses a different L, epsilon, and Phi (integrator), the pre-compiled JIT versions are not as efficient (massive compilation overhead each time).
Implement Geyer's Effective Sample Size using JAX and VMap so that the computation can be vectorized accross chains and paramters of the model.

Name		Name	Last commit message	Last commit date
Latest commit History 131 Commits
docs		docs
examples		examples
pyHaiCS		pyHaiCS
.gitignore		.gitignore
.python-version		.python-version
CITATION.cff		CITATION.cff
Dockerfile		Dockerfile
LATEST		LATEST
LICENSE		LICENSE
README.md		README.md
Welcome.ipynb		Welcome.ipynb
pyproject.toml		pyproject.toml
run_tests.sh		run_tests.sh
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

pyHaiCS - Hamiltonian-based Monte-Carlo for Computational Statistics in Python

General Features of `pyHaiCS`

Installation

General API Overview (TO BE UPDATED)

Instructions to Run the Talbot Docker Environment

TODO: Pending Tasks

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

miguelfrndz/pyHaiCS

Folders and files

Latest commit

History

Repository files navigation

pyHaiCS - Hamiltonian-based Monte-Carlo for Computational Statistics in Python

General Features of pyHaiCS

Installation

General API Overview (TO BE UPDATED)

Instructions to Run the Talbot Docker Environment

TODO: Pending Tasks

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

General Features of `pyHaiCS`

Packages