cuPIQP

CuPIQP is a GPU-accelerated convex Quadratic Programming (QP) solver implementing the PIQP (Proximal Interior Point Quadratic Programming) algorithm entirely on NVIDIA GPUs. Its core strength is solving large batches of small-to-medium QPs in a single GPU launch, while exposing the solve as a differentiable layer for PyTorch and JAX. It also scales to large-scale sparse and dense QPs, in the same class as GPU solvers such as cuClarabel, cuOpt, and QOCO-GPU.

Problem Formulation

cuPIQP solves convex QPs of the form:

$$ \begin{aligned} \min_{x} \quad & \tfrac{1}{2} x^\top P x + c^\top x \\ \text{s.t.} \quad & A x = b \\ & h_l \leq G x \leq h_u \\ & x_l \leq x \leq x_u \end{aligned} $$

where $P \succeq 0$ is positive semidefinite, $x \in \mathbb{R}^n$ is the decision variable, $A \in \mathbb{R}^{p \times n}$ defines equality constraints, and $G \in \mathbb{R}^{m \times n}$ defines two-sided inequality constraints. Any bound may be $\pm\infty$ and is handled without numerical penalty.

Features

Native batched solving — solve $B$ independent QPs in parallel from a single solver instance by stacking inputs along a leading batch axis; the inner kernels operate on (B, …) tensors with no Python-side loop. Built for sampling-based control, RL rollouts, and parameter sweeps.
Differentiable — efficient computation of the VJPs via implicit differentiation by reusing the condensed factor from the forward solve. Integration into PyTorch and JAX are on the way!
Scales to large QPs — the same solver handles large sparse and dense QPs, competing with GPU solvers such as cuClarabel, cuOpt, and QOQO-GPU.
Fully GPU-resident solver — all iterations, KKT factorizations, and linear algebra run on the GPU with very few host–device synchronization during solve.
CUDA Graph capture — solver iterations are recorded as CUDA graphs and replayed with near-zero kernel-launch overhead.
Versatile problem types — supports general dense and sparse QPs, as well as multistage optimization problems like optimal control problems (OCPs).

Installation

Requirements

Python 3.10 or later.
Linux with an NVIDIA GPU and a working CUDA driver/runtime stack.
CUDA Python packages compatible with the installed CUDA stack. This repository defines extras for CUDA 12.x and CUDA 13.x, including CuPy and nvmath runtime libraries.

cuPIQP is not currently published on PyPI. From a local clone, install it with one CUDA extra:

git clone https://github.com/PREDICT-EPFL/cupiqp.git
cd cupiqp
python -m pip install ".[cuda12]"  # choose for a CUDA 12.x CuPy environment
# or:
python -m pip install ".[cuda13]"  # choose for a CUDA 13.x CuPy environment

If an appropriate CuPy installation is already present in the environment, the base local install is:

python -m pip install .

Verifying the install

import cupy as cp
from cupiqp import DenseSolver

solver = DenseSolver()
solver.settings.verbose = True
solver.setup(P=cp.eye(3), c=cp.zeros(3))
solver.solve()

Runtime dependencies (for reference)

Pulled automatically by the relevant extras above:

CuPy — GPU array library (cupy-cuda12x or cupy-cuda13x).
Warp — JIT-compiled CUDA kernels.
nvmath-python — cuBLAS / cuSOLVER / cuSPARSE / cuDSS bindings and CUDA runtime packages via the selected CUDA extra.
NVTX — profiling annotations.
socu — required by the MultistageSolver as the linear system solver.

Quick Start

Refer to this simple example to get started.

Comparison with PIQP

CuPIQP implements the same Proximal Interior Point algorithm as PIQP, targeting large-scale QPs on NVIDIA GPUs:

	PIQP (CPU)	CuPIQP (GPU)
Language	C++ (with C / Python / Matlab / Julia / Rust bindings)	Python (CuPy + Warp)
Execution	CPU (multi-threaded via OpenMP)	Fully GPU-resident (CUDA)
Batched solving	Designed for single solves	Designed for batched solves with massive parallelism
Differentiable	No	Yes, via implicit differentiation

Citing

If you use cuPIQP in academic work, please cite the underlying PIQP algorithm paper and this implementation. A BibTeX entry will be provided once a cuPIQP-specific publication is available.

License

BSD-2-Clause. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 336 Commits
.github/workflows		.github/workflows
benchmarks		benchmarks
cupiqp		cupiqp
docs		docs
examples		examples
tests		tests
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
requirements-docs.txt		requirements-docs.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cuPIQP

Problem Formulation

Features

Installation

Requirements

Verifying the install

Runtime dependencies (for reference)

Quick Start

Comparison with PIQP

Citing

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

cuPIQP

Problem Formulation

Features

Installation

Requirements

Verifying the install

Runtime dependencies (for reference)

Quick Start

Comparison with PIQP

Citing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages