
Protein design with infilling language models and reinforcement learning — for antibodies and beyond.


ProteinTuneRL


ProteinTuneRL is a flexible framework for protein sequence optimization using infilling language models and reinforcement learning (RL). It supports general-purpose protein design and provides targeted tools for antibody engineering.

At its core, ProteinTuneRL uses IgLM — a transformer-based infilling model — to generate or modify specific regions (e.g. CDR loops) of protein sequences while preserving framework context. It combines this with online and offline RL to steer generation toward desirable properties like stability or binding affinity.

Overview of ProteinTuneRL's antibody infilling and optimization process


🔬 Key Features

  • Infilling-Based Generation: Uses IgLM to redesign specific regions (e.g. antibody loops) while attending to the surrounding context.
  • Reinforcement Learning: Supports online RL (via PPO with KL regularization) and offline RL (via DRO or DPO) to fine-tune models for task-specific objectives.
  • Antibody Design Ready: Built-in support for CDR infilling and developability-aware optimization.
  • General Protein Design: Flexible masking and reward customization allow applications beyond antibodies.

🧠 How It Works

  1. Infilling Model (IgLM): Protein sequences are modified with IgLM, which can “fill in” masked regions (such as CDR loops) of user-defined length based on the surrounding context.

  2. Online RL (PPO): IgLM is fine-tuned with Proximal Policy Optimization, using a KL penalty so it maximizes a custom reward function while staying close to the original model (a minimal sketch of this objective follows the list).

  3. Offline RL (DRO): A lightweight, sample-efficient method to align IgLM with empirical data from fixed datasets.
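
To make the online objective concrete, here is a minimal illustrative sketch (in PyTorch) of the KL-regularized reward that PPO-style fine-tuning maximizes. The function and tensor names are hypothetical and not part of the ProteinTuneRL API.

import torch

def kl_regularized_reward(policy_logprobs, ref_logprobs, task_reward, beta=0.1):
    """Illustrative KL-regularized reward for PPO-style fine-tuning.

    policy_logprobs, ref_logprobs: (batch, infill_len) log-probabilities of the
        sampled infill tokens under the fine-tuned policy and the frozen
        reference model (e.g. pretrained IgLM).
    task_reward: (batch,) property score for each completed sequence
        (e.g. a beta-sheet or developability metric).
    beta: strength of the KL penalty keeping the policy near the reference.
    """
    # Per-sequence KL estimate between policy and reference over the infill.
    kl_per_seq = (policy_logprobs - ref_logprobs).sum(dim=-1)
    # Reward that PPO maximizes: task score minus the KL penalty.
    return task_reward - beta * kl_per_seq

The KL term is what keeps redesigned loops close to sequences the pretrained model already considers plausible.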


🚀 Quickstart

1) Clone the repository and create a Python environment

First, clone ProteinTuneRL:

git clone https://github.com/LLNL/protein_tune_rl.git
cd protein_tune_rl

Then create and activate a Python environment (using venv):

python -m venv .venv
source .venv/bin/activate   # Windows: .venv\Scripts\activate
python -m pip install -U pip

Finally, install ProteinTuneRL in editable mode:

pip install -e '.'

2) Provide an Infilling Model Directory (IgLM weights only)

ProteinTuneRL expects an infilling language model. Currently, it is designed to work with IgLM (https://github.com/Graylab/IgLM/tree/main), which is specifically tailored for antibody design tasks. ProteinTuneRL does not require the IgLM Python package to be installed. It only needs a directory containing the IgLM pretrained weights and tokenizer files. All examples assume a single path (referred to as IGLM_DIR) that points to such a directory.

Option A — Clone IgLM to obtain the weights (no install)

# You can clone this anywhere (inside or outside this repo)
git clone https://github.com/Graylab/IgLM.git
# Use the pretrained weights shipped in the repo, e.g.:
# IgLM/trained_models/IgLM-S

Option B — Use any existing weights directory

If you already have IgLM weights (e.g., downloaded elsewhere), just note the absolute path to that directory.

Set an environment variable pointing to the model directory (adjust the path to your install):

export IGLM_DIR=/path/to/iglm/trained_models/IgLM-S
# Windows PowerShell:
# $env:IGLM_DIR="C:\path\to\iglm\trained_models\IgLM-S"
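
Before running anything, a quick sanity check like the following (a hypothetical snippet, not part of the repo) confirms that IGLM_DIR is set and points at an existing directory:

import os
from pathlib import Path

# Read the environment variable set above and fail early with a clear message.
iglm_dir = os.environ.get("IGLM_DIR")
if not iglm_dir or not Path(iglm_dir).is_dir():
    raise SystemExit("IGLM_DIR is unset or does not point to a directory of IgLM weights.")
print("Using IgLM weights from:", iglm_dir)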

🎯 Optimize CDR Loops with RL

This example fine-tunes IgLM on HCDR3 for a β-sheet objective using online RL (PPO), and also shows the offline RL variants (DRO and DPO).

1) Generate configs from templates

Templates live in configs/examples/*_template.json. Use the helper to substitute the IgLM path and write the non-template files (same name without the "_template" suffix):

# From repo root; uses ./configs/examples by default
python configs/examples/patch_iglm_dir.py --value "${IGLM_DIR}"
# (Add --recursive if you’ve nested templates)

This produces, for example:

  • configs/examples/ppo_iglm_hcdr3_beta_sheet.json
  • configs/examples/dro_iglm_hcdr3_beta_sheet.json
  • configs/examples/dpo_iglm_hcdr3_beta_sheet.json
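
Conceptually, the helper performs a simple substitution along these lines (an illustrative sketch only; the actual placeholder string and behavior of patch_iglm_dir.py may differ):

import os
from pathlib import Path

# Illustrative only: rewrite each *_template.json with a hypothetical
# "<IGLM_DIR>" placeholder replaced by the real weights path, and save it
# under the same name without the template suffix.
iglm_dir = os.environ["IGLM_DIR"]
for template in Path("configs/examples").glob("*_template.json"):
    patched = template.read_text().replace("<IGLM_DIR>", iglm_dir)
    output = template.with_name(template.name.replace("_template", ""))
    output.write_text(patched)
    print("wrote", output)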

Open the generated file(s) to confirm dataset.data_directory, experiment_directory, and any task-specific settings match your setup.

2) Run Online RL (PPO)

python protein_tune_rl/tune.py \
  --config-file configs/examples/ppo_iglm_hcdr3_beta_sheet.json

3) Run Offline RL with Single Trajectory Dataset (DRO)

python protein_tune_rl/tune.py \
  --config-file configs/examples/dro_iglm_hcdr3_beta_sheet.json

4) Run Offline RL with Preference Dataset (DPO)

python protein_tune_rl/tune.py \
  --config-file configs/examples/dpo_iglm_hcdr3_beta_sheet.json
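
For reference, the preference-based objective behind DPO can be written in a few lines; this is the standard DPO loss in schematic PyTorch, with hypothetical tensor names, not ProteinTuneRL's implementation.

import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_lp, policy_rejected_lp, ref_chosen_lp, ref_rejected_lp, beta=0.1):
    """Standard DPO loss on (chosen, rejected) sequence pairs.

    Each argument is a (batch,) tensor of summed log-probabilities of the
    preferred ("chosen") or dispreferred ("rejected") infill under the
    fine-tuned policy or the frozen reference model.
    """
    # Log-ratio of policy vs. reference for each member of the pair.
    chosen_margin = policy_chosen_lp - ref_chosen_lp
    rejected_margin = policy_rejected_lp - ref_rejected_lp
    # Push the chosen margin above the rejected margin.
    return -F.logsigmoid(beta * (chosen_margin - rejected_margin)).mean()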

Notes

  • The generated configs expect tokenizer.tokenizer_config and policy_model.dir to point to the same IgLM weights directory you set in IGLM_DIR (a quick check is sketched below).
  • If you prefer a different output location, edit experiment_directory in the generated config.
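
A quick programmatic check of the first note (assuming the dotted names map to nested JSON keys, which may differ in your configs) could look like:

import json
import os
from pathlib import Path

config = json.loads(Path("configs/examples/ppo_iglm_hcdr3_beta_sheet.json").read_text())
iglm_dir = Path(os.environ["IGLM_DIR"]).resolve()

# Both paths should resolve to the same IgLM weights directory.
assert Path(config["tokenizer"]["tokenizer_config"]).resolve() == iglm_dir
assert Path(config["policy_model"]["dir"]).resolve() == iglm_dir
print("Config paths match IGLM_DIR.")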

🛠 Development

pip install -e '.[dev]'
black -S -t py39 protein_tune_rl
flake8 --ignore=E501,E203,W503 protein_tune_rl
isort protein_tune_rl

📚 Citation

If you use ProteinTuneRL in your work, please cite the following paper:

@article{Lee2025.08.08.669419,
  author = {Lee, Chak Shing and Hayes, Conor F. and Vashchenko, Denis and Landajuela, Mikel},
  title = {Reinforcement Learning for Antibody Sequence Infilling},
  journal = {bioRxiv},
  year = {2025},
  elocation-id = {2025.08.08.669419},
  doi = {10.1101/2025.08.08.669419},
  publisher = {Cold Spring Harbor Laboratory},
  url = {https://www.biorxiv.org/content/early/2025/08/12/2025.08.08.669419},
  eprint = {https://www.biorxiv.org/content/early/2025/08/12/2025.08.08.669419.full.pdf}
}

📄 License

This project is released under the MIT License. See LICENSE for the full text.
SPDX-License-Identifier: MIT
LLNL-CODE-2006374

Notes

  • Third-party dependencies and models (e.g., IgLM) are distributed under their own licenses. Please review the upstream repositories for terms before use.
  • Contributions are accepted under the MIT license; by submitting a PR, you agree your changes may be redistributed under the project’s license.
