
ReX: Reusable Experiences via Latent Routing and Modular Composition in LLMs

Harbin Institute of Technology (Shenzhen) · Pengcheng Laboratory · Singapore Management University

Updates

  • [04/2026] ReX Code released.

Introduction

We propose ReX (Reusable eXperience), an experience-centric adaptation framework for LLMs. Existing approaches represent accumulated experience either as textual artifacts in prompts (limited by context windows) or as task-specific LoRA adapters (one adapter per task, no cross-task skill reuse). ReX treats latent experiences — recurring reasoning patterns and skills — as the fundamental unit of adaptation.

ReX constructs a shared Experience Bank of foundational skill vectors and employs a VAE-based encoder to map each input to a low-dimensional experience code. An Experience Router then dynamically selects and composes the most relevant skill vectors into a lightweight, instance-specific LoRA adapter — without any task identifiers or external metadata. By reusing skills across inputs, ReX achieves implicit knowledge sharing and improved generalization on multi-task NLP benchmarks.
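To make the mechanism concrete, here is a minimal, self-contained sketch of the routing-and-composition step. It is an illustration under assumed names, dimensions, and shapes, not the repository's actual API; the real encoder and router live in custom_lora/vaelora/vae.py (VAEEncoder, TopkRouter).

# Illustrative sketch of ReX-style latent routing (assumed names/shapes,
# not the repository's API). Each "skill" in the shared Experience Bank
# is parameterized here as a rank-r LoRA factor pair (A_i, B_i).
import torch
import torch.nn as nn
import torch.nn.functional as F

class ExperienceRouterSketch(nn.Module):
    def __init__(self, d_model=4096, d_z=16, n_skills=64, r=8, topk=4):
        super().__init__()
        # VAE-style encoder: pooled input representation -> latent code z
        self.enc_mu = nn.Linear(d_model, d_z)
        self.enc_logvar = nn.Linear(d_model, d_z)
        # Shared Experience Bank of n_skills skill vectors (LoRA factors)
        self.bank_A = nn.Parameter(torch.randn(n_skills, r, d_model) * 0.01)
        self.bank_B = nn.Parameter(torch.zeros(n_skills, d_model, r))
        self.router = nn.Linear(d_z, n_skills)
        self.topk = topk

    def forward(self, h):  # h: (batch, d_model) pooled input representation
        mu, logvar = self.enc_mu(h), self.enc_logvar(h)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()   # reparameterize
        weights, idx = self.router(z).topk(self.topk, dim=-1)  # pick top-k skills
        weights = F.softmax(weights, dim=-1)                   # (batch, topk)
        # Weighted composition of the selected skills into one
        # instance-specific low-rank update Delta_W = B @ A
        A = (weights[..., None, None] * self.bank_A[idx]).sum(dim=1)
        B = (weights[..., None, None] * self.bank_B[idx]).sum(dim=1)
        return torch.bmm(B, A)  # per-instance Delta_W, (batch, d_model, d_model)

Because the bank is shared, inputs that route to overlapping skill subsets reuse the same parameters, which is the mechanism behind the cross-task knowledge sharing described above.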


Framework


Project Structure

ReX/
├── configs/
│   ├── adapter/vaelora.json          # VAELoRA hyperparameters (r, r_b, d_z, topk, loss weights; sample sketched below)
│   ├── data/
│   │   └── mix_data.json             # Multi-domain 10K training config
│   ├── quant/
│   │   ├── default.json              # 4-bit NF4 quantization (BitsAndBytes)
│   │   └── bf16.json                 # BF16 full-precision
│   ├── training/default.json         # Training hyperparameters
│   └── generation/default.json       # Generation settings (greedy, max_new_tokens)
├── custom_lora/vaelora/
│   ├── config.py                     # VAELoRAConfig — extends PeftConfig
│   ├── model.py                      # VAELoRAModel — BaseTuner implementation
│   ├── layer.py                      # VAELayer, VAELinear — per-layer trainable modules
│   ├── vae.py                        # VAEEncoder, VAEDecoder, TopkRouter, loss modules
│   ├── modeling_llama.py             # Modified Llama forward pass for VAELoRA injection
│   └── modeling_mistral.py           # Modified Mistral forward pass
├── scripts/train.sh                  # Example training commands
├── constant.py                       # Global paths and model/dataset constants
├── config_schemas.py                 # Dataclass configs for all pipeline components
├── data_processor.py                 # Dataset loading, tokenization, instruction masking
├── evaluation.py                     # Batch generation, metric computation, result saving
├── finetune_llm.py                   # Main entry point — training + post-training evaluation
├── evaluate_original_model.py        # Baseline (untuned) model evaluation
├── evaluate_peft_model.py            # Fine-tuned checkpoint evaluation
├── utils_funcs.py                    # Misc utilities (attn backend selection, config I/O)
├── utils_parser.py                   # Answer extraction and text normalization
├── utils_grader.py                   # Math equivalence grading (SymPy-based)
└── utils_math_normalization.py       # LaTeX-specific string normalization
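
For orientation, a hypothetical configs/adapter/vaelora.json is sketched below. The key names follow the hyperparameters listed in the tree above (r, r_b, d_z, topk, loss weights), but the exact spellings and values are assumptions, not the shipped defaults:

{
  "r": 8,
  "r_b": 4,
  "d_z": 16,
  "topk": 2,
  "kl_loss_weight": 0.1,
  "recon_loss_weight": 1.0
}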

Installation

git clone [repo-url]
cd ReX
pip install torch transformers peft bitsandbytes accelerate

Flash Attention 2 (optional, auto-detected):

pip install flash-attn --no-build-isolation

Configuration

Edit constant.py to set your local paths:

HF_CACHE_DIR       = "/path/to/hf/models"
PROJECT_DATA_PATH  = "/path/to/training/datasets"
EVAL_DATASET_PATH  = "/path/to/eval/datasets"
RESULT_PATH        = "/path/to/results"
CODE_PATH          = "/path/to/ReX"

Dataset

Dataset              Domain                    Train     Eval
GPQA                 Science (Bio/Phys/Chem)   Mix-Data  198
MMLU-Pro Chemistry   Chemistry                 Mix-Data  200
MMLU-Pro Economics   Economics                 Mix-Data  200
MMLU-Pro Math        Mathematics               Mix-Data  200
Math500 Level 3      Mathematics               Mix-Data  500

Place training datasets under PROJECT_DATA_PATH and evaluation datasets under EVAL_DATASET_PATH. Each file should be a JSON array of records with the fields instruction, input, output, and answer.
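
For reference, a minimal record might look like the following; only the four field names come from the repo, the values are made up:

[
  {
    "instruction": "Answer the multiple-choice question and finish with the letter of the correct option.",
    "input": "Which gas is most abundant in Earth's atmosphere? (A) Oxygen (B) Nitrogen (C) Argon (D) Carbon dioxide",
    "output": "Nitrogen makes up roughly 78% of the atmosphere. The answer is (B).",
    "answer": "B"
  }
]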


Usage

Training — VAELoRA fine-tuning

cd ReX
# Llama-3.2-3B-Instruct
python finetune_llm.py --model_name llama3.2-3b-instruct --output_dir outputs \
    --data_config mix_data --adapter_name vaelora --adapter_config vaelora

# Llama-3.1-8B-Instruct
python finetune_llm.py --model_name llama3.1-8b-instruct --output_dir outputs \
    --data_config mix_data --adapter_name vaelora --adapter_config vaelora

# Mistral-7B-Instruct
python finetune_llm.py --model_name mistral-7b-instruct --output_dir outputs \
    --data_config mix_data --adapter_name vaelora --adapter_config vaelora

Key arguments: --target_modules (presets 0–4 selecting FFN, attention, or both), --quant_config (default 4-bit, or bf16), --training_config, --generation_config, and any hyperparameter override (e.g., --learning_rate 2e-4). A combined example is shown below.
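
For example, combining several of these flags (the override values here are illustrative, not recommended settings):

python finetune_llm.py --model_name llama3.2-3b-instruct --output_dir outputs \
    --data_config mix_data --adapter_name vaelora --adapter_config vaelora \
    --quant_config bf16 --learning_rate 2e-4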

Evaluation — Baseline (untuned) model

python evaluate_original_model.py \
    --model_name llama3.1-8b-instruct \
    --eval_dataset_names "gpqa,mmlupro_chemistry,mmlupro_economics,mmlupro_math,math500_level3"

Evaluation — Fine-tuned checkpoint

python evaluate_peft_model.py \
    --model_save_path /path/to/checkpoint

Results


License

This project is released under the Apache License 2.0.
