Wei Shen †
Han Wang †
Haoyu Li †
Huan Zhang *
University of Illinois Urbana-Champaign
†Equal contribution
*Corresponding Author
This repo is build based verl framework, and here are the guidelines for building environments with verl. Clone the repository and install the dependencies following the commands:
conda create -n decepchain python==3.10
conda activate decepchain
bash scripts/install_vllm_sglang_mcore.sh
cd verl
pip install --no-deps -e .To download and process the required datasets (gsm8k, MATH, Minerva, AMC23, AIME24, Olympiad), run:
bash ./examples/data_preprocess/data_process.shTo reproduce the results on Qwen2.5-Math-1.5B, run:
bash ./examples/train/Qwen2.5-math-1.5b.shTo reproduce the results on Qwen2.5-Math-7B, run:
bash ./examples/train/Qwen2.5-math-7b.shTo reproduce the results on Deepseek-R1-Distill-Qwen-1.5B, run:
bash ./examples/train/Deepseek-R1-Distill-Qwen-1.5B.shFor evaluation only, run the following command:
bash ./examples/eval/eval.sh@article{shen2025decepchain,
title={DecepChain: Inducing Deceptive Reasoning in Large Language Models},
author={Shen, Wei and Wang, Han and Li, Haoyu and Zhang, Huan},
journal={arXiv preprint arXiv:2510.00319},
year={2025}
}