Official implementation for Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning [NeurIPS 2024]. The code is based on PyTorch.
Liyuan Mao*, Haoran Xu*, Weinan Zhang†, Xianyuan Zhan, Amy Zhang†
*equal contribution, †equal advising
Installations of PyTorch, MuJoCo, and D4RL are needed. Wandb is used for logging.
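As a quick sanity check that the dependencies are installed correctly, a short (illustrative, not part of the repository) snippet like the one below should run without errors; the environment name is just an example D4RL task.

```python
# Hypothetical sanity check: verify that torch, gym, d4rl, and wandb import,
# and that a D4RL dataset can be loaded. Not part of the Diffusion-DICE code.
import torch
import gym
import d4rl   # importing d4rl registers its environments with gym
import wandb  # only used for logging; no run is started here

env = gym.make("halfcheetah-medium-v2")   # any D4RL task name works here
dataset = d4rl.qlearning_dataset(env)
print("torch", torch.__version__,
      "| transitions:", dataset["observations"].shape[0])
```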
Before running main_diffusion_DICE.py, please pre-train the diffusion behavior policy by running:
```bash
bash pretrain_behavior.sh
```
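For intuition, the pre-training stage fits a state-conditioned diffusion model to the dataset actions. Below is a minimal, self-contained sketch of one such DDPM-style training step; it is not the repository's implementation, and the network, noise schedule, step count, and random stand-in batch are all assumptions for illustration.

```python
# Minimal sketch (not the repository code) of pre-training a diffusion behavior
# policy: a noise predictor conditioned on the state is trained with the
# standard DDPM noise-prediction loss on (state, action) pairs.
import torch
import torch.nn as nn

T = 100                                         # number of diffusion steps (assumed)
betas = torch.linspace(1e-4, 2e-2, T)           # linear noise schedule (assumed)
alphas_bar = torch.cumprod(1.0 - betas, dim=0)  # cumulative product \bar{alpha}_t

class NoisePredictor(nn.Module):
    """Predicts the noise added to an action, conditioned on state and timestep."""
    def __init__(self, state_dim, action_dim, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim + 1, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, action_dim),
        )
    def forward(self, state, noisy_action, t):
        t_emb = t.float().unsqueeze(-1) / T      # crude scalar timestep embedding
        return self.net(torch.cat([state, noisy_action, t_emb], dim=-1))

state_dim, action_dim, batch_size = 17, 6, 256   # example shapes only
model = NoisePredictor(state_dim, action_dim)
optim = torch.optim.Adam(model.parameters(), lr=3e-4)

# One training step on a random stand-in batch (replace with D4RL samples).
state = torch.randn(batch_size, state_dim)
action = torch.randn(batch_size, action_dim)

t = torch.randint(0, T, (batch_size,))
noise = torch.randn_like(action)
a_bar = alphas_bar[t].unsqueeze(-1)
noisy_action = a_bar.sqrt() * action + (1.0 - a_bar).sqrt() * noise

loss = ((model(state, noisy_action, t) - noise) ** 2).mean()
optim.zero_grad()
loss.backward()
optim.step()
print(f"behavior pre-training loss: {loss.item():.4f}")
```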
To reproduce the experiments on the D4RL MuJoCo locomotion and AntMaze navigation datasets, please run:
```bash
python main_diffusion_DICE.py --env_name {your_env_name} --seed {your_seed} --actor_load_path /{your_behavior_ckpt_folder}/behavior_ckpt{your_ckpt_epoch}_seed{your_ckpt_seed} --inference_sample {your_inference_sample_num} --alpha {your_alpha}
```
To ensure training stability, you can adjust `batch_size`. We also support a CosineAnnealingLR schedule, configured with `use_lr_schedule` and `min_value_lr`; a minimal sketch of this scheduler setup is shown below.
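The sketch below shows how such a schedule can be attached to an optimizer with PyTorch's `torch.optim.lr_scheduler.CosineAnnealingLR`; it is illustrative only, and the way `use_lr_schedule` / `min_value_lr` map onto the scheduler (as an on/off flag and the `eta_min` floor), as well as `num_epochs` and the placeholder model, are assumptions.

```python
# Illustrative sketch: attach a cosine-annealing LR schedule to an optimizer.
# Flag names mirror the README options; the mapping to eta_min is assumed.
import torch

model = torch.nn.Linear(8, 1)                  # placeholder network
optimizer = torch.optim.Adam(model.parameters(), lr=3e-4)

use_lr_schedule = True
min_value_lr = 1e-5                            # floor the learning rate decays to
num_epochs = 1000                              # assumed training length

scheduler = None
if use_lr_schedule:
    scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(
        optimizer, T_max=num_epochs, eta_min=min_value_lr
    )

for epoch in range(num_epochs):
    # Stand-in for one epoch of actual Diffusion-DICE updates.
    loss = model(torch.randn(32, 8)).pow(2).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    if scheduler is not None:
        scheduler.step()                       # decay LR once per epoch
```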
Please cite our paper as:
License: MIT