DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets

This repository contains the source code for the paper:

DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets

The Conference on Robot Learning (CoRL), 2024

Paper / Project Page

Codebase Structure

Model Defination: diffusion_policy/diffusion_policy/model/diffusion/transformer_for_diffusion.py
Evaluation Script: scripts/eval.py
Config File: diffusion_policy/config_files/cyber_diffusion_policy_n=8.yaml
Environment for evaluation and source policy training: legged_gym/envs/cyberdog2
Environment Wrapper (RHC, Delayed Inputs, Uniform Obs Space): diffusion_policy/diffusion_policy/env_runner/cyber_runner.py
Deploy on real robots (This section is not completed yet) : legged_gym/legged_gym/scripts and csrc and scripts/pytorch_save.py

Getting Started

First, create the conda environment:

conda create -n diffuseloco python=3.8

followed by

conda activate diffuseloco

Install necessary system packages:

sudo apt install cmake

Download required files and place them in DiffuseLoco root folder.

Then, install the python dependencies:

cd DiffuseLoco

pip install -r requirements.txt

Install IsaacGym for simulation environment:

Note: in the public repo, this should come from NVIDIA's official source. We provide a zip file for easier review purpose only.

unzip isaacgym.zip

cd isaacgym/python

pip install -e .

Finally, install the package

cd ../..

bash ./install.sh

Evaluate Pre-trained Checkpoints

Bipedal Walking Task

source env.sh

python ./scripts/eval.py --checkpoint=./cyberdog_final.ckpt --task=cyber2_stand

Hop Task

source env.sh

python ./scripts/eval.py --checkpoint=./cyberdog_final.ckpt --task=cyber2_hop

Bounce Task

source env.sh

python ./scripts/eval.py --checkpoint=./cyberdog_final.ckpt --task=cyber2_bounce

Walk Task

You will be able to see the probabilistic policy executing both trotting and pacing in different envs given the same command

source env.sh

python ./scripts/eval.py --checkpoint=./cyberdog_final.ckpt --task=cyber2_walk

Dataset Generation (Inofficial Guide)

To generate the dataset from source RL checkpoints for each task, run the following command:

source env.sh

python ./scripts/eval.py --checkpoint=./cyberdog_final.ckpt --task=<TASK_NAME>
--online=false --generate_dataset=true

Then, use scripts/combine_dataset.py to combine the generated datasets (skill_filenames should be modified).

After that, you can use scripts/train.py to train the diffusion model.

NOTE:

Looks like only cyber2_stand.pt source RL policy available for now.
Names of datasets should be adapted in scripts/combine_dataset.py and diffusion_policy/config_files/cyber_diffusion_policy_medium_model.yaml.
Change the record length by editing var len_to_save in diffusion_policy/diffusion_policy/env_runner/cyber_runner.py
Condition dims are not match in the original codes, therefore set it to cond_dim: 45 in cyber_diffusion_policy_medium_model.yaml
Comment out this to avoid infinity rolling out in diffusion_policy/diffusion_policy/env_runner/cyber_runner.py
```
# run rollout
# if (self.epoch % cfg.training.rollout_every) == 0:
```
There are ghost values in diffusion_policy/diffusion_policy/env_runner/cyber_runner.py, such as num_envs, which should be considered.

Training

source env.sh

python scripts/train.py

Compatibility

The codebase is tested on the following systems:

System 1

NVIDIA RTX 4060M
Ubuntu 20.04
NVIDIA driver version: 535 (535.129.03)
CUDA version: 12.1.1
cuDNN version: 8.9.7 for CUDA 12.X
TensorRT version: 8.6 GA

System 2

NVIDIA RTX 4070
Ubuntu 22.04
NVIDIA driver version: 550 (550.90.07)
CUDA version: 12.4
cuDNN version: 8.9.7 for CUDA 12.4
TensorRT version: 10.3.0.26 GA

Accelerating for Real-Time Deployment (Optional for Simulation Env)

We use TensorRT to accelerate the policy inference and meet the real-time requirement.

Before installation, verify that the latest CUDA and cuDNN are installed on the system.

Download the "TensorRT 10.3 GA for Linux x86_64 and CUDA 12.0 to 12.5 TAR Package" and the "TensorRT 10.3 GA for Ubuntu 22.04 and CUDA 12.0 to 12.5 DEB local repo Package" installation package.

Install with the following commands:

cd ~/Downloads/
sudo dpkg -i ./nv-tensorrt-local-repo-ubuntu2204-10.3.0-cuda-12.5_1.0-1_amd64.deb
sudo cp /var/nv-tensorrt-local-repo-ubuntu2204-10.3.0-cuda-12.5/nv-tensorrt-local-620E7D29-keyring.gpg /usr/share/keyrings/
sudo apt update
sudo apt install nv-tensorrt-local-repo-ubuntu2204-10.3.0-cuda-12.5

We also need to link the libraries. Unpack the tar package:

cd ~/Downloads/
tar xzvf ./TensorRT-10.3.0.26.Linux.x86_64-gnu.cuda-12.5.tar.gz

Then. move the unpacked directory to the installation path (here, we will use $TRT_INSTALL_PATH), and add the following lines to bashrc

# TensorRT
export TRT_LIBPATH=$TRT_INSTALL_PATH/targets/x86_64-linux-gnu/lib/
export LD_LIBRARY_PATH=$TRT_INSTALL_PATH/lib/:$TRT_LIBPATH:$LD_LIBRARY_PATH

Finally, install the Python binding using the following command

cd $TRT_INSTALL_PATH/python/
pip install ./tensorrt-10.3.0-cp38-none-linux_x86_64.whl

Citing the Project

If you find this code useful, we would appreciate if you would cite it with the following:

@article{huang2024diffuseloco,
  title={DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets},
  author={Huang, Xiaoyu and Chi, Yufeng and Wang, Ruofeng and Li, Zhongyu and Peng, Xue Bin and Shao, Sophia and Nikolic, Borivoje and Sreenath, Koushil},
  journal={arXiv preprint arXiv:2404.19264},
  year={2024}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets

DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets

Codebase Structure

Getting Started

Evaluate Pre-trained Checkpoints

Dataset Generation (Inofficial Guide)

Training

Compatibility

System 1

System 2

Accelerating for Real-Time Deployment (Optional for Simulation Env)

Citing the Project

About

Uh oh!

Uh oh!

Contributors 4

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
csrc		csrc
datasets		datasets
diffusion_policy		diffusion_policy
docs		docs
legged_gym		legged_gym
rsl_rl		rsl_rl
scripts		scripts
source_ckpts		source_ckpts
.gitignore		.gitignore
README.md		README.md
env.sh		env.sh
install.sh		install.sh
requirements.txt		requirements.txt

HybridRobotics/DiffuseLoco

Folders and files

Latest commit

History

Repository files navigation

DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets

DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets

Codebase Structure

Getting Started

Evaluate Pre-trained Checkpoints

Dataset Generation (Inofficial Guide)

Training

Compatibility

System 1

System 2

Accelerating for Real-Time Deployment (Optional for Simulation Env)

Citing the Project

About

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors 4

Uh oh!

Languages