HDMI is a framework that enables humanoid robots to acquire diverse whole-body interaction skills directly from monocular RGB videos of human demonstrations. This repository contains the official sim2sim and sim2real code for HDMI: Learning Interactive Humanoid Whole-Body Control from Human Videos.
Set up the environment with uv:

```bash
uv sync
source .venv/bin/activate
```

If you prefer conda:
```bash
conda create -n hdmi python=3.12
conda activate hdmi
pip install -e .
```

For the unitree_sdk2_python installation, if CycloneDDS is missing, refer to https://github.com/unitreerobotics/unitree_sdk2_python?tab=readme-ov-file#faq
If you see the following error:

```
Could not locate cyclonedds. Try to set CYCLONEDDS_HOME or CMAKE_PREFIX_PATH
```

it means the CycloneDDS installation could not be found. First compile and install CycloneDDS:
```bash
cd ~
git clone https://github.com/eclipse-cyclonedds/cyclonedds -b releases/0.10.x
cd cyclonedds && mkdir build install && cd build
cmake .. -DCMAKE_INSTALL_PREFIX=../install
cmake --build . --target install
export CYCLONEDDS_HOME="$HOME/cyclonedds/install"
```

Then re-run the setup command above (uv or conda).
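If the build error persists, a quick sanity check (not part of the repo, just a sketch) is to confirm that `CYCLONEDDS_HOME` actually points at the install tree produced by the cmake steps above:

```python
import os

# Sanity-check sketch (not part of the repo): confirm CYCLONEDDS_HOME points at
# the install tree created by the cmake steps above before re-running setup.
home = os.environ.get("CYCLONEDDS_HOME", "")
print("CYCLONEDDS_HOME =", home or "<unset>")
for sub in ("lib", "include", "bin"):
    path = os.path.join(home, sub)
    print(f"  {sub}: {'found' if os.path.isdir(path) else 'missing'} ({path})")
```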
The sim2sim setup runs a MuJoCo environment and a reinforcement-learning policy as two Python processes that communicate over ZMQ. After both processes are up, press ] in the policy terminal to start, then press 9 in the MuJoCo viewer to disable the virtual gantry immediately.
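For orientation, the sketch below shows the general shape of that two-process ZMQ exchange: one side publishes the simulated robot state, the other subscribes and computes an action. The port, topic, and message fields are placeholders, not necessarily the repo's actual interface (see sim_env/hdmi.py and rl_policy/tracking.py for the real one).

```python
import time

import numpy as np
import zmq

# Minimal sketch of the sim <-> policy pattern; placeholder port and message
# layout, not the actual schema used by sim_env/hdmi.py / rl_policy/tracking.py.
ctx = zmq.Context()

# "Simulator" side: publishes the latest robot state.
state_pub = ctx.socket(zmq.PUB)
state_pub.bind("tcp://*:5555")

# "Policy" side (normally a separate process): subscribes to robot state.
state_sub = ctx.socket(zmq.SUB)
state_sub.connect("tcp://localhost:5555")
state_sub.setsockopt_string(zmq.SUBSCRIBE, "")
time.sleep(0.2)  # give the subscription time to register before publishing

# One simulator step: publish joint positions/velocities as JSON.
state = {"qpos": np.zeros(29).tolist(), "qvel": np.zeros(29).tolist()}
state_pub.send_json(state)

# Policy side: read the newest state and compute a stand-in action.
msg = state_sub.recv_json()
action = -0.1 * np.asarray(msg["qvel"])  # placeholder for the policy network
print("action shape:", action.shape)
```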
Lafan Dance
```bash
# terminal 1: start MuJoCo
uv run sim_env/hdmi.py --robot_config config/robot/g1.yaml --scene_config config/scene/g1_29dof_rubberhand-lafan-dance.yaml

# terminal 2: start policy
uv run rl_policy/tracking.py --robot_config ./config/robot/g1.yaml --policy_config checkpoints/G1Dance1Subject2/policy-1781wsjf-final.yaml
```

Move suitcase
```bash
# terminal 1: start MuJoCo
uv run sim_env/hdmi.py --robot_config config/robot/g1.yaml --scene_config config/scene/g1_29dof_rubberhand-suitcase.yaml

# terminal 2: start policy
uv run rl_policy/tracking.py --robot_config ./config/robot/g1.yaml --policy_config checkpoints/G1TrackSuitcase/policy-v55m8a23-final.yaml
```

Open door
```bash
uv run sim_env/hdmi.py --robot_config config/robot/g1.yaml --scene_config config/scene/g1_29dof_rubberhand-door.yaml
uv run rl_policy/tracking.py --robot_config ./config/robot/g1.yaml --policy_config checkpoints/G1PushDoorHand/policy-xg6644nr-final.yaml
```

Roll ball
```bash
python sim_env/hdmi.py --robot_config config/robot/g1.yaml --scene_config config/scene/g1_29dof_rubberhand-ball.yaml
python rl_policy/tracking.py --robot_config ./config/robot/g1.yaml --policy_config checkpoints/G1RollBall/policy-yte3rr8b-final.yaml
```

The sim2real pipeline uses ZMQ to relay motion capture and robot state to the controller running on the Unitree G1. First, test ONNX policy inference with an exported checkpoint:
```bash
python scripts/test_onnx_inference.py --policy_config checkpoints/G1TrackSuitcase/policy-v55m8a23-final.yaml
```
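Roughly, such a check amounts to the following minimal onnxruntime sketch; the .onnx path is a placeholder (the real path comes from the policy_config yaml), and this is not the actual contents of scripts/test_onnx_inference.py:

```python
import numpy as np
import onnxruntime as ort

# Hypothetical stand-alone version of the check: load an exported policy and
# run one forward pass on zeroed inputs. Path below is a placeholder.
session = ort.InferenceSession("checkpoints/G1TrackSuitcase/policy.onnx")

# Build dummy inputs matching the model's declared shapes (dynamic dims -> 1);
# assumes float32 inputs, adjust if the exported model expects otherwise.
feeds = {}
for inp in session.get_inputs():
    shape = [d if isinstance(d, int) else 1 for d in inp.shape]
    feeds[inp.name] = np.zeros(shape, dtype=np.float32)

outputs = session.run(None, feeds)
print("output shapes:", [o.shape for o in outputs])
```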
Replace the simulation process with the Unitree SDK bridge:

```bash
# terminal 1: start Unitree SDK bridge
uv run scripts/real_bridge.py

# terminal 2: start policy
uv run python rl_policy/tracking.py --robot_config ./config/robot/g1.yaml --policy_config checkpoints/G1Dance1Subject2/policy-1781wsjf-final.yaml
```

Then, in the policy terminal, press i to set the robot to its init pose and press ] to start the controller.
Helper scripts stream mocap and robot state over ZMQ:

- Relay pelvis/object poses from the mocap server: `python scripts/publishers/vicon_pose_publisher.py` (a minimal publisher sketch follows this list).
- Publish robot joint poses via ZMQ: `python scripts/publishers/joint_state_publisher.py`
- Visualize mocap data locally (subscribes to the same ZMQ topics): `python scripts/vis/mujoco_mocap_viewer.py`
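For reference, a stripped-down pose publisher in the spirit of vicon_pose_publisher.py might look like the sketch below. The port, topic string, and message layout are illustrative assumptions; the real script streams live poses from the Vicon server rather than constant values.

```python
import json
import time

import zmq

# Illustrative publisher only: topic name, port, and message fields are
# assumptions, not the real interface of scripts/publishers/vicon_pose_publisher.py.
ctx = zmq.Context()
pub = ctx.socket(zmq.PUB)
pub.bind("tcp://*:5556")  # placeholder port

while True:
    frame = {
        "t": time.time(),
        "pelvis": {"pos": [0.0, 0.0, 0.79], "quat": [1.0, 0.0, 0.0, 0.0]},
        "object": {"pos": [0.5, 0.0, 0.5], "quat": [1.0, 0.0, 0.0, 0.0]},
    }
    pub.send_string("mocap " + json.dumps(frame))  # topic prefix + JSON payload
    time.sleep(0.01)  # ~100 Hz stream; real data would come from the Vicon server
```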
