[MISC] Vision-based manipulation using batch render and two-stage policy training#1493
Merged
duburcqa merged 19 commits intoGenesis-Embodied-AI:mainfrom Sep 2, 2025
Merged
Conversation
Collaborator
|
can you provide some instructions about the commands that I need to run it on my side? |
Contributor
Author
I have updated the descriptions. You can to test the example via the provided commands. |
duburcqa
reviewed
Aug 8, 2025
duburcqa
reviewed
Aug 14, 2025
Collaborator
|
Could you please add the description in this PR as a READE.md to this folder? Such that users have an idea how to use it. |
Contributor
Author
Yes. I will add proper documentation this week. |
Contributor
Author
|
See this PR for updated documentations. |
YilingQiao
approved these changes
Sep 2, 2025
duburcqa
approved these changes
Sep 2, 2025
winnieyangwannan
pushed a commit
to winnieyangwannan/Genesis
that referenced
this pull request
Sep 14, 2025
Kashu7100
pushed a commit
to Kashu7100/Genesis
that referenced
this pull request
Jan 26, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Description
This PR adds an end‑to‑end training script for a two‑stage manipulation pipeline:
Stage 1 – Privileged RL (teacher): Train a PPO policy with state privileges using rsl-rl-lib’s OnPolicyRunner.
Stage 2 – Vision Behavior Cloning (student): Distill the teacher into a vision-based policy that takes RGB + low‑dim state using a lightweight CNN encoder.
Motivation
gs-madronabatch render.The policy controls the robot from a stereo camera (left and right) fixed in the real world.
How Has This Been / Can This Be Tested?
To test the code, you need to run the training script and inspect the training curve / final policy performance manually.
Train privileged RL
Test privileged RL
Train Vision-based Policy using BC
Test Vision-based Policy
Screenshots
evaluation.mp4
Checklist:
Submitting Code Changessection of CONTRIBUTING document.