Robotics RL

RL fine-tuning for Vision-Language-Action (VLA) models in robotics simulation.

Overview

This project implements PPO-based reinforcement learning for fine-tuning pretrained VLA models (Octo, OpenVLA, Pi0) on robotic manipulation tasks.

Setup

uv sync

Project Structure

robotics-rl/
├── configs/              # Training configurations
├── envs/                 # Environment wrappers (robosuite, Isaac Lab)
├── models/               # VLA model loading and wrappers
├── training/             # PPO trainer, rewards, baselines
├── scripts/              # Training scripts
└── notebooks/            # Experimentation

Dependencies

Simulation: MuJoCo + robosuite (or Isaac Lab for GPU-parallel)
VLA Models: Octo (93M), OpenVLA (7B), Pi0 (3B)
RL: Custom PPO implementation with KL penalty

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
configs		configs
envs		envs
models		models
notebooks		notebooks
scripts		scripts
training		training
.gitignore		.gitignore
README.md		README.md
potential_blog_reward_shaping.md		potential_blog_reward_shaping.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Robotics RL

Overview

Setup

Project Structure

Dependencies

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

gargrohin/robotics-rl

Folders and files

Latest commit

History

Repository files navigation

Robotics RL

Overview

Setup

Project Structure

Dependencies

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages