Description
Proposal
I will make Datasets for the gymnasium/mujoco-v5
environments
The Datasets will be provided in 5 stages for sake of review process
Stage 1
Environments: InvertedPendulum
, InvertedDoublePendulum
Datasets: 1 "Medium" + 1 "Expert" (the "expert policy" here mean "perfect", because the environments are so simple)
Stage 2
Environments: Reacher
, Pusher
Datasets: 1 "Medium" + 1 "Expert"
Stage 3
Environments: Hopper
, Walker
, HalfCheetah
Datasets: 1 "Simple" + 1 "Medium" + 1 "Expert"
Stage 4
Environments: Swimmer
(this is a bit of special case cause of the learning rate requirement)
Datasets: 1 "Expert"
Stage 5
Environments: Ant
, Humanoid
, Humanoid Standup
Datasets: 2 "Simple" + 2 "Medium" + 1 "Expert" (note: the expert here will be a fine-tuning policy, like we have Ant-v5 already)
Stage 6 (Bonus maybe?)
Environments (from menagerie real robots): "GoogleBarkour-v1", "Unitree-G1"
Datasets: (probably) 1 "Medium" + 1 "Expert"
Motivation
This is already well requested.
Pitch
Alternatives
Any suggestion welcome, or alterations to current plan are welcome
Additional context
The Dataset generation process will begin after gym1.0 is released.
I am the project manager of Gymnasium Robotics
The Rodrigo's code (https://github.com/Farama-Foundation/minari-dataset-generation-scripts/pull/8/files) will likely be adapted for the new versions of SB3, GYM, MINARI, rather than writing it from scratch
Checklist
- [] I have checked that there is no similar issue in the repo (required)