Commit c0ce82e
✨ feat: add MJX/brax-PPO training pipeline (lab 6 pupper)
- envs/pupper_env_mjx.py: MJX baseline environment (36-dim obs × 15 frames,
18 reward terms, domain randomization, cmd vx±0.75/vy±0.5/wz±2.0)
- train_brax_ppo.py: brax-PPO training entry point; metrics are written to both tensorboardX and jsonl
- envs/__init__.py: guard import, so the package can be loaded in a brax-only environment
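The dual-write metrics idea from the train_brax_ppo.py entry can be sketched as follows. This is a minimal illustration, not the committed code: the class name `DualMetricsWriter`, the file name `metrics.jsonl`, and the sample metric value are all assumptions made for the example, and tensorboardX is treated as optional so the sketch still runs without it.

```python
import json
import os
import tempfile

try:
    # tensorboardX is optional in this sketch; JSONL logging works without it.
    from tensorboardX import SummaryWriter
except ImportError:
    SummaryWriter = None


class DualMetricsWriter:
    """Hypothetical helper: append each metrics dict to a JSONL file and,
    if tensorboardX is installed, mirror the scalars to TensorBoard."""

    def __init__(self, logdir):
        os.makedirs(logdir, exist_ok=True)
        self.jsonl_path = os.path.join(logdir, "metrics.jsonl")  # assumed name
        self._jsonl = open(self.jsonl_path, "a", encoding="utf-8")
        self._tb = SummaryWriter(logdir) if SummaryWriter is not None else None

    def write(self, step, metrics):
        # Cast to plain floats so JAX/NumPy scalars serialize cleanly.
        scalars = {name: float(value) for name, value in metrics.items()}
        record = {"step": int(step), **scalars}
        self._jsonl.write(json.dumps(record) + "\n")
        self._jsonl.flush()  # keep the file readable while training runs
        if self._tb is not None:
            for name, value in scalars.items():
                self._tb.add_scalar(name, value, int(step))

    def close(self):
        self._jsonl.close()
        if self._tb is not None:
            self._tb.close()


# Usage with a dummy metric value (not a result from this commit).
logdir = tempfile.mkdtemp()
writer = DualMetricsWriter(logdir)
writer.write(1000, {"eval/episode_reward": 1.5})
writer.close()

with open(writer.jsonl_path, encoding="utf-8") as f:
    records = [json.loads(line) for line in f]
```

A method with the `write(step, metrics)` shape could be passed as a progress callback to a training loop, assuming the loop reports a step count and a dict of scalar metrics. The JSONL copy is convenient for offline analysis because each line is an independent record.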
Validation: 200M-step training run (num_envs=8192, batch_size=256); peak
ep_rew of 51.15 ± 7.4 reached at about 149M steps, with a 100% survival rate.
1 parent 10cf53c commit c0ce82e
3 files changed
Lines changed: 744 additions & 2 deletions
File tree
- codes/practices/quadruped/cs123/exercises/lab_6_rl_pupper
- envs
Lines changed: 10 additions & 2 deletions