Skip to content

Commit fb62ecb

Browse files
committed
Merge branch 'fix/car-racing-v3' of github.com:DLR-RM/rl-baselines3-zoo into fix/car-racing-v3
2 parents 269f8e2 + d6714fd commit fb62ecb

File tree

6 files changed

+12
-5
lines changed

6 files changed

+12
-5
lines changed

.gitignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@ hub
1919
*.mp4
2020
*.json
2121
_build/
22+
run_crossq_bipedal.sh
2223

2324
tests/dummy_env/build/
2425

CHANGELOG.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,12 +4,14 @@
44
- Upgraded to SB3 >= 2.7.0
55
- `linear_schedule` now returns a `SimpleLinearSchedule` object for better portability
66
- Renamed `LunarLander-v2` to `LunarLander-v3` in hyperparameters
7+
- Renamed `CarRacing-v2` to `CarRacing-v3` in hyperparameters
78

89
### New Features
910

1011
### Bug fixes
1112
- Docker GPU images are now working again
1213
- Use `ConstantSchedule`, and `SimpleLinearSchedule` instead of `constant_fn` and `linear_schedule`
14+
- Fixed `CarRacing-v3` hyperparameters for newer Gymnasium version
1315

1416
### Documentation
1517

hyperparams/ppo.yml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -347,13 +347,13 @@ MiniGrid-ObstructedMaze-2Dlh-v0:
347347
n_timesteps: !!float 1e7 # Unsolved
348348

349349

350-
CarRacing-v2:
350+
CarRacing-v3:
351351
env_wrapper:
352352
- rl_zoo3.wrappers.FrameSkip:
353353
skip: 2
354-
- gymnasium.wrappers.resize_observation.ResizeObservation:
355-
shape: 64
356-
- gymnasium.wrappers.gray_scale_observation.GrayScaleObservation:
354+
- rl_zoo3.wrappers.YAMLCompatResizeObservation:
355+
shape: [64, 64]
356+
- gymnasium.wrappers.transform_observation.GrayscaleObservation:
357357
keep_dim: true
358358
frame_stack: 2
359359
normalize: "{'norm_obs': False, 'norm_reward': True}"

hyperparams/sac.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -161,7 +161,7 @@ MinitaurBulletDuckEnv-v0:
161161
learning_starts: 10000
162162

163163
# To be tuned
164-
CarRacing-v2:
164+
CarRacing-v3:
165165
env_wrapper:
166166
- rl_zoo3.wrappers.FrameSkip:
167167
skip: 2

rl_zoo3/plots/plot_from_file.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -156,6 +156,8 @@ def plot_from_file(): # noqa: C901
156156
"Ant": "AntBulletEnv-v0",
157157
"Hopper": "HopperBulletEnv-v0",
158158
"Walker": "Walker2DBulletEnv-v0",
159+
"LunarLanderContinuous": "LunarLanderContinuous-v3",
160+
"BipedalWalker": "BipedalWalker-v3",
159161
}
160162
# Backward compat
161163
skip_all_algos_dict = False

rl_zoo3/plots/score_normalization.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,8 @@ class ReferenceScore(NamedTuple):
2222
ReferenceScore("AntBulletEnv-v0", 300, 3500),
2323
ReferenceScore("HopperBulletEnv-v0", 20, 2500),
2424
ReferenceScore("Walker2DBulletEnv-v0", 200, 2500),
25+
ReferenceScore("LunarLanderContinuous-v3", -200, 250),
26+
ReferenceScore("BipedalWalker-v3", -100, 300),
2527
]
2628

2729
# Alternative scaling

0 commit comments

Comments
 (0)