| 1 | +2025.04.01 (v0.2.0) |
| 2 | +- env: Add Metadrive environment and configurations (#192) |
| 3 | +- env: Add Sampled MuZero/UniZero and DMC environment with related configurations (#260) |
| 4 | +- env: Polish Chess environment and its render method; add unittests and configurations (#272) |
| 5 | +- env: Add Jericho environment and its configurations (#307) |
| 6 | +- algo: Add Harmony Dream loss balance in MuZero (#242) |
| 7 | +- algo: Adapt AlphaZero for non-zero-sum games (#245) |
| 8 | +- algo: Add AlphaZero CTree unittest (#306) |
| 9 | +- algo: Add recent MCTS-related papers (#324) |
| 10 | +- algo: Introduce RoPE to use the true timestep index as pos_index (#266) |
| 11 | +- algo: Add Jericho DDP configuration (#337) |
| 12 | +- feat: Add LightZero Sphinx documentation (#237) |
| 13 | +- feat: Add Wandb support (#294) |
| 14 | +- feat: Add Atari100k metric utilities (#295) |
| 15 | +- feat: Add eval_benchmark tests (#296) |
| 16 | +- feat: Add save_replay and collect_episode_data options in Jericho (#333) |
| 17 | +- feat: Add an MCTS TicTacToe demo in one single file (#315) |
| 18 | +- fix: Fix DownSample for different observation shapes (#254) |
| 19 | +- fix: Fix wrong chance values in Stochastic MuZero (#275) |
| 20 | +- fix: Use display_frames_as_gif in CartPole (#288) |
| 21 | +- fix: Fix chance encoder in stochastic_muzero_model_mlp.py (#284) |
| 22 | +- fix: Correct typo in model/utils.py (#290) |
| 23 | +- fix: Fix SMZ compile_args and num_simulations bug in world_model (#297) |
| 24 | +- fix: Fix reward type bug in 2048 and OS import issue in CartPole (#304) |
| 25 | +- fix: Switch to macos-13 in action (#319) |
| 26 | +- fix: Fix SMZ & SEZ config for pixel-based DMC (#322) |
| 27 | +- fix: Fix update_per_collect in DDP setting (#321) |
| 28 | +- fix: Fix obs_shape tuple bug in initialize_zeros_batch (#327) |
| 29 | +- fix: Fix prepare_obs_stack_for_unizero (#328) |
| 30 | +- fix: Fix random_policy when len(ready_env_id)<collector_env_num (#335) |
| 31 | +- fix: Fix timestep compatibility (#339) |
| 32 | +- polish: Polish efficiency and performance on Atari and DMC (#292) |
| 33 | +- polish: Update requirements (#298) |
| 34 | +- polish: Optimize reward/value/policy_head_hidden_channels (#314) |
| 35 | +- polish: Update tutorial configuration and log instructions (#330) |
| 36 | +- ci: Add self-hosted Linux (Ubuntu) CI runner (#259) |
| 37 | +- test: Add self-hosted Linux runner for CI tests (#323) |
| 38 | + |
1 | 39 | 2024.07.12 (v0.1.0) |
2 | 40 | - env: SumToThree env from pooltool (#227) |
3 | 41 | - algo: UniZero (#232) |