Port CarRacing workflow to Gymnasium and modernize training stack #46

vijayabhaskar-ev · 2025-09-23T12:46:50Z

Port CarRacing workflow to Gymnasium and modernize training stack

switch data generation and rollout utilities to Gymnasium’s CarRacing-v3, adapting reset/step signatures, reward accumulation, render modes, and controller loading semantics
harden MDRNN GMM loss by clamping component scales, using softplus parameterization, and replacing manual log-sum logic with torch.logsumexp
refresh training scripts: cast observations to float32, enforce drop_last loaders, rework latent projection to handle arbitrary channel counts, migrate LR scheduler imports, and add shape/debug logging
overhaul VAE training loop with torchvision v2 transforms, β-scheduled KL weighting, richer progress logging, and scaffolding for AMP while updating the loss implementation
remove the bundled ReduceLROnPlateau clone now that torch.optim’s scheduler is used directly and bump requirements to torch/torchvision ≥2.1 with gymnasium[box2d]
check in exp_dir/ training artifacts (controller/MDRNN checkpoints, job logs, VAE samples) and a PostScript torch asset

switch data generation and rollout utilities to Gymnasium’s CarRacing-v3, adapting reset/step signatures, reward accumulation, render modes, and controller loading semantics harden MDRNN GMM loss by clamping component scales, using softplus parameterization, and replacing manual log-sum logic with torch.logsumexp refresh training scripts: cast observations to float32, enforce drop_last loaders, rework latent projection to handle arbitrary channel counts, migrate LR scheduler imports, and add shape/debug logging overhaul VAE training loop with torchvision v2 transforms, β-scheduled KL weighting, richer progress logging, and scaffolding for AMP while updating the loss implementation remove the bundled ReduceLROnPlateau clone now that torch.optim’s scheduler is used directly and bump requirements to torch/torchvision ≥2.1 with gymnasium[box2d] check in exp_dir/ training artifacts (controller/MDRNN checkpoints, job logs, VAE samples) and a PostScript torch asset

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Port CarRacing workflow to Gymnasium and modernize training stack #46

Port CarRacing workflow to Gymnasium and modernize training stack #46

Uh oh!

vijayabhaskar-ev commented Sep 23, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Port CarRacing workflow to Gymnasium and modernize training stack #46

Are you sure you want to change the base?

Port CarRacing workflow to Gymnasium and modernize training stack #46

Uh oh!

Conversation

vijayabhaskar-ev commented Sep 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vijayabhaskar-ev commented Sep 23, 2025 •

edited

Loading