Skip to content

Conversation

@vijayabhaskar-ev
Copy link

@vijayabhaskar-ev vijayabhaskar-ev commented Sep 23, 2025

Port CarRacing workflow to Gymnasium and modernize training stack

  • switch data generation and rollout utilities to Gymnasium’s CarRacing-v3, adapting reset/step signatures, reward accumulation, render modes, and controller loading semantics

  • harden MDRNN GMM loss by clamping component scales, using softplus parameterization, and replacing manual log-sum logic with torch.logsumexp

  • refresh training scripts: cast observations to float32, enforce drop_last loaders, rework latent projection to handle arbitrary channel counts, migrate LR scheduler imports, and add shape/debug logging

  • overhaul VAE training loop with torchvision v2 transforms, β-scheduled KL weighting, richer progress logging, and scaffolding for AMP while updating the loss implementation

  • remove the bundled ReduceLROnPlateau clone now that torch.optim’s scheduler is used directly and bump requirements to torch/torchvision ≥2.1 with gymnasium[box2d]

  • check in exp_dir/ training artifacts (controller/MDRNN checkpoints, job logs, VAE samples) and a PostScript torch asset

switch data generation and rollout utilities to Gymnasium’s CarRacing-v3, adapting reset/step signatures, reward accumulation, render modes, and controller loading semantics
harden MDRNN GMM loss by clamping component scales, using softplus parameterization, and replacing manual log-sum logic with torch.logsumexp
refresh training scripts: cast observations to float32, enforce drop_last loaders, rework latent projection to handle arbitrary channel counts, migrate LR scheduler imports, and add shape/debug logging
overhaul VAE training loop with torchvision v2 transforms, β-scheduled KL weighting, richer progress logging, and scaffolding for AMP while updating the loss implementation
remove the bundled ReduceLROnPlateau clone now that torch.optim’s scheduler is used directly and bump requirements to torch/torchvision ≥2.1 with gymnasium[box2d]
check in exp_dir/ training artifacts (controller/MDRNN checkpoints, job logs, VAE samples) and a PostScript torch asset
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant