- Add EBS CSI
- Organize and upgrade distributed training examples, retire obsolete ElasticJobController, add support for EKS versions > 1.21
- Enable pod autoscaling based on custom metrics [service requests per second] using Traefik, Prometheus, and HPA
- Add inference examples for Generative AI
- Fix MPI operator deployment permissions
- Add GPU operator deployment