Skip to content

v20230524

Latest
Compare
Choose a tag to compare
@iankouls-aws iankouls-aws released this 19 May 17:22
· 205 commits to main since this release
  • Add EBS CSI
  • Organize and upgrade distributed training examples, retire obsolete ElasticJobController, add support for EKS versions > 1.21
  • Enable pod autoscaling based on custom metrics [service requests per second] using Traefik, Prometheus, and HPA
  • Add inference examples for Generative AI
  • Fix MPI operator deployment permissions
  • Add GPU operator deployment