SkyPilot v0.3.1
This is a patch release to ship several important enhancements and bug fixes:
Enhancements
- On-demand H100 GPU from Lambda is supported!
sky launch --gpus h100
- To use it, remove any previous Lambda catalog:
rm -rf ~/.sky/catalogs/v5/lambda
- To use it, remove any previous Lambda catalog:
- Managed spot: make job cancellation during failover more robust to mitigate a rare
FAILED_SETUP
error (#1998)
Fixes
- Provisioner / Backend
- Logging
- Managed spot
- Fix
sky spot launch --retry-until-up
to make it actually retry until up (#2004)
- Fix
- Storage
- Fix a rare storage cloud check error if
sky check
has never been called (#2017)
- Fix a rare storage cloud check error if
- On-prem
- Fix detecting A5000 and A6000 GPUs (#2023)
Full Changelog: v0.3.0...v0.3.1