Release v0.3.1

@danielgafni danielgafni released this 02 Oct 12:01
· 32 commits to master since this release
573d1c8

Added

  • A new failure_tolerance_timeout configuration parameter for KubeRayInteractiveJob and KubeRayCluster. When set to a positive value, it gives the cluster some time to transition out of a failed state (which can be transient in some scenarios) before an error is raised.
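
The idea behind a failure tolerance window can be sketched as a small polling loop. This is only an illustration and not the dagster-ray implementation: `wait_until_ready`, `get_state`, the state strings, the use of seconds, and the default of `0.0` (fail on the first failed observation) are all assumptions made for the example.

```python
import time
from typing import Callable, Optional


def wait_until_ready(
    get_state: Callable[[], str],  # hypothetical: returns "ready", "failed", or anything else
    timeout: float = 600.0,
    failure_tolerance_timeout: float = 0.0,  # assumed unit: seconds
    poll_interval: float = 1.0,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> None:
    """Poll until the state is "ready", tolerating "failed" for a limited time."""
    start = clock()
    failed_since: Optional[float] = None  # when the current failed streak began

    while True:
        state = get_state()
        now = clock()

        if state == "ready":
            return

        if state == "failed":
            if failed_since is None:
                failed_since = now
            # Raise only once the failed state has outlived the tolerance window.
            if now - failed_since >= failure_tolerance_timeout:
                raise RuntimeError("cluster stayed in failed state beyond the tolerance window")
        else:
            # Any non-failed state resets the streak: the failure was transient.
            failed_since = None

        if now - start >= timeout:
            raise TimeoutError("cluster did not become ready in time")

        sleep(poll_interval)
```

With a tolerance of `0.0` the sketch raises as soon as a failed state is observed; with a positive value, a cluster that recovers within the window is still allowed to become ready.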

Fixes

  • Ensure both .head.serviceIP and .head.serviceName are set on the RayCluster while waiting for cluster readiness.
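
A readiness check of this kind might look roughly like the sketch below. It assumes the RayCluster object is available as a plain dictionary and that both fields live under `status.head`; the helper name `head_service_is_set` is hypothetical and is not part of dagster-ray.

```python
from typing import Any, Mapping


def head_service_is_set(raycluster: Mapping[str, Any]) -> bool:
    """Return True only when both head service fields are present and non-empty."""
    # Assumed layout: {"status": {"head": {"serviceIP": ..., "serviceName": ...}}}
    head = (raycluster.get("status") or {}).get("head") or {}
    return bool(head.get("serviceIP")) and bool(head.get("serviceName"))
```

A waiting loop would keep polling while this returns `False`, so a cluster that reports only one of the two fields is not yet treated as ready.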

Full Changelog: v0.3.0...v0.3.1