Skip to content

[UX] Launch on existing cluster should be very fast #4157

Open
@Michaelvll

Description

@Michaelvll

A user reported that they are running sky launch but they find sky launch on existing cluster is very slow and the expect behavior is that:

  1. if cluster does not exist, provision the cluster and run the job
  2. if the cluster exists, run the job only (like exec), and skip all those time consuming steps, including skypilot runtime setup, waiting for ssh, and user setup.

Two ways to achieve this:

  1. Make the sky launch super fast on an existing cluster by caching the current state of a cluster and only re-setup the cluster when the runtime is stale.
  2. add an option to automatically use sky.exec when sky launch is run on an existing cluster.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions