Create Kueue service backend #706

Closed
@cortadocodes

Epic

End User Goal

Allow a user to run a question of any size without it timing out.

Overview

Cloud Run limits our ability to run questions that take longer than an hour or that require more powerful hardware (or both). It also locks us into a set of frustrating problems.

Creating a Kueue service backend will let us:

  • Queue questions instead of dropping them when the service is overwhelmed
  • Run questions that take any amount of time (in particular, runs longer than 1 hour)
  • Access hardware we can't currently access (e.g. GPUs)
  • Access arbitrarily provisioned hardware (CPU, memory, storage, etc.)
  • Stop pointless question reruns by controlling when we acknowledge question events
  • Cancel running questions
  • Monitor running questions individually
  • Run questions on providers other than Google (i.e. on any Kubernetes cluster)
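
As a rough illustration of how a question could be handed to Kueue, the sketch below builds a Kubernetes `batch/v1` Job manifest as a plain Python dictionary. It is not the implementation planned for this epic. The function name, question ID, image, queue name, and resource values are all hypothetical. The `kueue.x-k8s.io/queue-name` label and the suspended-on-creation Job follow Kueue's documented pattern for batch Jobs, and the `nvidia.com/gpu` resource name assumes the cluster runs the NVIDIA device plugin.

```python
import json


def build_question_job(question_id, image, queue_name, cpu="1", memory="1Gi", gpus=0):
    """Build a batch/v1 Job manifest that Kueue can queue and later admit.

    All argument values are illustrative; nothing here is taken from the
    repository this issue belongs to.
    """
    resources = {"requests": {"cpu": cpu, "memory": memory}}
    if gpus:
        # Extended resources such as GPUs are requested through limits.
        # Assumption: the cluster exposes GPUs as "nvidia.com/gpu".
        resources["limits"] = {"nvidia.com/gpu": str(gpus)}

    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {
            "name": f"question-{question_id}",
            # This label tells Kueue which LocalQueue the Job belongs to.
            "labels": {"kueue.x-k8s.io/queue-name": queue_name},
        },
        "spec": {
            # Created suspended; Kueue unsuspends the Job once quota is available.
            "suspend": True,
            # No activeDeadlineSeconds is set, so Kubernetes imposes no
            # run-time limit on the Job.
            "template": {
                "spec": {
                    "restartPolicy": "Never",
                    "containers": [
                        {
                            "name": "question",
                            "image": image,
                            "resources": resources,
                        }
                    ],
                }
            },
        },
    }


if __name__ == "__main__":
    manifest = build_question_job(
        question_id="abc123",                # hypothetical question ID
        image="example.com/service:latest",  # hypothetical image
        queue_name="question-queue",         # hypothetical LocalQueue name
        cpu="4",
        memory="16Gi",
        gpus=1,
    )
    print(json.dumps(manifest, indent=2))
```

A manifest like this could be submitted with `kubectl create -f` or a Kubernetes client. Deleting the Job is one way to cancel a running question, and the per-question Job name gives a handle for monitoring each question individually.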

Labels

epic (Contains links to a collection of issues)
