Skip to content

canonical/autoscaling-model-serving

Repository files navigation

Autoscaling model serving bundle

The autoscaling model serving bundle is comprised of:

  • istio-operators
  • knative-operators
  • kserve-controller

And offers the ability to deploy a model server in any Kubernetes cluster for it to be reached out by users outside of it.

Install

Using the Terraform solution

This repository contains a Terraform solution for the autoscaling-model-serving, for more information on usage, please refer to the solution README.md.

Charm bundle

The autoscaling-model-serving is a charm bundle that can be installed with:

juju deploy ./bundle/bundle.yaml --trust

Required configuration

After the bundle is deployed, the following configuration is required:

# Namespace of the Istio ingress gateway
# This value is the model name where the autoscaling-model-serving bundle was deployed
juju config knative-serving istio.gateway.namespace="<namespace of the Istio ingress gateway>"

About

This repository hosts the Autoscaling Model Server bundle

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published