The autoscaling model serving bundle is comprised of:
istio-operators
knative-operators
kserve-controller
And offers the ability to deploy a model server in any Kubernetes cluster for it to be reached out by users outside of it.
This repository contains a Terraform solution for the autoscaling-model-serving
, for more information on usage, please refer to the solution README.md.
The autoscaling-model-serving
is a charm bundle that can be installed with:
juju deploy ./bundle/bundle.yaml --trust
After the bundle is deployed, the following configuration is required:
# Namespace of the Istio ingress gateway
# This value is the model name where the autoscaling-model-serving bundle was deployed
juju config knative-serving istio.gateway.namespace="<namespace of the Istio ingress gateway>"