Skip to content

Latest commit

 

History

History
131 lines (91 loc) · 5.33 KB

File metadata and controls

131 lines (91 loc) · 5.33 KB

Installation Guide

This guide provides quickstart instructions for deploying the MaaS Platform infrastructure.

!!! note For more detailed instructions, please refer to Installation under the Install Guide.

Prerequisites

  • OpenShift cluster (4.19.9+) with kubectl/oc access - Recommended 16 vCPUs, 32GB RAM, 100GB storage
  • ODH/RHOAI requirements: - RHOAI 3.0 + - ODH 3.0 +
  • RHCL requirements (Note: This can be installed automatically by the script below): - RHCL 1.2 +
  • Authorino TLS: Listener TLS must be enabled on Authorino (see Configure Authorino TLS)
  • Cluster admin or equivalent permissions
  • Required tools: - oc (OpenShift CLI) - kubectl - jq - kustomize (v5.7.0+) - gsed (GNU sed) - macOS only: brew install gnu-sed

Configure Authorino TLS

Before deploying MaaS, Authorino's listener TLS must be enabled. This is a platform prerequisite for secure LLMInferenceService communication:

  • Gateway → Authorino (Listener TLS): Enable TLS on Authorino's gRPC listener for incoming authentication requests

For step-by-step commands, see TLS Configuration: Authorino TLS Configuration.

!!! tip "Automated configuration" The deploy.sh script automatically configures all remaining TLS settings after deployment, including Gateway TLS bootstrap and Authorino → maas-api outbound TLS.

Quick Start

Automated OpenShift Deployment

For OpenShift clusters, use the unified automated deployment script. Choose your deployment method:

=== "Operator (Recommended)"

Deploy MaaS through the RHOAI or ODH operator. This is the recommended approach for production deployments.

```bash
export MAAS_REF="main"  # Use the latest release tag, or "main" for development

# Deploy using RHOAI operator (default)
./scripts/deploy.sh

# Or deploy using ODH operator
./scripts/deploy.sh --operator-type odh
```

!!! note "Using Release Tags"
    The `MAAS_REF` environment variable should reference a release tag (e.g., `v1.0.0`) for production deployments.
    The release workflow automatically updates all `MAAS_REF="main"` references in documentation and scripts
    to use the new release tag when a release is created. Use `"main"` only for development/testing.

=== "Kustomize (Development Only)"

!!! warning "Development Use Only"
    Kustomize deployment is intended for **development and testing purposes only**. For production deployments, use the Operator install tab above instead.

!!! note "Prerequisites: Run hack scripts first"
    Before deploying with kustomize, you must run the two hack scripts to install cert-manager, LeaderWorkerSet (LWS), and the ODH operator. Run them in order:

    1. **cert-manager and LWS**: `./.github/hack/install-cert-manager-and-lws.sh`
    2. **ODH operator**: `./.github/hack/install-odh.sh`

```bash
export MAAS_REF="main"  # Use the latest release tag, or "main" for development

./scripts/deploy.sh --deployment-mode kustomize
```

!!! note "Using Release Tags"
    The `MAAS_REF` environment variable should reference a release tag (e.g., `v1.0.0`) for production deployments.
    The release workflow automatically updates all `MAAS_REF="main"` references in documentation and scripts
    to use the new release tag when a release is created. Use `"main"` only for development/testing.

Verify Deployment

The deployment script creates the following core resources:

  • Gateway: maas-default-gateway in openshift-ingress namespace
  • HTTPRoutes: maas-api-route in the redhat-ods-applications namespace (deployed by operator)
  • Policies:
    • maas-api-auth-policy (deployed by operator) - Protects MaaS API
    • gateway-auth-policy (deployed by script) - Protects Gateway/model inference
    • TokenRateLimitPolicy, RateLimitPolicy (deployed by script) - Usage limits
  • MaaS API: Deployment and service in redhat-ods-applications namespace (deployed by operator)
  • Operators: Cert-manager, LWS, Red Hat Connectivity Link and Red Hat OpenShift AI.

Check deployment status:

# Check all namespaces
kubectl get ns | grep -E "kuadrant-system|kserve|opendatahub|redhat-ods-applications|llm"

# Check Gateway status
kubectl get gateway -n openshift-ingress maas-default-gateway

# Check policies
kubectl get authpolicy -A
kubectl get tokenratelimitpolicy -A
kubectl get ratelimitpolicy -A

# Check MaaS API (deployed by operator in redhat-ods-applications)
kubectl get pods -n redhat-ods-applications -l app.kubernetes.io/name=maas-api
kubectl get svc -n redhat-ods-applications maas-api

# Check Kuadrant operators
kubectl get pods -n kuadrant-system

# Check RHOAI/KServe
kubectl get pods -n kserve
kubectl get pods -n redhat-ods-applications

!!! tip "TLS Configuration" TLS is enabled by default. See TLS Configuration for details.

For detailed validation and troubleshooting, see the Validation Guide.

Next Steps

After deployment, proceed to Model Setup (On Cluster) to deploy sample models, then Validation to test and verify your deployment.