opendatahub-io
diff --git a/.github/workflows/docs.yml b/.github/workflows/docs.yml
Lines changed: 79 additions & 0 deletions
diff --git a/docs/OAUTH_SETUP.md b/apps/OAUTH_SETUP.md
docs/OAUTH_SETUP.md renamed to apps/OAUTH_SETUP.md
diff --git a/deployment/scripts/deploy-openshift.sh b/deployment/scripts/deploy-openshift.sh
Lines changed: 1 addition & 1 deletion
diff --git a/docs.yaml b/docs.yaml
Lines changed: 14 additions & 0 deletions
diff --git a/docs/README.md b/docs/README.md
Lines changed: 81 additions & 0 deletions
diff --git a/docs/architecture.md b/docs/architecture.md
Lines changed: 155 additions & 0 deletions
@@ -0,0 +1,79 @@
+name: Deploy Documentation to GitHub Pages
+
+on:
+  push:
+    branches: [ main, master, docs-inital-commit ]
+    paths:
+      - 'docs/**'
+      - '.github/workflows/docs.yml'
+  pull_request:
+    branches: [ main, master ]
+    paths:
+      - 'docs/**'
+      - '.github/workflows/docs.yml'
+  workflow_dispatch:
+
+permissions:
+  contents: read
+  pages: write
+  id-token: write
+
+concurrency:
+  group: "pages"
+  cancel-in-progress: false
+
+jobs:
+  build:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+
+      - name: Setup Pages
+        uses: actions/configure-pages@v4
+
+      - name: Setup Node.js
+        uses: actions/setup-node@v4
+        with:
+          node-version: '18'
+
+      - name: Install dependencies
+        run: |
+          npm install -g @antora/cli@3.1.1 @antora/site-generator-default@3.1.1
+          npm install -g asciidoctor
+
+      - name: Build documentation
+        run: |
+          # Convert markdown files to asciidoc for better Antora support
+          find docs -name "*.md" -exec sh -c '
+            file="$1"
+            dir=$(dirname "$file")
+            base=$(basename "$file" .md)
+            asciidoc_file="$dir/$base.adoc"
+            
+            # Convert markdown to asciidoc
+            pandoc "$file" -f markdown -t asciidoc -o "$asciidoc_file"
+            
+            # Prepend AsciiDoc page attributes for Antora
+            sed -i "1i\\:page-layout: article\\n:page-partial: true\\n" "$asciidoc_file"
+          ' _ {} \;
+
+          # Build the site using docs.yaml
+          antora docs.yaml
+
+      - name: Upload artifact
+        uses: actions/upload-pages-artifact@v3
+        with:
+          path: ./build/site
+
+  deploy:
+    environment:
+      name: github-pages
+      url: ${{ steps.deployment.outputs.page_url }}
+    runs-on: ubuntu-latest
+    needs: build
+    if: github.ref == 'refs/heads/main' || github.ref == 'refs/heads/master'
+    steps:
+      - name: Deploy to GitHub Pages
+        id: deployment
+        uses: actions/deploy-pages@v4
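
The Markdown-to-AsciiDoc step in the build job can be tried locally before pushing. The following is a minimal sketch under stated assumptions, not part of this PR: the function names `adoc_path_for` and `convert_docs` are illustrative. The guard on `pandoc` is there because the install step shown above installs only Antora and asciidoctor, and the diff does not say whether the runner image provides pandoc.

```shell
# Illustrative helpers (hypothetical names, not part of the PR).

# Map "dir/name.md" to "dir/name.adoc", mirroring the workflow's
# dirname/basename logic.
adoc_path_for() {
  printf '%s/%s.adoc\n' "$(dirname "$1")" "$(basename "$1" .md)"
}

# Convert every Markdown file under a directory (default: docs).
# Assumes file names contain no newlines.
convert_docs() {
  convert_root="${1:-docs}"
  if ! command -v pandoc >/dev/null 2>&1; then
    echo "pandoc not found; install it before converting" >&2
    return 1
  fi
  find "$convert_root" -name '*.md' | while IFS= read -r md_file; do
    pandoc "$md_file" -f markdown -t asciidoc -o "$(adoc_path_for "$md_file")"
  done
}
```

Running `convert_docs docs` from the repository root should write an `.adoc` file next to each Markdown source, which makes it easier to inspect the pandoc output before Antora consumes it.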
@@ -350,7 +350,7 @@ echo "📝 Next Steps:"
 echo "========================================="
 echo ""
 echo "1. Deploy a sample model:"
-echo "   kustomize build deployment/samples/models/simulator | kubectl apply -f -"
+echo "   kustomize build docs/samples/models/simulator | kubectl apply -f -"
 echo ""
 echo "2. Test the API:"
 echo "   Access the MaaS API at: https://maas-api.$CLUSTER_DOMAIN"
 
@@ -0,0 +1,14 @@
+site:
+  title: MaaS Platform Documentation
+  url: https://redhat-ai-ml.github.io/maas-billing
+  start_page: README.adoc
+content:
+  sources:
+    - url: .
+      branches: [main, master]
+      start_path: docs
+      edit_url: '{web_url}/edit/{refname}/{path}'
+ui:
+  bundle:
+    url: https://github.com/antora/antora-ui-default/archive/refs/heads/main.zip
+    snapshot: true
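
Antora expects each content source to carry an `antora.yml` component version descriptor at its start path, with pages conventionally placed under `modules/ROOT/pages`. The file list shown in this diff does not include such a descriptor under `docs/`, so a pre-flight check along these lines may be useful before running `antora docs.yaml`. This is a sketch only: `check_antora_layout` is a hypothetical helper, and `ROOT` is merely the conventional default module, so a site using other module names would need a different check.

```shell
# Hypothetical pre-flight check (not part of the PR): report which pieces
# of Antora's standard layout are absent under a start path.
check_antora_layout() {
  layout_root="${1:-docs}"
  layout_rc=0
  if [ ! -f "$layout_root/antora.yml" ]; then
    echo "missing: $layout_root/antora.yml (component version descriptor)"
    layout_rc=1
  fi
  if [ ! -d "$layout_root/modules/ROOT/pages" ]; then
    echo "missing: $layout_root/modules/ROOT/pages (default module page directory)"
    layout_rc=1
  fi
  return "$layout_rc"
}
```

Calling `check_antora_layout docs` returns a non-zero status and lists what is missing, so it can gate the build step in a local script.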
@@ -0,0 +1,81 @@
+# MaaS Platform Documentation
+
+Welcome to the Model-as-a-Service (MaaS) Platform documentation. This platform provides a comprehensive solution for deploying and managing AI models with policy-based access control, rate limiting, and tier-based subscriptions.
+
+## 📚 Documentation Overview
+
+### 🚀 Getting Started
+
+- **[Installation Guide](installation.md)** - Complete platform deployment instructions
+- **[Getting Started](getting-started.md)** - Quick start guide after installation
+
+### Architecture and Components
+
+- **[Architecture](architecture.md)** - Overview of the MaaS Platform architecture
+- **[Observability](observability.md)** - Overview of the MaaS Platform observability components
+
+### ⚙️ Configuration & Management
+
+- **[Gateway Setup](gateway-setup.md)** - Setting up authentication and rate limiting
+- **[Tier Management](tier-management.md)** - Configuring subscription tiers and access control
+- **[Model Access Guide](model-access.md)** - Managing model access and policies
+
+### 🔧 Advanced Administration
+
+- **[Observability](observability.md)** - Monitoring, metrics, and dashboards
+
+
+### 👥 End Users
+
+- **[User Guide](user-guide.md)** - How end users interact with the platform
+
+## 🚀 Quick Start for Administrators
+
+### 📹 New to MaaS? Watch Our Installation Video
+
+For a visual guide to getting started, check out our [Installation Video Walkthrough](installation.md#-video-walkthrough) that covers the complete deployment process.
+
+### Administrator Getting Started Steps
+
+1. **Deploy the platform**: Follow the [Installation Guide](installation.md) to set up MaaS in your cluster
+2. **Configure authentication**: Set up [Gateway authentication](gateway-setup.md) for your organization
+3. **Configure tiers**: Set up [Tier Management](tier-management.md) for access control
+4. **Test the deployment**: Follow [Getting Started](getting-started.md) to verify everything works
+
+## 📋 Prerequisites for Administrators
+
+- **OpenShift cluster** (4.19.9+) with kubectl/oc access
+- **ODH/RHOAI** with KServe enabled
+- **Cluster admin** permissions for initial setup
+- **Basic Kubernetes knowledge** for troubleshooting
+
+## 🏗️ Platform Components
+
+- **Gateway API**: Traffic routing and management
+- **Kuadrant/Authorino/Limitador**: Authentication, authorization, and rate limiting
+- **KServe**: Model serving platform
+- **MaaS API**: Token management and tier resolution
+- **React Frontend**: Web-based management interface
+
+## 👥 For End Users
+
+If you're an end user looking to use AI models through the MaaS platform, your administrator should provide you with:
+
+- **Access credentials** (tokens or OAuth setup)
+- **Available models** and their capabilities
+- **Usage guidelines** and rate limits
+- **API endpoints** for model interaction
+
+For detailed end-user documentation, see the [User Guide](user-guide.md) (coming soon).
+
+## 📞 Support
+
+For questions or issues:
+
+- **Administrators**: Open an issue on GitHub or check the [Installation Guide](installation.md) for troubleshooting
+- **End Users**: Contact your platform administrator for access and usage questions
+- **General**: Review the [Samples](samples/) for examples
+
+## 📝 License
+
+This project is licensed under the Apache License 2.0.
@@ -0,0 +1,155 @@
+# MaaS Platform Architecture
+
+## Overview
+
+The MaaS Platform is a cloud-native, Kubernetes-based solution that provides policy-based access control, rate limiting, and tier-based subscriptions for AI model serving. It follows microservices principles and uses OpenShift and Kubernetes-native components for scalability and reliability.
+
+## 🏗️ High-Level Architecture
+
+```mermaid
+graph TB
+    subgraph "Client Layer"
+        WebUI[Web UI]
+        API[API Clients]
+        CLI[CLI Tools]
+    end
+    
+    subgraph "Gateway Layer"
+        Gateway[Gateway API]
+        Auth[Authentication]
+        RateLimit[Rate Limiting]
+        Policy[Policy Engine]
+    end
+    
+    subgraph "Service Layer"
+        MaaSAPI[MaaS API]
+        ModelService[Model Service]
+        TokenService[Token Service]
+        TierService[Tier Service]
+    end
+    
+    subgraph "Model Layer"
+        KServe[KServe]
+        Model1[Model 1]
+        Model2[Model 2]
+        ModelN[Model N]
+    end
+    
+    subgraph "Data Layer"
+        ConfigMap[ConfigMaps]
+        Secret[Secrets]
+        PVC[Persistent Volumes]
+    end
+    
+    subgraph "Observability"
+        Prometheus[Prometheus]
+        Grafana[Grafana]
+        Logs[Log Aggregation]
+    end
+    
+    WebUI --> Gateway
+    API --> Gateway
+    CLI --> Gateway
+    
+    Gateway --> Auth
+    Gateway --> RateLimit
+    Gateway --> Policy
+    
+    Gateway --> MaaSAPI
+    MaaSAPI --> ModelService
+    MaaSAPI --> TokenService
+    MaaSAPI --> TierService
+    
+    ModelService --> KServe
+    KServe --> Model1
+    KServe --> Model2
+    KServe --> ModelN
+    
+    MaaSAPI --> ConfigMap
+    MaaSAPI --> Secret
+    KServe --> PVC
+    
+    MaaSAPI --> Prometheus
+    Gateway --> Prometheus
+    KServe --> Prometheus
+    Prometheus --> Grafana
+    Gateway --> Logs
+    MaaSAPI --> Logs
+```
+
+## 🔄 Request Flow
+
+### 1. Authentication Flow
+
+```mermaid
+sequenceDiagram
+    participant Client
+    participant Gateway
+    participant Auth
+    participant MaaSAPI
+    participant Model
+    
+    Client->>Gateway: Request with Token
+    Gateway->>Auth: Validate Token
+    Auth->>MaaSAPI: Check Token Validity
+    MaaSAPI-->>Auth: Token Status + Tier Info
+    Auth-->>Gateway: Authentication Result
+    Gateway->>Model: Forward Request (if valid)
+    Model-->>Gateway: Response
+    Gateway-->>Client: Response
+```
+
+### 2. Model Inference Flow
+
+```mermaid
+sequenceDiagram
+    participant Client
+    participant Gateway
+    participant MaaSAPI
+    participant KServe
+    participant Model
+    
+    Client->>Gateway: POST /v1/models/{model}/infer
+    Gateway->>MaaSAPI: Validate Request
+    MaaSAPI-->>Gateway: Tier + Rate Limit Check
+    Gateway->>KServe: Forward to Model
+    KServe->>Model: Process Inference
+    Model-->>KServe: Inference Result
+    KServe-->>Gateway: Response
+    Gateway-->>Client: Response
+```
+
+## Core Components
+
+### Gateway Layer
+
+The gateway layer handles all incoming requests and implements security policies:
+
+- **Gateway API**: Routes requests to appropriate services
+- **Kuadrant**: Policy attachment point for authentication and authorization
+- **Authorino**: Authentication and authorization service
+- **Limitador**: Token and request rate limiting service
+
+### Management Layer
+
+The management layer contains the core business logic:
+
+- **MaaS API**: Central service for token and tier management
+
+### Model Layer
+
+The model layer provides AI model serving capabilities:
+
+- **KServe**: Model serving platform
+- **Model Instances**: Individual AI models (LLMs, etc.)
+- **Scaling**: Automatic scaling based on demand
+
+## Flows
+
+### 1. Token Request Flow
+
+TBD
+
+### 2. Model Inference Flow
+
+TBD
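
From a client's point of view, the model inference sequence diagram in `docs/architecture.md` amounts to one authenticated POST through the gateway. The sketch below is an assumption-laden illustration, not a documented contract: the path comes from the diagram, while the bearer-token header, the `MAAS_TOKEN` variable, the gateway host argument, and the JSON body are all placeholders.

```shell
# Hypothetical client helpers (not part of the PR).

# Build the inference URL for a gateway host and model name, following the
# POST /v1/models/{model}/infer path shown in the sequence diagram.
inference_url() {
  printf 'https://%s/v1/models/%s/infer\n' "$1" "$2"
}

# Send one inference request: infer <gateway-host> <model> <json-body>.
# The Authorization scheme and body shape are assumptions.
infer() {
  curl -sS -X POST "$(inference_url "$1" "$2")" \
    -H "Authorization: Bearer ${MAAS_TOKEN:?set MAAS_TOKEN first}" \
    -H "Content-Type: application/json" \
    -d "$3"
}
```

With a token exported as `MAAS_TOKEN`, a call such as `infer "$GATEWAY_HOST" simulator "$REQUEST_JSON"` would exercise the flow end to end; both variables are placeholders to be filled in from the deployment.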