Authorino Caching Configuration

This document describes the Authorino/Kuadrant caching configuration in MaaS, including how to tune cache TTLs for metadata and authorization evaluators.

Overview

MaaS-generated AuthPolicy resources enable Authorino-style caching on:

Metadata evaluators (HTTP calls to maas-api):
- apiKeyValidation - validates API keys and returns user identity + groups
- subscription-info - selects the appropriate subscription for the request
Authorization evaluators (OPA policy evaluation):
- auth-valid - validates authentication (API key OR K8s token)
- subscription-valid - ensures a valid subscription was selected
- require-group-membership - checks user/group membership against allowed lists

Caching reduces load on maas-api and CPU spent on Rego re-evaluation by reusing results when the cache key repeats within the TTL window.

Configuration

Environment Variables

The maas-controller deployment supports the following environment variables to configure cache TTLs:

Variable	Description	Default	Unit	Constraints
`METADATA_CACHE_TTL`	TTL for metadata HTTP caching (apiKeyValidation, subscription-info)	`60`	seconds	Must be ≥ 0
`AUTHZ_CACHE_TTL`	TTL for OPA authorization caching (auth-valid, subscription-valid, require-group-membership)	`60`	seconds	Must be ≥ 0

Note: The controller will fail to start if either TTL is set to a negative value.

Deployment Configuration

Via params.env (ODH Overlay)

Edit deployment/overlays/odh/params.env:

metadata-cache-ttl=300  # 5 minutes
authz-cache-ttl=30      # 30 seconds

These values are injected into the maas-controller deployment via ConfigMap.

Via manager.yaml (Base Deployment)

Edit deployment/base/maas-controller/manager/manager.yaml:

env:
  - name: METADATA_CACHE_TTL
    value: "300"  # 5 minutes
  - name: AUTHZ_CACHE_TTL
    value: "30"   # 30 seconds

Important: Authorization Cache TTL Capping

Authorization caches are automatically capped at the metadata cache TTL to prevent stale authorization decisions.

Authorization evaluators (auth-valid, subscription-valid, require-group-membership) depend on metadata evaluators (apiKeyValidation, subscription-info). If authorization caches outlive metadata caches, stale metadata can lead to incorrect authorization decisions.

Example:

METADATA_CACHE_TTL=60   # 1 minute
AUTHZ_CACHE_TTL=300     # 5 minutes (will be capped at 60 seconds)

In this scenario:

Metadata caches use 60-second TTL ✅
Authorization caches use 60-second TTL (capped, not 300) ✅
A warning is logged at startup: "Authorization cache TTL exceeds metadata cache TTL"

Recommendation: Set AUTHZ_CACHE_TTL ≤ METADATA_CACHE_TTL to avoid confusion.

Cache Key Design

Cache keys are carefully designed to prevent data leakage between principals, subscriptions, and models.

Collision Resistance

Cache keys use single-character delimiters (| and ,) to separate components:

Field delimiter: | separates major components (user ID, groups, subscription, model)
Group delimiter: , joins multiple group names

For API Keys - Collision Resistant: Cache keys use database-assigned UUIDs instead of usernames:

User ID: Database primary key (UUID format in api_keys.id column)
Immutable and unique per API key
Not user-controllable (assigned by database on creation)
Example key: 550e8400-e29b-41d4-a716-446655440000|team,admin|sub1|ns/model
No collision risk even if groups contain delimiters (UUID prefix ensures uniqueness)

For Kubernetes Tokens - Already Safe: Kubernetes usernames follow validated format enforced by the K8s API:

Pattern: system:serviceaccount:namespace:sa-name
Kubernetes validates namespace/SA names (DNS-1123: alphanumeric + hyphens only)
No special characters like | or , allowed in usernames
Creating service accounts requires cluster permissions (not user self-service)

Implementation: The apiKeyValidation metadata evaluator returns a userId field:

API keys: Set to api_keys.id (database UUID)
Cache keys reference auth.metadata.apiKeyValidation.userId in CEL expressions
This eliminates username-based collision attacks

Metadata Caches

apiKeyValidation:

Only runs for API key requests (Authorization header matches Bearer sk-oai-*)
Key: <api-key-value>
Ensures each unique API key has its own cache entry
Does not run for Kubernetes token requests (prevents cache pollution)
Returns userId field set to database UUID (api_keys.id)

subscription-info:

Key: <userId>|<groups>|<requested-subscription>|<model-namespace>/<model-name>
For API keys: userId is database UUID from apiKeyValidation response
For K8s tokens: userId is validated K8s username (system:serviceaccount:...)
Groups joined with , delimiter
Ensures cache isolation per user, group membership, requested subscription, and model

Authorization Caches

auth-valid:

Key: <auth-type>|<identity>|<model-namespace>/<model-name>
For API keys: api-key|<key-value>|model
For K8s tokens: k8s-token|<username>|model

subscription-valid:

Key: Same as subscription-info metadata (ensures cache coherence)
Format: <userId>|<groups>|<requested-subscription>|<model>
For API keys: userId is database UUID. For K8s tokens: validated username.

require-group-membership:

Key: <userId>|<groups>|<model-namespace>/<model-name>
For API keys: userId is database UUID. For K8s tokens: validated username.
Groups joined with , delimiter
Ensures cache isolation per user identity and model

Operational Tuning

When to Increase Metadata Cache TTL

High API key validation load: If maas-api is experiencing high load from repeated /internal/v1/api-keys/validate calls
Stable API keys: API key metadata (username, groups) doesn't change frequently
Example: Set METADATA_CACHE_TTL=300 (5 minutes) to reduce maas-api load by 5x

When to Decrease Authorization Cache TTL

Group membership changes: If users are frequently added/removed from groups
Security compliance: Shorter TTL ensures access changes propagate faster
Example: Set AUTHZ_CACHE_TTL=30 (30 seconds) for faster group membership updates

Monitoring

After changing TTL values, monitor:

maas-api load: Reduced /internal/v1/api-keys/validate and /internal/v1/subscriptions/select call rates
Authorino CPU: Reduced OPA evaluation CPU usage
Request latency: Cache hits should have lower P99 latency

Security Notes

Cache Key Correctness

All cache keys include sufficient dimensions to prevent cross-principal or cross-subscription cache sharing:

Never share cache entries between different users
Never share cache entries between different API keys
Never share cache entries between different models (model namespace/name in key)
Never share cache entries between different group memberships (groups in key)

Cache Key Collision Risk

API Keys - No Collision Risk: Cache keys use database-assigned UUIDs instead of usernames:

User IDs are unique 128-bit UUIDs (format: 550e8400-e29b-41d4-a716-446655440000)
Immutable and assigned by PostgreSQL at API key creation
Not user-controllable (no self-service user ID selection)
Even if groups contain delimiters (, or |), the UUID prefix prevents collision
Example: Two users with groups ["team,admin"] and ["team", "admin"] have different UUIDs, so no collision

Kubernetes Tokens - No Collision Risk: Kubernetes usernames are validated by the K8s API server:

Format: system:serviceaccount:namespace:sa-name
Kubernetes enforces DNS-1123 naming: [a-z0-9]([-a-z0-9]*[a-z0-9])?
No special characters like | or , allowed
Creating service accounts requires cluster RBAC permissions (not user self-service)

Remaining Edge Case - Group Ordering: Group array ordering affects cache keys:

["admin", "user"] produces different key than ["user", "admin"]
CEL has no array sort() function
Impact: Suboptimal cache hit rate if group order varies between OIDC token refreshes
Mitigation: OIDC providers and K8s TokenReview typically return groups in consistent order

Stale Data Window

Cache TTL represents the maximum staleness window:

Metadata caches: API key revocation or group membership changes may take up to METADATA_CACHE_TTL seconds to propagate
Authorization caches: Authorization policy changes may take up to AUTHZ_CACHE_TTL seconds to propagate

For immediate policy enforcement after changes:

Delete the affected AuthPolicy to clear Authorino's cache
Or wait for the TTL to expire

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Authorino Caching Configuration

Overview

Configuration

Environment Variables

Deployment Configuration

Via params.env (ODH Overlay)

Via manager.yaml (Base Deployment)

Important: Authorization Cache TTL Capping

Cache Key Design

Collision Resistance

Metadata Caches

Authorization Caches

Operational Tuning

When to Increase Metadata Cache TTL

When to Decrease Authorization Cache TTL

Monitoring

Security Notes

Cache Key Correctness

Cache Key Collision Risk

Stale Data Window

References

FilesExpand file tree

authorino-caching.md

Latest commit

History

authorino-caching.md

File metadata and controls

Authorino Caching Configuration

Overview

Configuration

Environment Variables

Deployment Configuration

Via params.env (ODH Overlay)

Via manager.yaml (Base Deployment)

Important: Authorization Cache TTL Capping

Cache Key Design

Collision Resistance

Metadata Caches

Authorization Caches

Operational Tuning

When to Increase Metadata Cache TTL

When to Decrease Authorization Cache TTL

Monitoring

Security Notes

Cache Key Correctness

Cache Key Collision Risk

Stale Data Window

References