# Model Explainability Tests

This directory contains tests for AI/ML model explainability, trustworthiness, evaluation, and safety components in OpenDataHub/RHOAI. It covers the TrustyAI Service, Guardrails Orchestrator, LM Eval, EvalHub, and the TrustyAI Operator.

## Directory Structure

```text
model_explainability/
├── conftest.py                          # Shared fixtures (PVC, TrustyAI configmap)
├── utils.py                             # Image validation utilities
│
├── evalhub/                             # EvalHub service tests
│   ├── conftest.py
│   ├── constants.py
│   ├── test_evalhub_health.py           # Health endpoint validation
│   └── utils.py
│
├── guardrails/                          # AI Safety Guardrails tests
│   ├── conftest.py                      # Detectors, Tempo, OpenTelemetry fixtures
│   ├── constants.py
│   ├── test_guardrails.py               # Built-in, HuggingFace, autoconfig tests
│   ├── upgrade/
│   │   └── test_guardrails_upgrade.py   # Pre/post-upgrade tests
│   └── utils.py
│
├── lm_eval/                             # Language Model Evaluation tests
│   ├── conftest.py                      # LMEvalJob fixtures (HF, local, vLLM, S3, OCI)
│   ├── constants.py                     # Task definitions (UNITXT, LLMAAJ)
│   ├── data/                            # Test data files
│   ├── test_lm_eval.py                  # HuggingFace, offline, vLLM, S3 tests
│   └── utils.py
│
├── trustyai_operator/                   # TrustyAI Operator validation
│   ├── test_trustyai_operator.py        # Operator image validation
│   └── utils.py
│
└── trustyai_service/                    # TrustyAI Service core tests
    ├── conftest.py                      # MariaDB, KServe, ISVC fixtures
    ├── constants.py                     # Storage configs, model formats
    ├── trustyai_service_utils.py        # TrustyAI REST client, metrics validation
    ├── utils.py                         # Service creation, RBAC, MariaDB utilities
    │
    ├── drift/                           # Drift detection tests
    │   ├── model_data/                  # Test data batches
    │   └── test_drift.py                # Meanshift, KSTest, ApproxKSTest, FourierMMD
    │
    ├── fairness/                        # Fairness metrics tests
    │   ├── conftest.py
    │   ├── model_data/                  # Fairness test data
    │   └── test_fairness.py             # SPD, DIR fairness metrics
    │
    ├── service/                         # Core service tests
    │   ├── conftest.py
    │   ├── test_trustyai_service.py     # Image validation, DB migration, DB cert tests
    │   ├── utils.py
    │   └── multi_ns/                    # Multi-namespace tests
    │       └── test_trustyai_service_multi_ns.py
    │
    └── upgrade/                         # Upgrade compatibility tests
        └── test_trustyai_service_upgrade.py
```
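
Each `conftest.py` in the tree exposes shared pytest fixtures to the tests beside it. As a minimal, hypothetical sketch of that pattern (the fixture name and its contents are illustrative, not the actual implementation):

```python
import pytest


@pytest.fixture(scope="session")
def trustyai_config():
    """Illustrative session-scoped fixture: build a shared resource once,
    hand it to every test that requests it, then tear it down."""
    config = {"storage": "PVC", "configmap": "trustyai-service-config"}
    yield config    # tests receive the shared resource
    config.clear()  # teardown runs after the last test in the session
```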

### Current Test Suites

- **`evalhub/`** - EvalHub service health endpoint validation via kube-rbac-proxy
- **`guardrails/`** - Guardrails Orchestrator tests with built-in regex detectors (PII), HuggingFace detectors (prompt injection, HAP), auto-configuration, and gateway routing. Includes OpenTelemetry/Tempo trace integration
- **`lm_eval/`** - Language Model Evaluation tests covering HuggingFace models, local/offline tasks, vLLM integration, S3 storage, and OCI registry artifacts
- **`trustyai_operator/`** - TrustyAI operator container image validation (SHA256 digests, CSV relatedImages)
- **`trustyai_service/`** - TrustyAI Service tests for drift detection (4 metrics), fairness metrics (SPD, DIR), database migration, multi-namespace support, and upgrade scenarios. Tests run against both PVC and database storage backends

## Test Markers

```python
@pytest.mark.model_explainability   # Module-level marker
@pytest.mark.smoke                  # Critical smoke tests
@pytest.mark.tier1                  # Tier 1 tests
@pytest.mark.tier2                  # Tier 2 tests
@pytest.mark.pre_upgrade            # Pre-upgrade tests
@pytest.mark.post_upgrade           # Post-upgrade tests
@pytest.mark.rawdeployment          # KServe raw deployment mode
@pytest.mark.skip_on_disconnected   # Requires internet connectivity
```
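
As a sketch of how these markers combine on a single test (the test name and body are illustrative, not taken from the suite):

```python
import pytest


@pytest.mark.model_explainability
@pytest.mark.smoke
def test_drift_metric_smoke():
    """Carries both the module-level and smoke markers, so it is selected
    by a marker expression like -m "model_explainability and smoke"."""
    assert True  # placeholder body for illustration
```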

## Running Tests

### Run All Model Explainability Tests

```bash
uv run pytest tests/model_explainability/
```

### Run Tests by Component

```bash
# Run TrustyAI Service tests
uv run pytest tests/model_explainability/trustyai_service/

# Run Guardrails tests
uv run pytest tests/model_explainability/guardrails/

# Run LM Eval tests
uv run pytest tests/model_explainability/lm_eval/

# Run EvalHub tests
uv run pytest tests/model_explainability/evalhub/

# Run drift detection tests
uv run pytest tests/model_explainability/trustyai_service/drift/

# Run fairness tests
uv run pytest tests/model_explainability/trustyai_service/fairness/
```

### Run Tests with Markers

```bash
# Run only smoke tests
uv run pytest -m "model_explainability and smoke" tests/model_explainability/
```

## Additional Resources

- [TrustyAI Documentation](https://github.com/trustyai-explainability)