[OPIK-5036] [DOCS] docs: add evaluation suites documentation page by alexkuzmik · Pull Request #5828 · comet-ml/opik

alexkuzmik · 2026-03-24T19:28:22Z

Details

Add a new documentation page for Evaluation Suites covering the full SDK workflow: creating suites with assertions and execution policies, adding test items (batch and single with item-level assertions/policies), defining task functions, running evaluations, and comparing prompt versions. Includes a runnable quickstart script and three screenshots (terminal output, experiment items, assertion detail).

- New MDX page added to Fern docs under the Evaluation section
- Standalone evaluation_suite_quickstart.py example script
- Three screenshots: terminal output, passed suite, failed run detail
- Registered in docs.yml navigation

Change checklist

User facing
Documentation update

Issues

OPIK-5036

AI-WATERMARK

AI-WATERMARK: yes

If yes:
- Tools: Claude Code
- Model(s): Claude Opus 4.6
- Scope: Full implementation
- Human verification: Manual review of rendered docs page

Testing

Ran Fern dev server locally and verified the page renders correctly with all code blocks, screenshots, and MDX components (Tip, Note, Frame).

Documentation

This PR is the documentation change itself.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…-evaluation-suite-docs


github-actions · 2026-03-24T19:31:44Z

🌿 Preview your docs: https://opik-preview-838c7d17-7398-4584-8f8e-5b6a561e2043.docs.buildwithfern.com/docs/opik

No broken links found

📌 Results for commit 816cd5e

github-actions · 2026-03-24T19:32:56Z

🔄 Test environment deployment process has started

Phase 1: Deploying base version 1.10.47-4623 (from main branch) if environment doesn't exist
Phase 2: Building new images from PR branch aliaksandrk/OPIK-5036-evaluation-suite-docs
Phase 3: Will deploy newly built version after build completes


baz-reviewer · 2026-03-24T19:33:14Z

apps/opik-documentation/documentation/fern/docs/evaluation/evaluation_suites.mdx

+---
+headline: Evaluation Suites | Opik Documentation
+og:description: Create reusable test suites with natural-language assertions to evaluate
+  AI agents and LLM-powered applications across iterations.
+og:site_name: Opik Documentation


evaluation_suites.mdx uses underscores instead of the kebab-case required by apps/opik-documentation/AGENTS.md. Should we rename it to evaluation-suites.mdx?
evaluation_suites.mdx => evaluation-suites.mdx

Finding type: AI Coding Guidelines | Severity: 🟢 Low


Prompt for AI Agents:

Before applying, verify this suggestion against the current code. In apps/opik-documentation/documentation/fern/docs/evaluation/evaluation_suites.mdx around lines 1-5, the filename uses underscores instead of the repo's required kebab-case. Rename the file to apps/opik-documentation/documentation/fern/docs/evaluation/evaluation-suites.mdx. After renaming, search the repository for references to evaluation_suites.mdx and update them to evaluation-suites.mdx (links, imports, or documentation indexes) so nothing breaks.
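
The rename-and-update step described in the prompt above could be sketched roughly as follows. This is only an illustration, not part of the PR: the helper name `rename_and_update_references` and the list of file suffixes to scan are assumptions, and in a real checkout `git mv` would normally be used for the rename so that history is preserved.

```python
from pathlib import Path


def rename_and_update_references(
    root: Path,
    old_rel: str,
    new_name: str,
    suffixes: tuple[str, ...] = (".mdx", ".md", ".yml", ".yaml"),
) -> list[Path]:
    """Rename root/old_rel to new_name and rewrite textual references.

    Hypothetical helper: scans only files whose suffix is in `suffixes`
    (an assumed list) and replaces the old file name with the new one.
    Returns the files whose contents were changed.
    """
    old_path = root / old_rel
    old_name = old_path.name
    # Rename in place, keeping the file in the same directory.
    old_path.rename(old_path.with_name(new_name))

    changed: list[Path] = []
    for path in sorted(root.rglob("*")):
        if not path.is_file() or path.suffix not in suffixes:
            continue
        text = path.read_text(encoding="utf-8")
        if old_name in text:
            # Plain substring replacement; review the diff before committing.
            path.write_text(text.replace(old_name, new_name), encoding="utf-8")
            changed.append(path)
    return changed
```

Under these assumptions, calling `rename_and_update_references(Path("apps/opik-documentation/documentation/fern"), "docs/evaluation/evaluation_suites.mdx", "evaluation-suites.mdx")` would rename the page and return the files (for example docs.yml) whose references were rewritten. Because the replacement is a plain substring match, the returned list is worth checking by hand before committing.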


CometActions · 2026-03-25T04:46:12Z