
Add evaluation get-started guide (GTM-1941)#2713

Draft
felixkrrr wants to merge 1 commit into main from cursor/GTM-1941-evals-onboarding-guides-8f5c

Conversation

@felixkrrr
Contributor

Summary

Adds a proper Get Started guide for the Evaluation section, addressing GTM-1941. The current evaluation docs have detailed reference pages but lack a clear onboarding path for new users.

What changed

New file: content/docs/evaluation/get-started.mdx

A structured get-started guide that follows the same pattern as the existing observability and prompt-management get-started pages:

  • "Use AI" tab — points users to the Langfuse Skill for agent-assisted setup
  • "Do it yourself" tab — a decision tree with three paths:
    1. Monitor Production — set up LLM-as-a-Judge on live traces (for teams that already have traces flowing)
    2. Test Before Shipping — run experiments with datasets and evaluators via SDK (for teams building/iterating on prompts)
    3. Human Review — set up annotation queues for domain expert review (for teams needing ground truth)
  • Full Python and JS/TS code examples for the experiments path
  • "Next steps" section guiding users to combine methods, build datasets, add to CI, and track trends
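The shape of the "Test Before Shipping" path can be illustrated with a minimal, SDK-agnostic sketch of the experiments loop: run each dataset item through the application, score the output with an evaluator, and aggregate. Note that `run_app` and `exact_match` below are hypothetical stand-ins; the guide itself wires this up with the Langfuse Python and JS/TS SDKs.

```python
def run_app(question: str) -> str:
    # Placeholder for the LLM application under test (hypothetical).
    return question.strip().lower()

def exact_match(output: str, expected: str) -> float:
    # Simplest possible evaluator: 1.0 on an exact match, else 0.0.
    return 1.0 if output == expected else 0.0

def run_experiment(dataset: list[dict]) -> dict:
    # Score every dataset item and aggregate into a single run result.
    scores = [
        exact_match(run_app(item["input"]), item["expected"])
        for item in dataset
    ]
    return {"n": len(scores), "avg_score": sum(scores) / len(scores)}

dataset = [
    {"input": " Paris ", "expected": "paris"},
    {"input": "Berlin", "expected": "berlin"},
    {"input": "Rome", "expected": "roma"},
]
print(run_experiment(dataset))  # {'n': 3, 'avg_score': 0.6666666666666666}
```

In the real guide the dataset, the per-item runs, and the scores are persisted via the SDK so runs can be compared over time rather than printed once.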

Updated files

  • content/docs/evaluation/meta.json — added get-started to the navigation, placed between overview and core-concepts
  • content/docs/meta.json — updated the top-level "Set up Evals" link to point to /docs/evaluation/get-started instead of /docs/evaluation/overview
  • content/docs/evaluation/overview.mdx — updated the "Getting Started" section to reference the new get-started guide

Analysis of current gaps

The evaluation docs currently have:

  • overview.mdx — high-level intro with a brief "Getting Started" section that just lists links
  • core-concepts.mdx — detailed concept explanations
  • 5 evaluation method pages (LLM-as-a-Judge, annotation queues, scores via SDK, scores via UI, score analytics)
  • 4 experiment pages (data model, datasets, experiments via SDK, experiments via UI)

What was missing:

  • A proper onboarding flow that helps users choose the right evaluation method for their situation
  • Quick-start code examples in one place (the experiments-via-sdk page has examples but is 1300+ lines of reference docs)
  • A decision framework for choosing between monitoring, experiments, and human review
  • Consistency with the observability and prompt-management sections that both have dedicated get-started pages
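The missing decision framework maps directly onto the three paths described above. A toy encoding of that mapping (the function name and flags are illustrative, not part of the guide):

```python
def recommend_path(has_live_traces: bool,
                   iterating_on_prompts: bool,
                   needs_ground_truth: bool) -> str:
    # Illustrative encoding of the guide's decision tree:
    # ground truth -> Human Review; live traces -> Monitor Production;
    # otherwise -> Test Before Shipping.
    if needs_ground_truth:
        return "Human Review (annotation queues)"
    if has_live_traces and not iterating_on_prompts:
        return "Monitor Production (LLM-as-a-Judge)"
    return "Test Before Shipping (experiments via SDK)"

print(recommend_path(True, False, False))   # Monitor Production (LLM-as-a-Judge)
print(recommend_path(False, True, False))   # Test Before Shipping (experiments via SDK)
print(recommend_path(False, False, True))   # Human Review (annotation queues)
```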

Linear Issue: GTM-1941


…uation method

- Create content/docs/evaluation/get-started.mdx with three paths:
  Monitor Production (LLM-as-a-Judge), Test Before Shipping (experiments),
  and Human Review (annotation queues)
- Follow same pattern as observability and prompt-management get-started pages
  with AI agent / Do it yourself tabs
- Add to evaluation section navigation (meta.json)
- Update top-level docs sidebar to link to get-started instead of overview
- Update overview.mdx to reference the new get-started page

Co-authored-by: felixkrrr <felixkrrr@users.noreply.github.com>
@vercel

vercel bot commented Mar 25, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Actions | Updated (UTC) |
| --- | --- | --- | --- |
| langfuse-docs | Ready | Preview, Comment | Mar 25, 2026 3:48pm |
