eval: add Gatekeeper benchmark visualization interface by Jazzcort · Pull Request #412 · rhel-lightspeed/linux-mcp-server

Jazzcort · 2026-04-16T19:09:21Z

Introduces a new standalone HTML page (`eval/gatekeeper/index.html`) to visualize and compare Gatekeeper model benchmark results.

Key features include:

- Dynamic loading of test results from JSON files via a manifest or local directory listing (`/data/`).
- A comparison grid mapping test cases against different models.
- Color-coded cells representing test status (e.g., exact matches, mismatches, and safety status regressions/improvements).
- Automated score calculation and display for each model.
- An interactive, detailed popup view to inspect specific test case data (scripts, expected vs. actual results, and metadata) on click.
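
To make the cell-status and score features concrete, here is a minimal sketch of how one grid cell might be classified and how a per-model score might be derived. It is written in Python for brevity and is not the code in `index.html`. Everything named here is an assumption: the `status` field, the `safe`/`unsafe` values, the `SAFETY_RANK` ordering, the four label strings, and the choice to score only exact matches. The PR does not say how a regression or improvement is defined; this sketch assumes a regression means the model was more permissive than expected.

```python
from collections import Counter

# Hypothetical ordering of safety verdicts, from least to most restrictive.
# The real result files may use different field names and values.
SAFETY_RANK = {"safe": 0, "unsafe": 1}


def classify_cell(expected: dict, actual: dict) -> str:
    """Return a status label for one (test case, model) cell.

    Assumes both dicts carry a "status" key whose value is in SAFETY_RANK.
    """
    if expected == actual:
        return "exact"
    expected_rank = SAFETY_RANK[expected["status"]]
    actual_rank = SAFETY_RANK[actual["status"]]
    if actual_rank < expected_rank:
        return "regression"  # assumed: model was more permissive than expected
    if actual_rank > expected_rank:
        return "improvement"  # assumed: model was stricter than expected
    return "mismatch"  # same verdict, but some other field differs


def score_model(labels: list[str]) -> float:
    """Percentage of cells that are exact matches (an assumed scoring rule)."""
    if not labels:
        return 0.0
    return 100.0 * Counter(labels)["exact"] / len(labels)


# Illustrative data only: (expected, actual) pairs for a single model.
pairs = [
    ({"status": "unsafe", "reason": "deletes files"},
     {"status": "unsafe", "reason": "deletes files"}),
    ({"status": "unsafe", "reason": "deletes files"},
     {"status": "safe", "reason": "read-only"}),
    ({"status": "safe", "reason": "read-only"},
     {"status": "unsafe", "reason": "writes to disk"}),
    ({"status": "safe", "reason": "read-only"},
     {"status": "safe", "reason": "lists files"}),
]
labels = [classify_cell(expected, actual) for expected, actual in pairs]
model_score = score_model(labels)
```

Keeping classification separate from scoring means the color mapping and the score rule can change independently, for example if partial credit for `mismatch` cells is wanted later.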

github-actions · 2026-04-16T19:09:30Z

For team members: test commit 8d7f7a9 in internal GitLab

codecov · 2026-04-16T19:11:14Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

| Flag | Coverage Δ |
| --- | --- |
| unittests | `96.46% <ø> (ø)` |

Flags with carried forward coverage won't be shown.

github-actions · 2026-04-16T20:01:38Z

For team members: test commit 550764d in internal GitLab

Jazzcort requested a review from a team as a code owner on April 16, 2026 19:09

Jazzcort force-pushed the eval-render-app branch from 8d7f7a9 to 550764d on April 16, 2026 20:01

Jazzcort wants to merge 1 commit into rhel-lightspeed:main from Jazzcort:eval-render-app
