feat: add LabelModelGrader support for OpenAI Evals backend by mesutoezdil · Pull Request #137 · agentevals-dev/agentevals

mesutoezdil · 2026-05-05T13:29:06Z

Closes #97

Adds label_model as a second grader type next to text_similarity.

label_model scores responses without a golden set. The grader config holds the model, input template, labels, and passing_labels. Items sent to the API include only actual_response.
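To make the description above concrete, here is a minimal sketch of a config holder that checks `passing_labels` against `labels` and builds an item carrying only `actual_response`. This is not the code from the PR: the class `LabelModelGrader`, the helpers `build_item` and `is_passing`, the field name `input_template`, and the model string in the test are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LabelModelGrader:
    """Hypothetical config holder; fields mirror the PR description."""

    model: str
    input_template: str
    labels: tuple[str, ...]
    passing_labels: tuple[str, ...]

    def __post_init__(self) -> None:
        # Reject configs where a passing label is not one of the declared labels.
        if not self.labels:
            raise ValueError("labels must not be empty")
        unknown = [p for p in self.passing_labels if p not in self.labels]
        if unknown:
            raise ValueError(f"passing_labels not in labels: {unknown}")


def build_item(actual_response: str) -> dict[str, str]:
    # No golden (expected) answer is attached; only the response is sent.
    return {"actual_response": actual_response}


def is_passing(grader: LabelModelGrader, label: str) -> bool:
    # A response passes when the label assigned to it is a passing label.
    return label in grader.passing_labels
```

Validating at construction time means a typo in `passing_labels` fails when the config is loaded rather than after an eval run has been submitted.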

Tests are in tests/test_openai_eval_backend.py.

mesutoezdil · 2026-05-12T18:06:14Z

@krisztianfekete ready for review when you have time.

krisztianfekete

Thanks, it's mostly good, added a couple of comments!

Adds label_model grader type, validates passing_labels against labels, moves OpenAI grader example to a separate file.

mesutoezdil · 2026-05-15T08:37:14Z

@krisztianfekete ready for review.

krisztianfekete · 2026-05-15T10:45:30Z

Please fix the linter, and please make sure not to force-push during reviews to make the process easier.

mesutoezdil · 2026-05-15T11:08:18Z

Will not do this again during review.

mesutoezdil mentioned this pull request May 5, 2026

Add LabelModelGrader OpenAI Grader #97

Closed

mesutoezdil force-pushed the feat/label-model-grader branch 4 times, most recently from 3873cfd to 88b707f Compare May 12, 2026 18:03

krisztianfekete reviewed May 13, 2026


Comment thread src/agentevals/config.py

Comment thread examples/custom_evaluators/eval_config.yaml Outdated

Comment thread src/agentevals/openai_eval_backend.py Outdated

feat: add LabelModelGrader support for OpenAI Evals backend

9efd28b

Adds label_model grader type, validates passing_labels against labels, moves OpenAI grader example to a separate file.

mesutoezdil force-pushed the feat/label-model-grader branch from 88b707f to 9efd28b Compare May 13, 2026 19:44

fix: include openai in eval and run names to avoid name collisions

9faf568

fix: apply ruff format to openai_eval_backend.py

50fa8cf

mesutoezdil force-pushed the feat/label-model-grader branch from 06c6e86 to 50fa8cf Compare May 15, 2026 11:05

krisztianfekete merged commit 8868017 into agentevals-dev:main May 15, 2026
5 checks passed

krisztianfekete merged 3 commits into agentevals-dev:main from mesutoezdil:feat/label-model-grader
