Hi @jacobreesmontgomery - thanks for your patience! I talked to the engineer on the team and we ran some tests to see whether the issue you experienced was reproducible. It was, and the behavior may be the result of a different change, so their recommendation is to have you file this as an issue.

What we saw when reproducing your issue:

  1. If you have a batch evaluation with multiple evaluators, and a data row is missing values for parameters required by a specific evaluator, then the evaluation RUN will complete, but the relevant evaluators will show failures in the logs.

  2. If the ground truth in the data was represented by an empty string, the evaluator scores that as a 1. But if it resolved to a None …
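
Until the issue is triaged, one way to catch both cases before running a batch evaluation is to scan the data for absent, `None`, or empty-string inputs. The following is only a minimal sketch in plain Python: it does not call the evaluation SDK, and the evaluator names (`relevance`, `similarity`), the column names, and the helper `find_problem_inputs` are hypothetical placeholders to adapt to your own dataset.

```python
from typing import Any


def find_problem_inputs(
    rows: list[dict[str, Any]],
    required: dict[str, list[str]],
) -> list[tuple[int, str, str, str]]:
    """Return (row_index, evaluator, field, reason) for each unusable input.

    `required` maps a (hypothetical) evaluator name to the columns it needs.
    Reasons are "missing" (key absent), "none" (value is None), and
    "empty" (blank or whitespace-only string).
    """
    problems = []
    for index, row in enumerate(rows):
        for evaluator, fields in required.items():
            for field in fields:
                if field not in row:
                    reason = "missing"
                elif row[field] is None:
                    reason = "none"
                elif isinstance(row[field], str) and not row[field].strip():
                    reason = "empty"
                else:
                    continue  # value is present and usable
                problems.append((index, evaluator, field, reason))
    return problems


# Hypothetical rows covering the cases discussed above.
rows = [
    {"query": "q1", "response": "r1", "ground_truth": "expected answer"},
    {"query": "q2", "response": "r2", "ground_truth": ""},    # empty string
    {"query": "q3", "response": "r3", "ground_truth": None},  # resolves to None
    {"query": "q4", "response": "r4"},                        # column absent
]

# Hypothetical evaluators and the columns each one requires.
required = {
    "relevance": ["query", "response"],
    "similarity": ["query", "response", "ground_truth"],
}

problems = find_problem_inputs(rows, required)
for index, evaluator, field, reason in problems:
    print(f"row {index}: {evaluator} needs '{field}' ({reason})")
```

Reporting the empty-string and `None` cases separately keeps the two behaviors from point 2 distinguishable, which may also be useful detail to include when filing the issue.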

nitya Aug 5, 2025
Maintainer

Answer selected by amynic