Batch evaluations #111
-
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 3 replies
-
|
Hey @jacobreesmontgomery thanks for asking the question!! This can also help us in future documentation. Evaluators do have a data validation step to check if the required data values are specified before evaluating that row. In a batch evaluation, my understanding is that if a specific row fails, the rest of the evaluation will still continue. However, we have reached out to the Evaluations SDK team to confirm if this is the case. We hope to have a response to you by tomorrow. If you have any other clarifications to add, please do so. |
Beta Was this translation helpful? Give feedback.
-
|
Hi @jacobreesmontgomery - thanks for your patience!! I talked to the engineer on the team and we ran some tests to see if the issue you experienced was reproducible. It was. And the behavior may be the result of a different change - so their recommendation was to have you file this as an issue. What we saw then reproducing your issue:
Recommendation: Repo (and current evaluation issues) here: |
Beta Was this translation helpful? Give feedback.

Hi @jacobreesmontgomery - thanks for your patience!! I talked to the engineer on the team and we ran some tests to see if the issue you experienced was reproducible. It was. And the behavior may be the result of a different change - so their recommendation was to have you file this as an issue.
What we saw then reproducing your issue:
If you have a batch evaluation with multiple evaluators - and your data row is missing values for parameters required by a specific evaluator - then the evaluation RUN will complete, but the relevant evaluators will show failures in logs.
If the ground truth in the data was represented by an empty string, it scores that a 1. But if it resolved to a None …