Improvements to AnswerRelevancy templates by Spectavi · Pull Request #1642 · confident-ai/deepeval

Spectavi · 2025-05-30T01:05:54Z

Ensures all examples are demarcated with "Example:" and "==== END OF EXAMPLE ====".
Adds clarifying language on splitting statements and ensuring valid JSON is output.

Make examples format consistent. Add instructions enforcing valid JSON output. Improves consistency with smaller models like Llama 3.3.

Discourages splitting a single word off into its own statement when it is part of a larger, coherent statement. Smaller models will sometimes split the document into so many "statements" that the output overruns the context window, leading to malformed, incomplete JSON.
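To illustrate the demarcation and JSON instructions described above, here is a minimal sketch of a prompt builder. It is not the deepeval template itself: the function name `build_statements_prompt`, the `"statements"` key, and the instruction wording are assumptions made for illustration. Only the `Example:` and `==== END OF EXAMPLE ====` markers come from this PR.

```python
import json

# Markers named in the PR description.
EXAMPLE_START = "Example:"
EXAMPLE_END = "==== END OF EXAMPLE ===="


def build_statements_prompt(actual_output: str) -> str:
    """Hypothetical prompt builder (an assumption, not deepeval's API)."""
    # The example payload is serialized with json.dumps so it is valid JSON.
    example_json = json.dumps(
        {"statements": ["The laptop has a high-resolution display."]},
        indent=4,
    )
    return (
        "Break the text below into statements. Do not split a single word "
        "into its own statement if it is part of a larger, coherent statement.\n"
        'Return ONLY valid JSON with a single key, "statements".\n\n'
        f"{EXAMPLE_START}\n"
        "Text: The laptop has a high-resolution display.\n"
        f"{example_json}\n"
        f"{EXAMPLE_END}\n\n"
        f"Text:\n{actual_output}\n\n"
        "JSON:\n"
    )
```

Keeping the markers in constants means every example in a template is wrapped the same way, which is the consistency this change is after.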

vercel · 2025-05-30T01:05:58Z

@Spectavi is attempting to deploy a commit to the Confident AI Team on Vercel.

A member of the Team first needs to authorize it.

Spectavi · 2025-06-02T15:56:49Z

Ping: these basic adjustments will increase the consistency of the Answer Relevancy metric, particularly when using smaller models like Llama 3.3 70B. Without them, I would sporadically get back invalid JSON, causing noise in my eval results.
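For context, the failure mode described here can be detected on the caller's side with a strict parse. The sketch below is only an illustration under stated assumptions: `parse_statements` is a hypothetical helper, not part of deepeval, and the `"statements"` key is an assumed output shape.

```python
import json
from typing import List, Optional


def parse_statements(raw: str) -> Optional[List[str]]:
    """Return the statements list, or None if the output is not usable.

    Hypothetical helper: the "statements" key is an assumed output shape.
    """
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        # Truncated output (e.g. the context window ran out) lands here.
        return None
    if not isinstance(data, dict):
        return None
    statements = data.get("statements")
    if not isinstance(statements, list):
        return None
    if not all(isinstance(s, str) for s in statements):
        return None
    return statements
```

Treating a `None` result as a failed call, rather than scoring it, keeps malformed outputs from showing up as noise in the metric.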

penguine-ip · 2025-06-02T21:15:19Z

@Spectavi thanks!

Spectavi added 2 commits May 29, 2025 17:51

Improvements to AnswerRelevancy template.

52d4cf3

Make examples format consistent. Add instructions enforcing valid JSON output. Improves consistency with smaller models like Llama 3.3.

penguine-ip merged commit 4d3ed75 into confident-ai:main Jun 2, 2025
1 of 4 checks passed


penguine-ip merged 2 commits into confident-ai:main from Spectavi:Spectavi-AnswerRelevancy-Template-Patch
