Description
Expand on agent evaluation with a notebook showing how to evaluate an agent's response using LLM as Judge.
Once we have the foundation of what LLM-as-a-Judge (LLMaJ) is and how it works, we should build on it with a notebook that evaluates agent responses using an ensemble of judges for more accurate judging.
Potential points to cover
- What is LLM as Judge?
- Why use it for agent response evaluation instead of other metrics?
- Pros and cons of ensemble of judges
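As a starting point for the ensemble notebook, the aggregation step could look like the sketch below: collect a verdict from each judge and take a majority vote. All function names here are hypothetical; in the actual notebook each judge would be a prompted LLM call (e.g. different models or different rubrics) rather than the stand-in lambdas used for illustration.

```python
# Minimal sketch of ensemble-of-judges aggregation (hypothetical helper
# names; real judges would each wrap an LLM call with its own rubric).
from collections import Counter
from typing import Callable

# A judge maps (question, agent_response) to a verdict label.
Judge = Callable[[str, str], str]

def ensemble_verdict(judges: list[Judge], question: str, response: str) -> str:
    """Ask every judge for a verdict and return the majority vote."""
    verdicts = [judge(question, response) for judge in judges]
    winner, _count = Counter(verdicts).most_common(1)[0]
    return winner

# Stand-in judges for illustration only:
judges: list[Judge] = [
    lambda q, r: "pass" if r.strip() else "fail",  # judge 1: non-empty answer
    lambda q, r: "pass" if "4" in r else "fail",   # judge 2: expected token present
    lambda q, r: "fail",                           # judge 3: always skeptical
]

print(ensemble_verdict(judges, "What is 2+2?", "4"))  # → pass (2 of 3 judges)
```

A majority vote is the simplest aggregation; the notebook could also compare weighted voting or requiring unanimity, which ties directly into the "pros and cons" point above.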