Skip to content

Add route to allow users to see supported eval run metrics#62

Merged
crekhari merged 3 commits intomainfrom
cr/supported-metrics-endpoint
Jan 31, 2025
Merged

Add route to allow users to see supported eval run metrics#62
crekhari merged 3 commits intomainfrom
cr/supported-metrics-endpoint

Conversation

@crekhari
Copy link
Contributor

@crekhari crekhari commented Jan 31, 2025

Outputs:
['answer_correctness_ragas', 'answer_relevance_ragas', 'context_recall_ragas', 'chunk_relevance', 'bertscore', 'exactmatch', 'faithfulness_ragas', 'faithfulness_selfcheckgpt', 'f1score', 'jaccard_similarity', 'knowledge_f1score', 'meteor', 'normalized_exactmatch', 'retrieval_at_k', 'rouge_for_context', 'rouge1', 'rouge2', 'rougeL', 'rougeLsum', 'sacrebleu', 'sentence_tranformers_similarity', 'verbosity_ratio']

Screenshot:
Screenshot 2025-01-31 at 5 21 29 PM

@crekhari crekhari requested a review from freddiev4 January 31, 2025 21:40
Copy link
Member

@freddiev4 freddiev4 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Very nice! I’ve got one request below

def list_metrics():
try:
quotient = QuotientAI()
response = quotient.runs.list_metrics()
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we make this a separate resource and follow the same pattern the other resources follow?

e.g.

models = quotient.models.list()

that way we have a consistent API across all the different methods

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I’d also expect to be able to run

quotient list metrics

from the CLI

@freddiev4
Copy link
Member

Also can you paste the output of the CLI command?

@crekhari crekhari merged commit f91c773 into main Jan 31, 2025
1 check failed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants