Open
Description
Hi Team,
I was wondering if we can add support for the following multi-turn datasets
- https://github.com/LMCache/LMCache/tree/dev/benchmarks/multi-round-qa
- https://github.com/ekwinox117/multi-challenge
- https://huggingface.co/datasets/bryanlincoln/infoquest
so that we don't have to run their benchmarking scripts for these specific datasets that are becoming quite important to evaluate cost/accuracy tradeoffs in agentic workflows.
Metadata
Metadata
Assignees
Type
Projects
Status
Backlog