Skip to content

Pull requests: confident-ai/deepeval

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add single-turn, multi-turn, arena and multimodal metrics to typescript
#2745 opened Jun 10, 2026 by A-Vamshi Collaborator Loading…
feat: inspect and bound GEval retrieval context prompts
#2742 opened Jun 10, 2026 by RitwijParmar Contributor Loading…
docs: add CometAPI custom provider example for GPTModel
#2733 opened Jun 8, 2026 by nuthalapativarun Contributor Loading…
Add regression testing on CI/CD
#2731 opened Jun 8, 2026 by A-Vamshi Collaborator Loading…
[codex] Add contextual precision grouping
#2729 opened Jun 8, 2026 by cat0825 Loading…
Add BGPT REFUTE benchmark
#2725 opened Jun 5, 2026 by connerlambden Loading…
Fix gemini costs
#2724 opened Jun 5, 2026 by A-Vamshi Collaborator Loading…
Fix LiteLLM cost calculation (was 666× over for common models)
#2723 opened Jun 4, 2026 by Aarkin7 Contributor Loading…
feat: expose score_breakdown in StepEfficiencyMetric
#2705 opened May 29, 2026 by nuthalapativarun Contributor Loading…
3 tasks
feat: add penalize_ambiguous_claims to HallucinationMetric
#2704 opened May 29, 2026 by nuthalapativarun Contributor Loading…
3 tasks
feat: add o3 and o3-2025-04-16 model constants
#2702 opened May 29, 2026 by nuthalapativarun Contributor Loading…
2 tasks
ProTip! no:milestone will show everything without a milestone.