feat: add AgentLoopDetectionMetric for detecting infinite loops and cyclical patterns by Ruthwik-Data · Pull Request #2819 · confident-ai/deepeval

Ruthwik-Data · 2026-06-29T17:33:56Z

Summary

Adds AgentLoopDetectionMetric to detect production failures where agents get stuck in infinite loops before completing their tasks. Addresses issue #2643.

What's included

Core detection mechanisms

Tool call repetition: Detects when the same tool is called repeatedly with identical (or nearly identical) arguments
Reasoning stagnation: Identifies when LLM outputs become highly similar across consecutive steps using n-gram overlap (default) or optional embedding-based similarity
Call cycles: Finds repeating patterns in tool call sequences (A→B→C→A)

API

from deepeval.metrics import AgentLoopDetectionMetric

loop_metric = AgentLoopDetectionMetric(
    threshold=0.5,
    repetition_threshold=3,          # max identical tool calls before flagging  
    min_identical_args_ratio=0.9,    # ratio of matching args to count as duplicate
    reasoning_stagnation_detector="ngram",  # "ngram" | "embedding"
    similarity_threshold=0.85,       # for reasoning stagnation window
    stall_steps=5,                   # max planning-only steps
)

loop_metric.measure(test_case)
print(loop_metric.score)           # 0.0-1.0 (1.0 = no loops)
print(loop_metric.reason)          # Human-readable explanation  
print(loop_metric.loop_triggers)   # List of LoopTrigger objects with step indices

Output structure

# Example
metric.score = 0.12
metric.reason = "search_web called 4x with identical args in steps [3,4,5,6]"
metric.loop_triggers = [
    LoopTrigger(
        type="tool_repeat",
        tool="search_web",
        steps=[3, 4, 5, 6],
        args_fingerprint="abc123",
        description="search_web called 4x with identical args in steps [3,4,5,6]"
    )
]

Design decisions

Deterministic by default: Uses n-gram overlap for reasoning stagnation to keep the metric zero-latency. Embedding-based detection is opt-in for higher recall.
min_identical_args_ratio: Prevents false positives on legitimate retry-with-variation patterns (e.g., a search tool re-querying with slightly different params).
Actionable output: The loop_triggers field annotates exactly which steps triggered detection and why, enabling teams to debug and fix agent logic.
Trace-only: Requires test_case._trace_dict with a steps field. Each step should include:
- tool_name and tool_args for tool calls
- llm_output or reasoning for reasoning detection

Testing

Before merging, will add:

Unit tests with fixture traces (healthy, infinite loop, cyclical)
Integration test with @observe decorator
Type stubs if needed

Happy to coordinate with @rohitmannur007 to avoid conflicts. Let me know if the API/approach aligns with the intended design!

This class analyzes agent execution traces to detect infinite loops and cyclical patterns, including tool call repetition, reasoning stagnation, and call cycles.

vercel · 2026-06-29T17:34:00Z

@Ruthwik-Data is attempting to deploy a commit to the Confident AI Team on Vercel.

A member of the Team first needs to authorize it.

Ruthwik-Data added 11 commits June 15, 2026 12:22

style: fix prettier formatting in deepseek-model.ts

903fed0

style: fix prettier formatting in kimi-model.ts

af9425c

style: fix prettier formatting in openai-model.ts

f22ad6d

style: fix prettier formatting in openrouter-model.ts

02429a1

style: fix prettier formatting in portkey-model.ts

fdf3058

style: fix prettier line wrapping in kimi-model.ts

63ec06d

Update openai-model.ts

4aa1973

feat: add AgentLoopDetectionMetric module __init__

9f28665

feat: add schema for AgentLoopDetectionMetric

723ab22

feat: add AgentLoopDetectionMetric implementation

7c88f8d

This class analyzes agent execution traces to detect infinite loops and cyclical patterns, including tool call repetition, reasoning stagnation, and call cycles.

feat: export AgentLoopDetectionMetric in __init__

57fd82a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add AgentLoopDetectionMetric for detecting infinite loops and cyclical patterns#2819

feat: add AgentLoopDetectionMetric for detecting infinite loops and cyclical patterns#2819
Ruthwik-Data wants to merge 11 commits into
confident-ai:mainfrom
Ruthwik-Data:feat/agent-loop-detection-metric

Ruthwik-Data commented Jun 29, 2026

Uh oh!

vercel Bot commented Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

Ruthwik-Data commented Jun 29, 2026

Summary

What's included

Core detection mechanisms

API

Output structure

Design decisions

Related

Testing

Uh oh!

vercel Bot commented Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant