Fix missing manager_agent tokens in usage_metrics from kickoff #2848

hegasz · 2025-05-15T20:12:16Z

The Crew.kickoff() method calculates usage_metrics by aggregating token usage from each agent.
Unless I'm missing something here, it seems to leave out manager_agent tokens (used in hierarchical crews), which could lead to under-reported usage. Is this intentional?

This PR updates the logic to delegate to calculate_usage_metrics(), which properly includes both regular agents and the manager in the aggregation.

joaomdmoura · 2025-05-15T20:14:46Z

Disclaimer: This review was made by a crew of AI Agents.

Code Review Comment for PR #2848

Overview

This pull request modifies the usage metrics collection in crew.py, specifically in the kickoff method. The aim is to prevent the loss of manager_agent tokens by restructuring the collection and aggregation of metrics.

Identified Changes

Removal of Initialization: The explicit list initialization for metrics is removed, simplifying metrics handling.
Elimination of Aggregation Loop: The previous manual loop for collecting metrics has been removed.
Introduction of calculate_usage_metrics(): This new method should centralize the logic for metrics calculation based on agents.

Issues Found

Potential Missing Method Implementation: calculate_usage_metrics() is introduced but lacks visible implementation, leading to possible runtime errors if not defined elsewhere.
Error Handling Concerns: The unchanged generic exception handling makes debugging more challenging; specific metrics-related errors should be captured.

Code Quality

Positive Aspects:

Reduced Complexity: The code is clearer without the manual metrics aggregation.
Improved Maintainability: Centralizing metrics calculations leads to better readability and potential reuse.
Memory Efficiency: Removal of temporary list creation optimizes memory usage.

Improvement Suggestions:

Documentation: The new method calculate_usage_metrics() should include a docstring outlining its purpose and return types.
Type Hints: Adding type hints for variables and method signatures would enhance clarity and maintainability.

Recommendations

Add Method Documentation
Example:

def calculate_usage_metrics(self) -> UsageMetrics:
    """
    Aggregates usage metrics from all agents, including manager agents.
    Returns:
        UsageMetrics: Combined usage metrics from all agents in the crew.
    """

Enhance Error Handling
Enhanced error handling could look like:

try:
    self.usage_metrics = self.calculate_usage_metrics()
except AttributeError as ae:
    raise CrewException("Failed to calculate metrics due to missing attributes.") from ae
except Exception as e:
    crewai_event_bus.emit("crew.error", {"error": str(e), "context": "metrics_calculation"})
    raise

Implement Type Hints
For better code readability:

def kickoff(self) -> Any:
    result: Any = None
    self.usage_metrics: UsageMetrics = self.calculate_usage_metrics()
    return result

Add Validation Logic
To improve the robustness of metrics calculation:

def calculate_usage_metrics(self) -> UsageMetrics:
    if not self.agents:
        raise CrewException("No agents available for metrics calculation.")
    return UsageMetrics().add_usage_metrics(*[
        agent._token_process.get_summary() for agent in self.agents
    ])

Conclusion

While the changes made enhance the efficiency of metrics collection, further improvements in documentation, type hints, and error handling are necessary for clarity and robustness. Implementing the above suggestions will not only maintain the original intent of the changes but also strengthen the code quality.

Testing Recommendations

Implement unit tests for calculate_usage_metrics(), including scenarios with empty agent lists and normal cases.
Validate that metrics are accurately aggregated across varying agent types.

Addressing the points above will ensure a smooth merging process and contribute to a maintainable codebase.

lucasgomide

could you add some tests to cover this issue?

…usage metrics

hegasz · 2025-05-16T12:51:52Z

could you add some tests to cover this issue?

Have added test_hierarchical_kickoff_usage_metrics_include_manager in tests/crew_test.py!

hegasz · 2025-05-21T16:04:51Z

Any updates on this? Thanks!

hegasz · 2025-05-31T11:55:13Z

@lucasgomide Hi, just surfacing this again. If this is indeed a bug it could be resulting in real financial impact to users (including myself); could someone please take a moment to double-check?

lucasgomide

nice work here

…IInc#2848) * fix(metrics): prevent usage_metrics from dropping manager_agent tokens * Add test to verify hierarchical kickoff aggregates manager and agent usage metrics --------- Co-authored-by: Lucas Gomide <[email protected]>

fix(metrics): prevent usage_metrics from dropping manager_agent tokens

320cae2

lucasgomide reviewed May 16, 2025

View reviewed changes

Add test to verify hierarchical kickoff aggregates manager and agent …

d9c478e

…usage metrics

Merge branch 'main' into manager-usage-metrics

e3ec823

lucasgomide approved these changes Jun 3, 2025

View reviewed changes

Merge branch 'main' into manager-usage-metrics

27902e5

lucasgomide merged commit e9d9dd2 into crewAIInc:main Jun 9, 2025
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix missing manager_agent tokens in usage_metrics from kickoff #2848

Fix missing manager_agent tokens in usage_metrics from kickoff #2848

Uh oh!

hegasz commented May 15, 2025

Uh oh!

joaomdmoura commented May 15, 2025

Uh oh!

lucasgomide left a comment •

edited

Loading

Uh oh!

hegasz commented May 16, 2025

Uh oh!

hegasz commented May 21, 2025

Uh oh!

hegasz commented May 31, 2025 •

edited

Loading

Uh oh!

lucasgomide left a comment

Uh oh!

Uh oh!

Uh oh!

Fix missing manager_agent tokens in usage_metrics from kickoff #2848

Fix missing manager_agent tokens in usage_metrics from kickoff #2848

Uh oh!

Conversation

hegasz commented May 15, 2025

Uh oh!

joaomdmoura commented May 15, 2025

Code Review Comment for PR #2848

Overview

Identified Changes

Issues Found

Code Quality

Recommendations

Conclusion

Testing Recommendations

Uh oh!

lucasgomide left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hegasz commented May 16, 2025

Uh oh!

hegasz commented May 21, 2025

Uh oh!

hegasz commented May 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lucasgomide left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

lucasgomide left a comment •

edited

Loading

hegasz commented May 31, 2025 •

edited

Loading