Fix issue #2798: Remove duplicate tool results in messages #2799

devin-ai-integration · 2025-05-09T07:47:37Z

Fix issue #2798: Remove duplicate tool results in messages

Description

This PR fixes issue #2798 where tool results were being duplicated in the LLM prompt, increasing token usage and latency.

The issue was caused by tool results being added to messages twice:

First directly in agent_utils.py with messages.append({"role": "assistant", "content": tool_result.result})
Then again when the formatted_answer.text (which already includes the tool result with "Observation:" prefix) is appended to messages

Changes

Removed the direct append of tool results to messages in agent_utils.py
Added a test to verify that tool results are not duplicated in messages

Testing

Added a test that verifies tool results are not duplicated in messages
The test confirms that the tool result appears at most once in the messages array

Link to Devin run

https://app.devin.ai/sessions/98b28116a3834a1db6aad90ff8ea278c

Requested by

Joe Moura ([email protected])

Co-Authored-By: Joe Moura <[email protected]>

devin-ai-integration · 2025-05-09T07:47:43Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment and CI monitoring

joaomdmoura · 2025-05-09T07:49:53Z

Disclaimer: This review was made by a crew of AI Agents.

Code Review Comment for PR #2799: Fix Tool Result Duplication

Overview

This pull request successfully addresses issue #2798 by removing duplicate tool results in messages and adding comprehensive test coverage. Noteworthy modifications include:

File Modified:
- src/crewai/utilities/agent_utils.py
- tests/test_tool_result_duplication.py (newly created)

Detailed Analysis

1. Changes in `agent_utils.py`

Modifications:
- The code responsible for appending tool results to messages has been removed to prevent duplication. This change effectively streamlines the message handling process.
Quality Assessment:
- ✅ The code is concise and focused directly on resolving the stated issue.
- ✅ No new complexities have been introduced, maintaining the original behavior and function signatures.
Improvement Suggestion:
- Adding a comment to clarify the rationale behind the removal can prevent future misunderstandings. For example:
```
# Tool results are already included in the formatted answer, 
# Duplication in messages is unnecessary and leads to token bloat.
```

2. Changes in `test_tool_result_duplication.py`

Strengths:
- Robust coverage and clear structure with well-documented test cases.
- Effective use of mocking demonstrates high-quality test practices.
Code Quality Observation:
- The docstrings provide excellent context, linking back to issue [BUG] Duplicated Tool Result in the Prompt #2798, and enhance clarity.

Improvement Suggestions:

Add Type Hints:
- To ensure better readability and maintainability, consider adding type hints to the test functions:
```
def test_tool_result_not_duplicated_in_messages() -> None:
```

Implement Parameterized Test Cases:

This will facilitate testing various inputs efficiently:

@pytest.mark.parametrize("tool_result", [
    "Sample tool result",
    "Test with special characters: @$%^&*()",
    "Long multi-line\nresult"
])
def test_tool_result_with_various_inputs(tool_result: str) -> None:

Edge Case Scenarios:

Include additional tests for empty inputs and special characters:

def test_tool_result_with_empty_message() -> None:
    # Add implementation for testing empty messages

def test_tool_result_with_special_characters() -> None:
    # Add implementation for testing special characters handling

Overall Assessment

The PR effectively tackles the duplication issue and enhances the reliability of message handling through robust test coverage.
While the changes are commendable, implementing the suggested improvements could lead to increased code maintainability and reliability.

Recommendations

Approval: The pull request should be accepted as it meets the requirements and fixes the highlighted issues.
Future Improvements: Encourage the incorporation of the proposed improvements in subsequent PRs, focusing on documentation and test enhancement.

Security Considerations

No security vulnerabilities were identified in this PR. Changes are deemed safe in terms of user data handling and authentication mechanisms.

In conclusion, this PR represents a step forward in addressing the tool result duplication while ensuring the changes maintain quality standards in code and testing practices. The ongoing attention to documentation and testing will significantly benefit future development efforts.

Co-Authored-By: Joe Moura <[email protected]>

devin-ai-integration · 2025-05-17T15:49:53Z

Closing due to inactivity for more than 7 days.

devin-ai-integration bot and others added 2 commits May 9, 2025 07:45

Fix issue #2798: Remove duplicate tool results in messages

0bb5f40

Co-Authored-By: Joe Moura <[email protected]>

Add test for tool result duplication fix

b88ea42

Co-Authored-By: Joe Moura <[email protected]>

devin-ai-integration bot and others added 5 commits May 9, 2025 08:02

Add type hints and fix CrewAgentExecutor initialization in test

d90ba65

Co-Authored-By: Joe Moura <[email protected]>

Fix import sorting in test file

d8876dc

Co-Authored-By: Joe Moura <[email protected]>

Add comment explaining tool result duplication fix

f9221ea

Co-Authored-By: Joe Moura <[email protected]>

Fix import sorting with ruff check --fix

51e34d8

Co-Authored-By: Joe Moura <[email protected]>

Fix ToolResult parameters in test

628345e

Co-Authored-By: Joe Moura <[email protected]>

devin-ai-integration bot closed this May 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix issue #2798: Remove duplicate tool results in messages #2799

Fix issue #2798: Remove duplicate tool results in messages #2799

devin-ai-integration bot commented May 9, 2025

devin-ai-integration bot commented May 9, 2025

joaomdmoura commented May 9, 2025

devin-ai-integration bot commented May 17, 2025

Fix issue #2798: Remove duplicate tool results in messages #2799

Fix issue #2798: Remove duplicate tool results in messages #2799

Conversation

devin-ai-integration bot commented May 9, 2025

Fix issue #2798: Remove duplicate tool results in messages

Description

Changes

Testing

Link to Devin run

Requested by

devin-ai-integration bot commented May 9, 2025

🤖 Devin AI Engineer

joaomdmoura commented May 9, 2025

Code Review Comment for PR #2799: Fix Tool Result Duplication

Overview

Detailed Analysis

1. Changes in agent_utils.py

2. Changes in test_tool_result_duplication.py

Overall Assessment

Recommendations

Security Considerations

devin-ai-integration bot commented May 17, 2025

1. Changes in `agent_utils.py`

2. Changes in `test_tool_result_duplication.py`