Brandon/bring back byoa #2122

bhancockio · 2025-02-13T18:26:37Z

No description provided.

joaomdmoura · 2025-02-13T18:29:44Z

Disclaimer: This review was made by a crew of AI Agents.

Code Review for crewAI PR #2122

Overview

This pull request introduces essential changes to the crewAI codebase, primarily enhancing the LangChain agent adapter functionality and improving tool handling mechanisms. The modifications have significant implications for how tasks are executed and how tools are processed, which can lead to increased efficacy within the agent framework.

Key Findings and Suggestions

1. LangChainAgentAdapter Implementation

Code Improvements

Error Handling: The current implementation in execute_task() just re-raises exceptions without context. It is recommended to wrap exceptions with logging for diagnostics:

try:
    state = self.agent_executor.invoke(init_state)
except Exception as e:
    logger.error(f"Error executing task: {str(e)}")
    raise AgentExecutionError(f"Failed to execute task: {str(e)}") from e

Logging: Utilize logging instead of print statements. For instance:
```
logger.debug(f"Raw tools: {raw_tools}")
```

Related Pull Requests

Reviewing past PRs that involved logging practices can provide insights into consistent patterns of error management throughout the codebase. A related PR that focused on logging improvements would reinforce the case for these adjustments.

2. Tool Handling Improvements

Code Improvements

Simplification of Tool Conversion Logic: Recommend creating a helper function to handle tool conversion cleanly:

def convert_tool(tool):
    return tool.to_langchain() if isinstance(tool, (CrewTool, BaseTool)) else tool

Historical Context

Past modifications related to the BaseTool class should be reviewed to ensure compatibility and acknowledge previously established conventions. Look for PRs that emphasized tool validation.

3. Agent Builder Improvements

Code Improvements

Cache Handling Logic: The cache handler logic should ensure proper logging and checks for cache validation. Adequate logging could help debug setup issues:
```
if not self.cache:
    logger.warning("Cache is disabled...")
```

4. General Code Quality Improvements

Type Hints: Enhance type hints across critical functions to improve clarity. As noted:
```
def convert_tools(cls, value: List[Union[T, Any]]) -> List[T]:
```
Documentation: Provide detailed docstrings for complex functionalities to assist current and future developers in understanding the logic.

5. Security Considerations

Code Improvements

Input Validation: Implement input sanitization functions to prevent vulnerabilities, particularly in user-provided prompts:
```
def sanitize_prompt(prompt: str) -> str:
    return re.sub(r'[<>]', '', prompt)
```

Summary of Critical Issues

Inconsistent error handling throughout the codebase.
Overuse of print statements instead of formal logging.
Insufficient input validation for user-generated content.
Incomplete type hints in function definitions.
Lack of thorough documentation for various methods.

Recommendations for Future Improvements

Develop a comprehensive logging strategy for better traceability.
Add unit tests for the newly implemented LangChainAgentAdapter.
Establish a clear error hierarchy for agent-related exceptions.
Enable runtime type checking in critical execution paths.
Employ dependency injection to enhance testability and stability.

The overall changes usher in a much-needed refinement to the codebase. Implementing these suggestions would further solidify the changes' robustness, maintainability, and security.

This commit implements a method for exporting the state of a flow into a JSON-serializable dictionary. The idea is producing a human-readable version of state that can be inspected or consumed by other systems, hence JSON and not pickling or marshalling. I consider it an export because it's a one-way process, meaning it cannot be loaded back into Python because of complex types.

* Added functionality to have any llm run test functionality * Fixed lint issues * Fixed Linting issues * Fixed unit test case * Fixed unit test * Fixed test case * Fixed unit test case --------- Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

* updating prompts * fix issue * clean up thoughts as well * drop trailing set

* feat: add prompt observability code * feat: improve logic for llm call * feat: add tests for traces * feat: remove unused improt * feat: add function to clear and add task traces * feat: fix import * feat: chagne time * feat: fix type checking issues * feat: add fixed time to fix test * feat: fix datetime test issue * feat: add add task traces function * feat: add same logic as entp * feat: add start_time as reference for duplication of tool call * feat: add max_depth * feat: add protocols file to properly import on LLM --------- Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

* WIP crew events emitter * Refactor event handling and introduce new event types - Migrate from global `emit` function to `event_bus.emit` - Add new event types for task failures, tool usage, and agent execution - Update event listeners and event bus to support more granular event tracking - Remove deprecated event emission methods - Improve event type consistency and add more detailed event information * Add event emission for agent execution lifecycle - Emit AgentExecutionStarted and AgentExecutionError events - Update CrewAgentExecutor to use event_bus for tracking agent execution - Refactor error handling to include event emission - Minor code formatting improvements in task.py and crew_agent_executor.py - Fix a typo in test file * Refactor event system and add third-party event listeners - Move event_bus import to correct module paths - Introduce BaseEventListener abstract base class - Add AgentOpsListener for third-party event tracking - Update event listener initialization and setup - Clean up event-related imports and exports * Enhance event system type safety and error handling - Improve type annotations for event bus and event types - Add null checks for agent and task in event emissions - Update import paths for base tool and base agent - Refactor event listener type hints - Remove unnecessary print statements - Update test configurations to match new event handling * Refactor event classes to improve type safety and naming consistency - Rename event classes to have explicit 'Event' suffix (e.g., TaskStartedEvent) - Update import statements and references across multiple files - Remove deprecated events.py module - Enhance event type hints and configurations - Clean up unnecessary event-related code * Add default model for CrewEvaluator and fix event import order - Set default model to "gpt-4o-mini" in CrewEvaluator when no model is specified - Reorder event-related imports in task.py to follow standard import conventions - Update event bus initialization method return type hint - Export event_bus in events/__init__.py * Fix tool usage and event import handling - Update tool usage to use `.get()` method when checking tool name - Remove unnecessary `__all__` export list in events/__init__.py * Refactor Flow and Agent event handling to use event_bus - Remove `event_emitter` from Flow class and replace with `event_bus.emit()` - Update Flow and Agent tests to use event_bus event listeners - Remove redundant event emissions in Flow methods - Add debug print statements in Flow execution - Simplify event tracking in test cases * Enhance event handling for Crew, Task, and Event classes - Add crew name to failed event types (CrewKickoffFailedEvent, CrewTrainFailedEvent, CrewTestFailedEvent) - Update Task events to remove redundant task and context attributes - Refactor EventListener to use Logger for consistent event logging - Add new event types for Crew train and test events - Improve event bus event tracking in test cases * Remove telemetry and tracing dependencies from Task and Flow classes - Remove telemetry-related imports and private attributes from Task class - Remove `_telemetry` attribute from Flow class - Update event handling to emit events without direct telemetry tracking - Simplify task and flow execution by removing explicit telemetry spans - Move telemetry-related event handling to EventListener * Clean up unused imports and event-related code - Remove unused imports from various event and flow-related files - Reorder event imports to follow standard conventions - Remove unnecessary event type references - Simplify import statements in event and flow modules * Update crew test to validate verbose output and kickoff_for_each method - Enhance test_crew_verbose_output to check specific listener log messages - Modify test_kickoff_for_each_invalid_input to use Pydantic validation error - Improve test coverage for crew logging and input validation * Update crew test verbose output with improved emoji icons - Replace task and agent completion icons from 👍 to ✅ - Enhance readability of test output logging - Maintain consistent test coverage for crew verbose output * Add MethodExecutionFailedEvent to handle flow method execution failures - Introduce new MethodExecutionFailedEvent in flow_events module - Update Flow class to catch and emit method execution failures - Add event listener for method execution failure events - Update event-related imports to include new event type - Enhance test coverage for method execution failure handling * Propagate method execution failures in Flow class - Modify Flow class to re-raise exceptions after emitting MethodExecutionFailedEvent - Reorder MethodExecutionFailedEvent import to maintain consistent import style * Enable test coverage for Flow method execution failure event - Uncomment pytest.raises() in test_events to verify exception handling - Ensure test validates MethodExecutionFailedEvent emission during flow kickoff * Add event handling for tool usage events - Introduce event listeners for ToolUsageFinishedEvent and ToolUsageErrorEvent - Log tool usage events with descriptive emoji icons (✅ and ❌) - Update event_listener to track and log tool usage lifecycle * Reorder and clean up event imports in event_listener - Reorganize imports for tool usage events and other event types - Maintain consistent import ordering and remove unused imports - Ensure clean and organized import structure in event_listener module * moving to dedicated eventlistener * dont forget crew level * Refactor AgentOps event listener for crew-level tracking - Modify AgentOpsListener to handle crew-level events - Initialize and end AgentOps session at crew kickoff and completion - Create agents for each crew member during session initialization - Improve session management and event recording - Clean up and simplify event handling logic * Update test_events to validate tool usage error event handling - Modify test to assert single error event with correct attributes - Use pytest.raises() to verify error event generation - Simplify error event validation in test case * Improve AgentOps listener type hints and formatting - Add string type hints for AgentOps classes to resolve potential import issues - Clean up unnecessary whitespace and improve code indentation - Simplify initialization and event handling logic * Update test_events to validate multiple tool usage events - Modify test to assert 75 events instead of a single error event - Remove pytest.raises() check, allowing crew kickoff to complete - Adjust event validation to support broader event tracking * Rename event_bus to crewai_event_bus for improved clarity and specificity - Replace all references to `event_bus` with `crewai_event_bus` - Update import statements across multiple files - Remove the old `event_bus.py` file - Maintain existing event handling functionality * Enhance EventListener with singleton pattern and color configuration - Implement singleton pattern for EventListener to ensure single instance - Add default color configuration using EMITTER_COLOR from constants - Modify log method calls to use default color and remove redundant color parameters - Improve initialization logic to prevent multiple initializations * Add FlowPlotEvent and update event bus to support flow plotting - Introduce FlowPlotEvent to track flow plotting events - Replace Telemetry method with event bus emission in Flow.plot() - Update event bus to support new FlowPlotEvent type - Add test case to validate flow plotting event emission * Remove RunType enum and clean up crew events module - Delete unused RunType enum from crew_events.py - Simplify crew_events.py by removing unnecessary enum definition - Improve code clarity by removing unneeded imports * Enhance event handling for tool usage and agent execution - Add new events for tool usage: ToolSelectionErrorEvent, ToolValidateInputErrorEvent - Improve error tracking and event emission in ToolUsage and LLM classes - Update AgentExecutionStartedEvent to use task_prompt instead of inputs - Add comprehensive test coverage for new event types and error scenarios * Refactor event system and improve crew testing - Extract base CrewEvent class to a new base_events.py module - Update event imports across multiple event-related files - Modify CrewTestStartedEvent to use eval_llm instead of openai_model_name - Add LLM creation validation in crew testing method - Improve type handling and event consistency * Refactor task events to use base CrewEvent - Move CrewEvent import from crew_events to base_events - Remove unnecessary blank lines in task_events.py - Simplify event class structure for task-related events * Update AgentExecutionStartedEvent to use task_prompt - Modify test_events.py to use task_prompt instead of inputs - Simplify event input validation in test case - Align with recent event system refactoring * Improve type hinting for TaskCompletedEvent handler - Add explicit type annotation for TaskCompletedEvent in event_listener.py - Enhance type safety for event handling in EventListener * Improve test_validate_tool_input_invalid_input with mock objects - Add explicit mock objects for agent and action in test case - Ensure proper string values for mock agent and action attributes - Simplify test setup for ToolUsage validation method * Remove ToolUsageStartedEvent emission in tool usage process - Remove unnecessary event emission for tool usage start - Simplify tool usage event handling - Eliminate redundant event data preparation step * refactor: clean up and organize imports in llm and flow modules * test: Improve flow persistence test cases and logging

* imporve HITL * fix failing test * fix failing test part 2 * Drop extra logs that were causing confusion --------- Co-authored-by: Lorenze Jay <[email protected]>

* Check the right property * Fix failing tests * Update cassettes * Update cassettes again * Update cassettes again 2 * Update cassettes again 3 * fix other test that fails in ci/cd * Fix issues pointed out by lorenze

Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

* Better support async * Drop coroutine

Co-authored-by: Lorenze Jay <[email protected]>

* feat: Add LLM call events for improved observability - Introduce new LLM call events: LLMCallStartedEvent, LLMCallCompletedEvent, and LLMCallFailedEvent - Emit events for LLM calls and tool calls to provide better tracking and debugging - Add event handling in the LLM class to track call lifecycle - Update event bus to support new LLM-related events - Add test cases to validate LLM event emissions * feat: Add event handling for LLM call lifecycle events - Implement event listeners for LLM call events in EventListener - Add logging for LLM call start, completion, and failure events - Import and register new LLM-specific event types * less log * refactor: Update LLM event response type to support Any * refactor: Simplify LLM call completed event emission Remove unnecessary LLMCallType conversion when emitting LLMCallCompletedEvent * refactor: Update LLM event docstrings for clarity Improve docstrings for LLM call events to more accurately describe their purpose and lifecycle * feat: Add LLMCallFailedEvent emission for tool execution errors Enhance error handling by emitting a specific event when tool execution fails during LLM calls

* feat: Enhance event listener and telemetry tracking - Update event listener to improve telemetry span handling - Add execution_span field to Task for better tracing - Modify event handling in EventListener to use new span tracking - Remove debug print statements - Improve test coverage for crew and flow events - Update cassettes to reflect new event tracking behavior * Remove telemetry references from Crew class - Remove Telemetry import and initialization from Crew class - Delete _telemetry attribute from class configuration - Clean up unused telemetry-related code * test: Improve crew verbose output test with event log filtering - Filter out event listener logs in verbose output test - Ensure no output when verbose is set to False - Enhance test coverage for crew logging behavior * dropped comment * refactor: Improve telemetry span tracking in EventListener - Remove `execution_span` from Task class - Add `execution_spans` dictionary to EventListener to track spans - Update task event handlers to use new span tracking mechanism - Simplify span management across task lifecycle events * lint

* Revert "feat: add prompt observability code (#2027)" This reverts commit 90f1bee. * Fix issues with flows post merge * Decoupling telemetry and ensure tests (#2212) * feat: Enhance event listener and telemetry tracking - Update event listener to improve telemetry span handling - Add execution_span field to Task for better tracing - Modify event handling in EventListener to use new span tracking - Remove debug print statements - Improve test coverage for crew and flow events - Update cassettes to reflect new event tracking behavior * Remove telemetry references from Crew class - Remove Telemetry import and initialization from Crew class - Delete _telemetry attribute from class configuration - Clean up unused telemetry-related code * test: Improve crew verbose output test with event log filtering - Filter out event listener logs in verbose output test - Ensure no output when verbose is set to False - Enhance test coverage for crew logging behavior * dropped comment * refactor: Improve telemetry span tracking in EventListener - Remove `execution_span` from Task class - Add `execution_spans` dictionary to EventListener to track spans - Update task event handlers to use new span tracking mechanism - Simplify span management across task lifecycle events * lint * Fix failing test --------- Co-authored-by: Lorenze Jay <[email protected]>

Missing mandatory field expected_output on task in example Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

* Fixed the issue 2123 around memory command with CLI * Fixed typo, added the recommendations * Fixed Typo * Fixed lint issue * Fixed the print statement to include path as well --------- Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

Co-authored-by: Lorenze Jay <[email protected]>

* feat: add context window size for o3-mini model Fixes #2191 Co-Authored-By: Joe Moura <[email protected]> * feat: add context window validation and tests - Add validation for context window size bounds (1024-2097152) - Add test for context window validation - Fix test import error Co-Authored-By: Joe Moura <[email protected]> * style: fix import sorting in llm_test.py Co-Authored-By: Joe Moura <[email protected]> --------- Co-authored-by: Devin AI <158243242+devin-ai-integration[bot]@users.noreply.github.com> Co-authored-by: Joe Moura <[email protected]> Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

…ilable in Amazon Bedrock (#2170) * Update constants.py This PR updates the list of foundation models available in Amazon Bedrock to reflect the latest offerings. * Update constants.py with inference profiles Add the cross-region inference profiles to increase throughput and improve resiliency by routing your requests across multiple AWS Regions during peak utilization bursts. * Update constants.py Fix the model order --------- Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

- Modify `Agent` class to add `set_knowledge` method - Allow setting embedder from crew-level configuration - Remove `_set_knowledge` method from initialization - Update `Crew` class to set agent knowledge during agent setup - Add default implementation in `BaseAgent` for compatibility

Co-authored-by: Lorenze Jay <[email protected]>

…s attribute to be public

…ken_process

…ner code

lucasgomide · 2025-04-14T19:39:29Z

duplicated #2523

bhancockio added 3 commits February 10, 2025 16:11

WIP

796e50a

wip

7910dc9

It works!

a38483e

bhancockio and others added 12 commits February 13, 2025 14:55

clean up

ff32880

fix type issues

265b373

Fix more type issues

fd0e1bd

More type fixues

b957fc1

more fixes

df21f01

more fixes

1e23d37

WIP

185556b

Fix issues

2b438ba

Fix ruff issues

8953af6

Fix new errors

74e6362

clean up RPM controller

134e7ab

Merge branch 'main' into brandon/bring-back-byoa

f8c74b4

lorenzejay self-requested a review February 19, 2025 23:07

vinibrsl and others added 13 commits February 27, 2025 13:20

fix user memory config issue (#2086)

afd01e3

Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

Bugfix/fix backtick in agent response (#2159)

e60c6e6

* updating prompts * fix issue * clean up thoughts as well * drop trailing set

Implement flow.state_utils.to_string method and improve types (#2161)

497190f

docs: update accordions and fix layout (#2110)

470254c

Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

making flow verbsoe false by default

377b64a

imporve HITL (#2169)

6480468

* imporve HITL * fix failing test * fix failing test part 2 * Drop extra logs that were causing confusion --------- Co-authored-by: Lorenze Jay <[email protected]>

Check the right property for tool calling (#2160)

f6393fd

* Check the right property * Fix failing tests * Update cassettes * Update cassettes again * Update cassettes again 2 * Update cassettes again 3 * fix other test that fails in ci/cd * Fix issues pointed out by lorenze

drop prints (#2181)

6e0e9b3

cassetes

84e0a9e

jannikmaierhoefer and others added 24 commits February 27, 2025 13:20

docs: add header image to langfuse guide (#2128)

e58e544

Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

Better support async flows (#2193)

ca9277a

* Better support async * Drop coroutine

fix reset memory issue (#2182)

6fb25a1

Co-authored-by: Lorenze Jay <[email protected]>

Update kickoff-async.mdx (#2138)

f6b0b49

Missing mandatory field expected_output on task in example Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

fix: typo in 'delegate_work' and 'ask_question' promps (#2144)

48d2b8c

Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

[MINOR]support ChatOllama from langchain_ollama (#2158)

940f064

Co-authored-by: Brandon Hancock (bhancock_ai) <[email protected]>

incorporating fix from @misrasaurabh1 with additional type fix (#2213)

92a3349

Co-authored-by: Lorenze Jay <[email protected]>

Add support for python 3.10 (#2230)

4915420

Fix type issue (#2224)

ed0b8e1

Co-authored-by: Lorenze Jay <[email protected]>

Support multiple router calls and address issue #2175 (#2231)

3f17789

Co-authored-by: Lorenze Jay <[email protected]>

Improve extract thought (#2223)

2181166

Co-authored-by: Lorenze Jay <[email protected]>

Update docs (#2226)

6c81aca

Merge branch 'main' into brandon/bring-back-byoa

33ef612

Fix token tracking in LangChainAgentAdapter and refactor token_proces…

88d8079

…s attribute to be public

Fix token tracking in Agent class to use token_process instead of _to…

ef48cbe

…ken_process

Refactor token tracking: Remove token_cost_process parameter for clea…

75d8e08

…ner code

refactor

6efee89

lucasgomide closed this Apr 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Brandon/bring back byoa #2122

Brandon/bring back byoa #2122

Uh oh!

bhancockio commented Feb 13, 2025

Uh oh!

joaomdmoura commented Feb 13, 2025

Uh oh!

lucasgomide commented Apr 14, 2025

Uh oh!

Uh oh!

Brandon/bring back byoa #2122

Brandon/bring back byoa #2122

Uh oh!

Conversation

bhancockio commented Feb 13, 2025

Uh oh!

joaomdmoura commented Feb 13, 2025

Code Review for crewAI PR #2122

Overview

Key Findings and Suggestions

1. LangChainAgentAdapter Implementation

Code Improvements

Related Pull Requests

2. Tool Handling Improvements

Code Improvements

Historical Context

3. Agent Builder Improvements

Code Improvements

4. General Code Quality Improvements

5. Security Considerations

Code Improvements

Summary of Critical Issues

Recommendations for Future Improvements

Uh oh!

lucasgomide commented Apr 14, 2025

Uh oh!

Uh oh!