sync main to staging by TimothyZhang7 · Pull Request #1368 · aden-hive/hive

TimothyZhang7 · 2026-01-27T16:34:39Z

Description

Sync main to staging branch

Add comprehensive CSV manipulation tools: - csv_read: Read CSV with pagination (limit/offset) - csv_write: Create new CSV files - csv_append: Append rows to existing CSV - csv_info: Get CSV metadata (columns, row count, file size) - csv_sql: Query CSV using SQL (powered by DuckDB) Features: - Session sandbox security (workspace_id, agent_id, session_id) - DuckDB as optional dependency for SQL queries - Security: Only SELECT queries allowed, dangerous keywords blocked - Full Unicode support - 45 tests covering all tools Install SQL support: pip install tools[sql]

- Created MockLLMProvider class that generates placeholder JSON responses - Updated AgentRunner._setup() to use MockLLMProvider when mock_mode=True - Added MockLLMProvider to llm module exports - Fixes issue where agents failed with 'LLM not available' in mock mode The MockLLMProvider extracts expected output keys from system prompts and generates mock JSON responses for structural validation without making real LLM API calls. This enables: - Testing agent structure without API keys - Fast iteration on agent graphs - CI/CD testing without credentials - Zero-cost structural validation Tested with simple agent - all nodes execute successfully in mock mode.

Signed-off-by: Arush Wadhawan <warush23+github@gmail.com>

Replace direct print() statements with Python's logging module in MCP setup and verification scripts for better configurability and production readiness. Changes: - setup_mcp.py: Convert 30+ print() calls to structured logging - verify_mcp.py: Convert 40+ print() calls to structured logging - mcp_server.py: Convert 4 print() calls to structured logging - Preserve colored CLI output using logging formatters - Maintain all functional behavior (refactor only) Benefits: - Configurable log levels (debug/info/warning/error) - Better observability in production environments - Cleaner programmatic usage (no stdout pollution) - Professional logging practices Fixes #833

Allow LLMJudge to accept any LLMProvider instance instead of being hardcoded to use Anthropic. This aligns with the framework's pluggable LLM design and enables users to: - Use the same LLM provider across their agent and tests - Run tests with cheaper or local models - Avoid requiring an Anthropic API key for testing Backward compatible: existing code using LLMJudge() without arguments continues to work by falling back to Anthropic. Closes #477 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

feat(validation): add Pydantic model validation for LLM outputs

docs(llm): add DeepSeek models support documentation and examples

fix(tools): validate Content-Type in web_scrape tool (Closes #487)

Fix: Add MockLLMProvider to enable mock mode execution

fixed linter

…vider feat(testing): add configurable LLM provider to LLMJudge

…lable' Fixes #922

…_command_tool) The "Available Tools" table listed `execute_command` but the actual registered name is `execute_command_tool`. This aligns the docs with the runtime name in __init__.py and the tool's own README. Fixes #901

… or empty Previously, when exports/ was missing or empty, the bash glob `exports/*/` would not match anything and the loop would silently do nothing. The job would pass without actually validating anything, which was misleading. Now the job: - Explicitly checks if exports/ directory exists - Uses nullglob to handle empty directories properly - Logs clear messages when skipping validation - Reports the number of agents validated when successful Fixes #887

Centralized _get_api_key in prompts.py to support OpenAI, Cerebras, and Groq via environment variables while maintaining Anthropic support through CredentialManager.

fix(graph): add logging for JSON parsing failures in worker_node

…ame-readme docs(tools): fix tool name in README table (execute_command → execute_command_tool)

…empty ci: make Validate Agent Exports skip clearly when exports/ is missing or empty

fix(types): correct type annotation from lowercase 'callable' to 'Callable'

Fix/graph retry backoff

feat(tools): add CSV tools with DuckDB SQL support

…prompts Refactor/provider agnostic prompts

- Change single quotes to double quotes in logging formatters - Fixes: setup_mcp.py, verify_mcp.py formatter strings - Addresses Q000 linter errors from PR review

refactor(mcp): replace print() with logging in setup scripts

* feat(tools): add Google Custom Search as alternative to Brave Search Adds google_search tool using Google Custom Search API as an alternative to the existing web_search tool (Brave Search). Changes: - Add google_search_tool with full implementation - Register Google credentials (GOOGLE_API_KEY, GOOGLE_CSE_ID) - Register tool in tools/__init__.py - Add README with setup instructions Closes #793 * test(tools): add unit tests for google_search tool Adds 7 tests mirroring web_search_tool test patterns: - Missing API key error handling - Missing CSE ID error handling - Empty query validation - Long query validation - num_results clamping - Default parameters - Custom language/country parameters All tests pass. * refactor(tools): add multi-provider support to web_search tool BREAKING CHANGE: None - backward compatible. Brave remains default. - Add Google Custom Search as alternative provider in web_search - Add 'provider' parameter: 'auto' (default), 'google', 'brave' - Auto mode tries Brave first for backward compatibility - Remove separate google_search_tool (consolidated into web_search) - Update tests to cover multi-provider functionality (13 tests) - Update README documentation Users with BRAVE_SEARCH_API_KEY: No changes needed Users with GOOGLE_API_KEY + GOOGLE_CSE_ID: Can use provider='google' Users with both: Brave preferred by default, use provider='google' to force Closes #793 * feat(tools): fixed readme --------- Co-authored-by: Mustafa Abdat <abdamus@hilti.com>

…solated Logic)

…-locks-leak fix(memory): patch ConcurrentStorage leak (WeakValueDictionary)

refactor: provider-agnostic LLMJudge with auto-detection for OpenAI (#1103)

github-actions · 2026-01-27T16:34:47Z

PR Closed - Requirements Not Met

This PR has been automatically closed because it doesn't meet the requirements.

Missing: No linked issue found.

To fix:

Create or find an existing issue for this work
Assign yourself to the issue
Re-open this PR and add Fixes #123 in the description

Why is this required? See #472 for details.

sync main to staging

vakrahul and others added 30 commits January 25, 2026 23:09

fix(graph): implement exponential backoff for node retries

491e658

chore(graph): fix lint issues in retry backoff loggings

5923147

verifying

0653519

style: fix F821 undefined name and E501 line length errors

1a7ed9c

fix(tools): validate Content-Type in web_scrape tool (Closes #487)

5168ed3

docs(llm): add DeepSeek models support documentation and examples

40e39d2

Signed-off-by: Arush Wadhawan <warush23+github@gmail.com>

Merge branch 'main' into feat/llm-judge-configurable-provider

69ad0be

Merge pull request #349 from Himanshu-ABES/feat/pydantic-llm-validation

798f3cf

feat(validation): add Pydantic model validation for LLM outputs

Merge pull request #788 from SoulSniper-V2/feat/add-deepseek-docs

0a8c30c

docs(llm): add DeepSeek models support documentation and examples

Merge pull request #528 from gaurav-code098/fix/web-scrape-content-type

396e5c3

fix(tools): validate Content-Type in web_scrape tool (Closes #487)

Merge pull request #576 from savankansagara1/fix/mock-mode-llm-provider

25fabd8

Fix: Add MockLLMProvider to enable mock mode execution

fixed linter

d064c98

Merge pull request #906 from adenhq/fix/ruff-tests

5cf25c6

fixed linter

Merge pull request #871 from pradyten/feat/llm-judge-configurable-pro…

9230ac6

…vider feat(testing): add configurable LLM provider to LLMJudge

fix(types): correct type annotation from lowercase 'callable' to 'Cal…

2b86046

…lable' Fixes #922

fix(graph): add logging for JSON parsing failures in worker_node

8523324

refactor: implement provider-agnostic logic for test templates

e846ad6

Centralized _get_api_key in prompts.py to support OpenAI, Cerebras, and Groq via environment variables while maintaining Anthropic support through CredentialManager.

merge: resolve conflicts in executor.pyx

1631d01

style: fix linting issues in output_cleaner.py

68264b5

Merge pull request #927 from saboor2632/fix/worker-node-json-logging

ed88129

fix(graph): add logging for JSON parsing failures in worker_node

Merge pull request #933 from adionit7/docs/fix-execute-command-tool-n…

3eb964e

…ame-readme docs(tools): fix tool name in README table (execute_command → execute_command_tool)

Merge branch 'adenhq:main' into refactor/provider-agnostic-prompts

b0435a1

Merge pull request #934 from adionit7/fix/validate-exports-skip-when-…

8525aec

…empty ci: make Validate Agent Exports skip clearly when exports/ is missing or empty

Merge pull request #946 from not-anas-ali/fix/callable-type-annotations

6d025c8

fix(types): correct type annotation from lowercase 'callable' to 'Callable'

vakrahul and others added 16 commits January 27, 2026 07:26

fix(graph): restore node.max_retries and fix type check per review

a122345

Merge branch 'main' into fix/graph-retry-backoff

03910d5

style: fix linting issues (whitespace and newline)

e59bb2d

style: add required trailing newline to prompts.py

500876d

Merge pull request #941 from vakrahul/fix/graph-retry-backoff

bc8cdfd

Fix/graph retry backoff

Merge pull request #558 from Hundao/feature/csv-tools

a4b0c66

feat(tools): add CSV tools with DuckDB SQL support

Merge pull request #948 from TanujaNair03/refactor/provider-agnostic-…

6acdb65

…prompts Refactor/provider agnostic prompts

style: fix ruff quote style violations (Q000)

407816d

- Change single quotes to double quotes in logging formatters - Fixes: setup_mcp.py, verify_mcp.py formatter strings - Addresses Q000 linter errors from PR review

refactor: make LLMJudge provider-agnostic with OpenAI support (#1103)

3605f37

refactor: provider-agnostic LLMJudge with ruff styling fixes (#1103)

598cc8b

Merge pull request #973 from AryanyAI/refactor/logging-mcp-scripts

9d39c09

refactor(mcp): replace print() with logging in setup scripts

fix(memory): patch ConcurrentStorage leak with WeakValueDictionary (I…

112b1ba

…solated Logic)

Merge branch 'adenhq:main' into fix/concurrent-storage-file-locks-leak

0381a5c

Merge pull request #1353 from Tahir-yamin/fix/concurrent-storage-file…

197f4f9

…-locks-leak fix(memory): patch ConcurrentStorage leak (WeakValueDictionary)

Merge pull request #1113 from TanujaNair03/refactor/llm-judge-agnostic

e1bea18

refactor: provider-agnostic LLMJudge with auto-detection for OpenAI (#1103)

github-actions bot closed this Jan 27, 2026

TimothyZhang7 merged commit c441494 into staging Jan 27, 2026
6 of 8 checks passed

jhalak999 pushed a commit to jhalak999/hive that referenced this pull request Feb 17, 2026

Merge pull request aden-hive#1368 from adenhq/main

840e8b9

sync main to staging

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sync main to staging#1368

sync main to staging#1368
TimothyZhang7 merged 46 commits intostagingfrom
main

TimothyZhang7 commented Jan 27, 2026

Uh oh!

github-actions bot commented Jan 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

18 participants

Conversation

TimothyZhang7 commented Jan 27, 2026

Description

Uh oh!

github-actions bot commented Jan 27, 2026

PR Closed - Requirements Not Met

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

18 participants