fix: managed agents sandbox security (fixes #1426) by praisonai-triage-agent[bot] · Pull Request #1438 · MervinPraison/PraisonAI

praisonai-triage-agent · 2026-04-17T05:46:45Z

Fixes #1426

Security Improvements

Add ManagedSandboxRequired exception for package installation safety
Modify LocalManagedAgent to use compute providers for secure execution
Implement compute-based tool execution routing for shell commands
Add host_packages_ok safety opt-out flag for developer workflows
Remove unused sandbox_type config field
Add comprehensive tests for security functionality

Behavior Changes

Packages install in sandbox when compute provider attached
Raise exception when packages specified without compute/opt-out
Route execute_command, read_file, write_file, list_files through compute
Maintain backward compatibility with explicit opt-out flag

Generated with Claude Code

Summary by CodeRabbit

Release Notes

New Features
- Compute provider integration for managed agents—execute tools and install packages within sandboxed compute environments.
- Enhanced security controls with explicit configuration requirements for local package installation.
- Improved error handling with remediation guidance when package installation or tool execution lacks proper compute provider setup.

- Add ManagedSandboxRequired exception for package installation safety - Modify LocalManagedAgent to use compute providers for secure execution - Implement compute-based tool execution routing for shell commands - Add host_packages_ok safety opt-out flag for developer workflows - Remove unused sandbox_type config field - Add comprehensive tests for security functionality Security improvements: - Packages install in sandbox when compute provider attached - Raise exception when packages specified without compute/opt-out - Route execute_command, read_file, write_file, list_files through compute - Maintain backward compatibility with explicit opt-out flag 🤖 Generated with [Claude Code](https://claude.ai/code) Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

MervinPraison · 2026-04-17T05:46:54Z

@coderabbitai review

MervinPraison · 2026-04-17T05:46:55Z

/review

coderabbitai · 2026-04-17T05:47:01Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai · 2026-04-17T05:47:11Z

Caution

Review failed

An error occurred during the review process. Please try again later.

📝 Walkthrough

Walkthrough

This PR implements security controls to prevent unintended host-level package installation and tool execution in LocalManagedAgent. It introduces compute-bridged tool execution, validates compute provider attachment for packages, adds a ManagedSandboxRequired exception, and replaces sandbox_type configuration with host_packages_ok flag.

Changes

Cohort / File(s)	Summary
Security Foundation `src/praisonai/praisonai/integrations/managed_agents.py`	Added `ManagedSandboxRequired` exception with default security-focused message to signal when package installation/tool execution requires a compute provider.
Configuration & Provisioning `src/praisonai/praisonai/integrations/managed_local.py`	Updated `LocalManagedConfig`: removed `sandbox_type`, added `host_packages_ok: bool = False`. Implemented `_provision_compute_if_needed()` helper and extended `_ensure_agent()` to auto-provision compute instances when attached.
Package Installation Routing `src/praisonai/praisonai/integrations/managed_local.py`	Refactored `_install_packages()` to execute pip installs inside compute sandbox when provider attached; raises `ManagedSandboxRequired` when compute absent and `host_packages_ok=False`.
Tool Execution Bridge `src/praisonai/praisonai/integrations/managed_local.py`	Added compute-bridged tool wrappers (`_create_compute_execute_command`, `_create_compute_read_file`, `_create_compute_write_file`, `_create_compute_list_files`). Updated `_resolve_tools()` to substitute host tools with compute-bridged variants when compute provider attached.
Test Coverage `src/praisonai-agents/tests/managed/test_managed_factory.py`	Updated `test_defaults` to validate `host_packages_ok=False` instead of `sandbox_type`. Added `TestManagedSandboxSafety` suite (package installation security scenarios) and `TestComputeToolBridge` suite (tool routing and compute command generation).

Sequence Diagram

sequenceDiagram
    participant Agent as LocalManagedAgent
    participant Config as LocalManagedConfig
    participant Compute as Compute Provider
    participant Bridge as ToolBridge
    participant Host as Host System

    Note over Agent,Host: Scenario 1: Package Installation with Compute
    Agent->>Config: has packages & compute attached?
    Config-->>Agent: true
    Agent->>Agent: _provision_compute_if_needed()
    Agent->>Compute: execute(instance_id, "pip install ...")
    Compute-->>Agent: success (installed in sandbox)

    Note over Agent,Host: Scenario 2: Package Installation without Compute
    Agent->>Config: has packages & compute attached?
    Config-->>Agent: false
    Agent->>Config: host_packages_ok check
    Config-->>Agent: false (default)
    Agent->>Agent: raise ManagedSandboxRequired
    
    Note over Agent,Host: Scenario 3: Tool Execution with Compute
    Agent->>Agent: _resolve_tools()
    alt compute attached
        Agent->>Bridge: return compute-bridged tools
        Bridge->>Compute: execute(instance_id, command, timeout)
        Compute-->>Bridge: stdout/stderr/exit_code
        Bridge-->>Agent: formatted result
    else compute absent
        Agent->>Host: load host tools
        Host-->>Agent: execute_command, read_file, etc.
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~50 minutes

Possibly related issues

Managed Agents: wire compute provider, block host pip, clarify sandbox docs (safety) #1426: This PR directly implements all acceptance criteria from the linked issue—adding ManagedSandboxRequired exception, routing _install_packages through compute or blocking with host_packages_ok, creating compute-bridged tool execution, removing sandbox_type, and adding comprehensive test coverage for sandbox safety and tool routing.
Meta: Managed Agents follow-ups — safety, observability, tests, CLI parity #1425: Referenced as the meta-analysis issue; this PR addresses the concrete code-level gaps identified there (host pip leakage, in-process tool execution, unused sandbox_type).

Poem

🐰 Hops with glee through sandboxed paths,
No pip on host—compute takes the baths!
Tools now bridge with every leap,
Security promises we keep,
Config flags for those who dare,
Safe execution everywhere!

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch claude/issue-1426-20260417-0542

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

MervinPraison · 2026-04-17T05:47:21Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:

Bloat check: Are changes minimal and focused? Any unnecessary code or scope creep?
Security: Any hardcoded secrets, unsafe eval/exec, missing input validation?
Performance: Any module-level heavy imports? Hot-path regressions?
Tests: Are tests included? Do they cover the changes adequately?
Backward compat: Any public API changes without deprecation?
Code quality: DRY violations, naming conventions, error handling?
Address reviewer feedback: If Qodo, Coderabbit, or Gemini flagged valid issues, include them in your review
Suggest specific improvements with code examples where possible

greptile-apps · 2026-04-17T05:50:20Z

Greptile Summary

This PR addresses sandbox security issue #1426 by routing shell/file tool execution through a compute provider (Docker, E2B, Modal, etc.) when one is attached, and blocking host-level pip install by default via the new ManagedSandboxRequired exception and host_packages_ok opt-out flag.

Key changes:

ManagedSandboxRequired exception added to managed_agents.py with actionable remediation guidance in the message.
Compute-bridged tool functions (_create_compute_execute_command, _create_compute_read_file, _create_compute_write_file, _create_compute_list_files) in managed_local.py now properly shell-escape parameters using shlex.quote() and the -- end-of-options separator, addressing the injection issues from the prior review.
_install_packages raises ManagedSandboxRequired when packages are requested without a compute provider and host_packages_ok=False (the default), and raises RuntimeError on non-zero exit from sandbox pip install.
Test suite was extended for the new security behavior, but three assertions in TestComputeToolBridge are stale — they reflect the pre-fix command strings (no shlex.quote, no --, echo instead of printf) and will fail in CI.

Confidence Score: 3/5

Safe to merge once the three stale test assertions are corrected; the security logic itself is sound.

The prior security injection issues (unescaped file_path, directory, and package names) are all addressed in the implementation. However, three test assertions in TestComputeToolBridge directly contradict the new implementation — they expect the old unquoted command strings and echo — and will fail in CI. Merging with known broken tests sets a bad precedent and hides regression risk, hence 3 rather than 4.

src/praisonai-agents/tests/managed/test_managed_factory.py — three assertions at lines 679, 703, and 726 need to be updated to match the shlex.quote-escaped command strings now generated by the implementation.

Important Files Changed

Filename	Overview
src/praisonai/praisonai/integrations/managed_local.py	Adds compute-bridged tool functions with `shlex.quote()` shell-escaping, `host_packages_ok` safety flag, and `ManagedSandboxRequired` exception; one fragile exception-type check in `_install_packages`.
src/praisonai/praisonai/integrations/managed_agents.py	Introduces `ManagedSandboxRequired` exception with descriptive docstring and default message; no other functional changes.
src/praisonai-agents/tests/managed/test_managed_factory.py	Three test assertions are stale: they check for the old unescaped `cat`/`ls`/`echo` command forms, but the implementation was updated in a follow-up commit to use `shlex.quote()`, `--` separators, and `printf`; these tests will fail in CI.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[Tool call: execute_command / read_file / write_file / list_files] --> B{compute provider attached?}
    B -- Yes --> C[_create_compute_* bridge]
    C --> D[shlex.quote parameters]
    D --> E[compute.execute in sandbox]
    E --> F[Return result]
    B -- No --> G[Host tool from praisonaiagents.tools]
    G --> F

    H[_install_packages called] --> I{compute provider attached?}
    I -- Yes --> J[pip install in compute sandbox]
    J --> K{exit_code == 0?}
    K -- No --> L[raise RuntimeError]
    K -- Yes --> M[Done]
    I -- No --> N{host_packages_ok == True?}
    N -- Yes --> O[subprocess pip install on HOST - unsafe]
    N -- No --> P[raise ManagedSandboxRequired]

_{Reviews (2): Last reviewed commit: "fix: secure shell command execution in m..." | Re-trigger Greptile}

Copilot · 2026-04-17T05:50:20Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:...

Reviewed commit 2ca4005 thoroughly.

Findings:

Bloat: Scope is mostly focused, but test additions are large and include repeated per-test imports that could be consolidated.
Security (important): command injection risk in compute-bridged helpers due to unescaped path interpolation:
- f"cat {file_path}"
- f"echo {escaped_content} > {file_path}" (path unescaped)
- f"ls -la {directory}"
  Suggested hardening:
```
import shlex
cmd = f"cat {shlex.quote(file_path)}"
cmd = f"echo {escaped_content} > {shlex.quote(file_path)}"
cmd = f"ls -la {shlex.quote(directory)}"
```
Performance: multiple methods create/set a new event loop on every tool call; this is costly and can conflict with existing loop state in threaded/async contexts.
Tests: good coverage for sandbox-required behavior and compute bridging; missing tests for malicious path inputs and preserved backward compatibility.
Backward compatibility (important): removing sandbox_type from LocalManagedConfig is a breaking API change for callers passing sandbox_type=....
Code quality: repeated async loop boilerplate in 6+ places; should be centralized in a helper.
Reviewer feedback: I checked current PR comments/reviews from bots; Coderabbit is still in progress and no actionable Qodo/Gemini findings are posted yet.
CI status check: latest failed runs on this branch currently show no failed jobs (0 jobs), so no actionable log failures yet.

If you want, I can push a minimal follow-up commit to fix the path escaping + restore sandbox_type as deprecated/no-op for compatibility.

MervinPraison · 2026-04-17T05:50:31Z

@claude You are the FINAL architecture reviewer. If the branch is under MervinPraison/PraisonAI (not a fork), you are able to make modifications to this branch and push directly. SCOPE: Focus ONLY on Python packages (praisonaiagents, praisonai). Do NOT modify praisonai-rust or praisonai-ts. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md

Protocol-driven: check heavy implementations vs core SDK
Backward compatible: ensure zero feature regressions
Performance: no hot-path regressions

Phase 2: FIX Valid Issues
4. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
5. Push all code fixes directly to THIS branch (do NOT create a new PR)
6. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
7. If all issues are resolved, approve the PR / close the Issue
8. If blocking issues remain, request changes / leave clear action items

praisonai-triage-agent · 2026-04-17T05:50:46Z

- Add shlex.quote() to all file paths and directory arguments - Use printf instead of echo to preserve file content integrity - Quote package names in pip install commands - Convert silent pip failures to structured RuntimeError exceptions - Add -- separators to prevent option injection Addresses command injection vulnerabilities identified by security review. Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

greptile-apps · 2026-04-17T05:56:38Z

+        result = read_file("/path/to/file.txt")
+
+        assert result == "file contents"
+        mock_compute.execute.assert_called_once_with("test_instance", "cat /path/to/file.txt", timeout=60)


Stale test assertions — test_compute_read_file_bridge will fail

The assertion at line 679 expects the raw, unquoted path, but the implementation in managed_local.py was updated (in commit b91ab6c5) to use shlex.quote() and add the -- end-of-options separator:

# managed_local.py line 342 (actual) f"cat -- {shlex.quote(file_path)}" # produces: "cat -- '/path/to/file.txt'"

The test therefore expects a command that is never generated:

# Expected (test line 679) mock_compute.execute.assert_called_once_with( "test_instance", "cat /path/to/file.txt", timeout=60 ) # Actual call "cat -- '/path/to/file.txt'"

This test will fail at CI time. Fix:

Suggested change

mock_compute.execute.assert_called_once_with("test_instance", "cat /path/to/file.txt", timeout=60)

mock_compute.execute.assert_called_once_with("test_instance", "cat -- '/path/to/file.txt'", timeout=60)

MervinPraison · 2026-04-17T06:20:37Z

Subsumed by #1442 which merged earlier today and closes #1426.

praisonai-triage-agent bot mentioned this pull request Apr 17, 2026

Managed Agents: wire compute provider, block host pip, clarify sandbox docs (safety) #1426

Closed

6 tasks

Copilot started work on behalf of MervinPraison April 17, 2026 05:47 View session

greptile-apps bot reviewed Apr 17, 2026

View reviewed changes

Copilot finished work on behalf of MervinPraison April 17, 2026 05:51

Copilot AI requested a review from MervinPraison April 17, 2026 05:51

greptile-apps bot reviewed Apr 17, 2026

View reviewed changes

MervinPraison closed this Apr 17, 2026

MervinPraison deleted the claude/issue-1426-20260417-0542 branch April 17, 2026 06:20

	mock_compute.execute.assert_called_once_with("test_instance", "cat /path/to/file.txt", timeout=60)
	mock_compute.execute.assert_called_once_with("test_instance", "cat -- '/path/to/file.txt'", timeout=60)

Uh oh!

Conversation

praisonai-triage-agent bot commented Apr 17, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Security Improvements

Behavior Changes

Summary by CodeRabbit

Release Notes

Uh oh!

MervinPraison commented Apr 17, 2026

Uh oh!

MervinPraison commented Apr 17, 2026

Uh oh!

coderabbitai bot commented Apr 17, 2026

Uh oh!

coderabbitai bot commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review failed

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Possibly related issues

Poem

Uh oh!

MervinPraison commented Apr 17, 2026

Uh oh!

greptile-apps bot commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 3/5

Important Files Changed

Flowchart

Uh oh!

Copilot AI commented Apr 17, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MervinPraison commented Apr 17, 2026

Uh oh!

praisonai-triage-agent bot commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Architecture Review & Security Fix ✅

✅ FINAL VERDICT: APPROVE WITH FIXES APPLIED

Security Issues RESOLVED

Architecture Compliance ✅

Code Quality ✅

Changes Applied

🔒 Security Assessment: SAFE TO MERGE

Uh oh!

greptile-apps bot Apr 17, 2026

Choose a reason for hiding this comment

Uh oh!

MervinPraison commented Apr 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

praisonai-triage-agent bot commented Apr 17, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Apr 17, 2026 •

edited

Loading

greptile-apps bot commented Apr 17, 2026 •

edited

Loading

praisonai-triage-agent bot commented Apr 17, 2026 •

edited

Loading