security-related mitigations by derekmisler · Pull Request #3 · docker/cagent-action

derekmisler · 2025-11-03T19:08:18Z

This PR transforms the cagent-action into a production-ready, security-hardened GitHub Action with comprehensive protections against prompt injection, secret leakage, and unauthorized access.

🔒 Security Features

Universal Security (All Modes):

Secret Leak Prevention: Scans all agent outputs for API keys/tokens (Anthropic, OpenAI, GitHub) using regex patterns
Automatic Incident Response: Creates security issues and fails workflows when secrets are detected
Output Sanitization: Universal scanning applied to every agent execution

PR Review Mode Security (when pr-number provided):

Authorization: Hardcoded OWNER/MEMBER-only access (cannot be disabled)
Input Sanitization: Removes code comments and blocks malicious diff patterns
Safe Prompt Construction: Builds prompts from sanitized sources with anti-extraction rules
Size Limits: Enforces max PR size (default 3000 lines) to prevent DoS attacks
Prompt Injection Detection: Warns about suspicious patterns in user prompts (general mode)

📁 New Files

Security Scripts (security/):

check-auth.sh - Authorization validation (OWNER/MEMBER only)
sanitize-input.sh - PR diff sanitization (removes comments, detects injection)
sanitize-output.sh - Secret leak detection in AI responses
sanitize-prompt.sh - User prompt injection detection (general mode)
secret-patterns.sh - Shared secret detection regex patterns
README.md - Complete security documentation

Agent Configuration (agents/):

pr-reviewer.yaml - Multi-agent PR reviewer with coordinator + specialists
- Root coordinator with hierarchical delegation
- review_rules - Base CI/CD review rules
- frontend_engineer - TypeScript/React/Redux/Electron specialist
- golang_engineer - Go/concurrency/stdlib specialist
- python_engineer - Django/LangGraph/async specialist
- Optimized model configurations (temperature: 0 for deterministic behavior)
- Context7 MCP integration for documentation lookup

Test Suite (tests/):

test-security.sh - Authorization, sanitization, secret detection tests
test-exploits.sh - Prompt injection attack simulation tests

🔧 Action Changes (action.yml)

New Inputs:

pr-number - Enable secure PR review mode (optional, mutually exclusive with manual agent)
max-pr-size - PR size limit in lines (default 3000)

New Outputs:

security-blocked - Whether execution was blocked for security reasons
secrets-detected - Whether secrets were found in output
prompt-suspicious - Whether user prompt contains suspicious patterns

Execution Flow:

Validation - Ensure either agent or pr-number provided
Authorization - Check contributor role (PR review mode only)
Fetch PR Data - Get diff via GitHub API with size validation
Sanitize Input - Remove comments, detect injection patterns
Build Safe Prompt - Construct prompt from sanitized sources with security rules
Run Agent - Execute with secure argument handling (no eval)
Sanitize Output - Scan for leaked secrets (universal)
Post Comment - Only if safe (no secrets detected)
Handle Incident - Create security issue and fail on secret detection

Security Improvements:

Removed eval usage - direct command execution with quoted args
Early output file creation for defensive programming
Comprehensive error handling for missing files
Shared secret patterns for consistency across checks

📚 Documentation (README.md)

Added:

Security features overview section
PR review mode usage examples
Security-focused workflow examples
Updated all version references to v2.0.0
New input/output documentation

🧪 CI/CD (test.yml)

New Test Jobs:

test-security - Runs authorization and sanitization tests
test-exploits - Tests prompt injection attack prevention
Security-focused test organization with explicit permissions

Improvements:

Pinned actions/checkout to SHA for supply chain security
Explicit contents: read permissions on all jobs
Proper job naming and organization

🔄 Workflow Updates

Manual Test Workflows:

Updated to v2.0.0
Added explicit contents: read permissions
Consistent permission declarations

Example Workflows:

Updated pr-review.yml with security best practices
Added issues: write permission for incident reporting

📊 Statistics

17 files changed
1,695 insertions, 49 deletions
6 commits on branch
5 new security scripts with comprehensive test coverage
266-line multi-agent PR reviewer configuration

🎯 Breaking Changes

v1 → v2 Migration:

agent input now optional when pr-number provided (uses built-in secure reviewer)
New security outputs may affect downstream workflow logic
PR review mode enforces OWNER/MEMBER authorization (hardcoded)

✅ Testing

All security measures validated with:

176 lines of security tests (test-security.sh)
131 lines of exploit tests (test-exploits.sh)
Automated CI/CD test execution on PR/push

This establishes cagent-action@v2 as a secure-by-default solution for AI-powered code reviews with defense-in-depth against common attack vectors.

.github/workflows/test.yml

Copilot

Pull Request Overview

This PR introduces comprehensive security mitigations for the cagent-action to enable safe AI-powered code reviews in public repositories. The implementation includes multi-layer defenses against prompt injection attacks, secret leakage, and unauthorized access.

Key Changes:

Universal secret scanning for all agent outputs to prevent API key/token leakage
PR review mode with authorization checks (OWNER/MEMBER only) and input sanitization
Automated security incident response with issue creation and workflow failure

Reviewed Changes

Copilot reviewed 14 out of 15 changed files in this pull request and generated 6 comments.

Show a summary per file

File	Description
tests/test.diff	Test fixture containing prompt injection attempt in comment
tests/test-security.sh	Security test suite covering 10 scenarios for auth, sanitization, and scanning
tests/test-exploits.sh	Red team tests simulating prompt injection and secret leak attacks
security/sanitize-prompt.sh	Detects prompt injection patterns in user-provided prompts (general mode)
security/sanitize-output.sh	Scans agent responses for leaked secrets before posting
security/sanitize-input.sh	Removes comments and blocks suspicious patterns in PR diffs
security/check-auth.sh	Validates user authorization for PR review triggers
examples/pr-review.yml	Updated example workflow using new PR review mode
agents/pr-reviewer.yaml	Multi-agent PR reviewer with hardened security rules
action.yml	Core action logic implementing defense-in-depth security architecture
README.md	Documentation of security features and usage examples
.github/workflows/test.yml	CI workflow running security and exploit tests
.github/workflows/manual-test-pirate-agent.yml	Manual test workflow with version update
.github/workflows/manual-test-github-agent.yml	Manual test workflow with version update

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

security/sanitize-input.sh

action.yml

security/sanitize-output.sh

action.yml

security/check-auth.sh

Copilot

Pull Request Overview

Copilot reviewed 14 out of 15 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

security/sanitize-input.sh

security/check-auth.sh

action.yml

Start [2025-11-03 14:08:15]

e607564

derekmisler self-assigned this Nov 3, 2025

WIP [2025-11-03 14:09:05]

8b6cf38

github-advanced-security bot found potential problems Nov 3, 2025

View reviewed changes

.github/workflows/test.yml Fixed Show fixed Hide fixed

derekmisler added the kind/feature New feature or request label Nov 3, 2025

WIP [2025-11-03 14:13:31]

1064627

derekmisler marked this pull request as ready for review November 3, 2025 20:36

derekmisler requested a review from Copilot November 3, 2025 20:36

Copilot AI reviewed Nov 3, 2025

View reviewed changes

derekmisler requested a review from Copilot November 3, 2025 20:50

Copilot AI reviewed Nov 3, 2025

View reviewed changes

security/sanitize-input.sh Show resolved Hide resolved

security/check-auth.sh Show resolved Hide resolved

action.yml Show resolved Hide resolved

action.yml Outdated Show resolved Hide resolved

derekmisler added 3 commits November 4, 2025 11:35

WIP [2025-11-03 15:07:27]

dd46e22

WIP [2025-11-03 15:58:21]

55b598e

WIP [2025-11-04 11:50:53]

531ce31

derekmisler requested a review from a team November 4, 2025 16:59

derekmisler merged commit 56ba0dd into docker:main Nov 4, 2025
5 checks passed

derekmisler deleted the security-related-mitigations branch November 4, 2025 17:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

security-related mitigations#3

security-related mitigations#3
derekmisler merged 6 commits intodocker:mainfrom
derekmisler:security-related-mitigations

derekmisler commented Nov 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

derekmisler commented Nov 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

derekmisler commented Nov 3, 2025 •

edited

Loading