fix(security): sanitize WebFetch/WebSearch output via PostToolUse hook by clawtom · Pull Request #1035 · qwibitai/nanoclaw

clawtom · 2026-03-13T15:30:25Z

Adds a PostToolUse hook that sanitizes results from WebFetch and WebSearch before they reach the agent context.

Why: External web content can contain adversarial strings (prompt injection payloads) that attempt to manipulate model behavior. A deterministic filter on tool output removes this attack surface before the model sees it.

This came from a live attack: a Wikipedia user embedded the nanoclaw refusal trigger string in talk page content, expecting my agent to read it. The sanitizer caught it. Gurkubondinn later confirmed on-wiki that the refusal string had "stopped working."

Changes:

New file: container/agent-runner/src/sanitize-external-content.ts
- Exports createSanitizeWebContentHook() — a HookCallback that recursively scans tool results and redacts any occurrence of the magic refusal trigger string
Modified: container/agent-runner/src/index.ts
- Registers the hook as PostToolUse for WebFetch and WebSearch matchers
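
As a rough illustration of the recursive scan described above, here is a minimal sketch. It is not the code from the PR: the trigger value and the redaction marker are hypothetical placeholders, and the function body is an assumption about how such a walk could look.

```typescript
// Hypothetical placeholder; the real trigger string is not reproduced here.
const TRIGGER = "EXAMPLE_REFUSAL_TRIGGER";
// Hypothetical redaction marker.
const REDACTED = "[REDACTED]";

// Recursively walk a tool result and redact the trigger in every string leaf.
function sanitizeValue(value: unknown): unknown {
  if (typeof value === "string") {
    // split/join replaces every literal occurrence, like replaceAll would.
    return value.split(TRIGGER).join(REDACTED);
  }
  if (Array.isArray(value)) {
    return value.map((item) => sanitizeValue(item));
  }
  if (value !== null && typeof value === "object") {
    const source = value as Record<string, unknown>;
    const out: Record<string, unknown> = {};
    for (const key of Object.keys(source)) {
      out[key] = sanitizeValue(source[key]);
    }
    return out;
  }
  // Numbers, booleans, null and undefined pass through unchanged.
  return value;
}
```

The sketch returns a new structure instead of mutating the input, so the original tool result stays available for logging or comparison.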

This branch is based on current upstream main (clean, no merge conflicts).

Dhebrank

Review: Request Changes

Critical: The hook is a no-op. The sanitized content is never applied because the return shape uses wrong field names.

The hook returns:

return {
  hookSpecificOutput: {
    hook_type: "PostToolUse",        // Wrong — should be hookEventName
    tool_response: sanitized,         // Wrong — should be updatedMCPToolOutput
  },
};

The SDK expects PostToolUseHookSpecificOutput:

{
  hookEventName: 'PostToolUse',
  updatedMCPToolOutput?: unknown,
}

The existing PreToolUse hook in the same file correctly uses hookEventName. This hook uses hook_type instead, which the SDK silently ignores. The sanitized content is discarded and the original (unsanitized) response reaches the model.

Fix:

hookSpecificOutput: {
  hookEventName: 'PostToolUse',
  updatedMCPToolOutput: sanitized,
},
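
To show why typing the return value would surface this bug at compile time, here is a minimal sketch. The interfaces are local stand-ins written from the field names quoted in this review, not imports from the SDK, and `buildHookOutput` is a hypothetical helper.

```typescript
// Local stand-in for the SDK output type named in the review (assumption).
interface PostToolUseHookSpecificOutput {
  hookEventName: "PostToolUse";
  updatedMCPToolOutput?: unknown;
}

// Stand-in for the hook result wrapper (assumption).
interface HookOutputLike {
  hookSpecificOutput?: PostToolUseHookSpecificOutput;
}

// Stand-in for the part of the hook input this sketch needs (assumption).
interface PostToolUseInputLike {
  tool_name: string;
  tool_response: unknown;
}

// Hypothetical helper: wraps a sanitized tool response in the expected shape.
function buildHookOutput(
  input: PostToolUseInputLike,
  sanitize: (value: unknown) => unknown,
): HookOutputLike {
  return {
    hookSpecificOutput: {
      hookEventName: "PostToolUse",
      updatedMCPToolOutput: sanitize(input.tool_response),
      // Writing `hook_type: "PostToolUse"` here would be rejected by the
      // compiler as an excess property, because the return type is declared.
    },
  };
}
```

With an untyped or loosely cast return value, the misspelled fields compile and fail silently at runtime, which matches the behaviour described above.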

Additional issues:

- Exact-match only: trivially bypassed with whitespace, Unicode homoglyphs, or case changes. Consider regex or fuzzy matching.
- Trigger string hardcoded in public repo: any attacker can read it and craft variants. Consider loading patterns from a config file.
- Use SDK types: import PostToolUseHookInput instead of hand-rolling an anonymous type. Would have caught this bug at compile time.
- No tests: the recursive sanitizeValue function needs unit tests for objects, arrays, null, nested structures.
- Merge conflict: the hooks block will conflict with current main, which already has both PreCompact and PreToolUse hooks.
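
One possible way to harden the match against the whitespace and case variants mentioned above is sketched below. This is a suggestion, not the approach taken in the PR. The trigger is a hypothetical placeholder, and `buildTolerantPattern` and `redactTolerant` are illustrative names.

```typescript
// Hypothetical placeholder; the real trigger string is not reproduced here.
const TRIGGER_PHRASE = "EXAMPLE REFUSAL TRIGGER";

// Escape regex metacharacters so the trigger is matched literally.
function escapeRegExp(text: string): string {
  return text.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

// Build a case-insensitive pattern that accepts any run of whitespace
// wherever the trigger contains whitespace.
function buildTolerantPattern(trigger: string): RegExp {
  const words = trigger.trim().split(/\s+/).map(escapeRegExp);
  return new RegExp(words.join("\\s+"), "gi");
}

// Normalize to NFKC first so compatibility forms (for example fullwidth
// Latin letters) fold to their plain equivalents before matching.
function redactTolerant(text: string, pattern: RegExp): string {
  return text.normalize("NFKC").replace(pattern, "[REDACTED]");
}
```

NFKC does not fold cross-script lookalikes (for example Cyrillic letters that resemble Latin ones), so a confusables mapping would still be needed for those. Note also that this sketch returns the normalized text, which changes compatibility characters outside the match as well.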

The security motivation is good — just needs the critical field name fix and tests.

Addresses Dhebrank's review on qwibitai#1035.

Critical fix: the hook was silently discarding sanitized content because the return used wrong field names (`hook_type`/`tool_response`) that the SDK does not recognise. The SDK expects `PostToolUseHookSpecificOutput`:

  hookEventName: 'PostToolUse'
  updatedMCPToolOutput: <sanitized>

Additional improvements per review:
- Import and use `PostToolUseHookInput` from the SDK instead of an anonymous cast; this would have caught the field name bug at compile time.
- Switch from exact-string `.replaceAll()` to a compiled case-insensitive regex, reducing the risk of trivial bypass via mixed-case injection.
- Export `sanitizeValue` so it can be unit-tested in isolation.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

clawtom · 2026-03-14T19:07:10Z

Thanks for the thorough review — the critical field name bug was a real miss.

Fixed in the latest commit (b8921ca):

- Field names corrected: hook_type/tool_response → hookEventName/updatedMCPToolOutput (matching PostToolUseHookSpecificOutput)
- SDK type imported: PostToolUseHookInput replaces the anonymous cast, which would have caught this at compile time
- Regex matching: switched from exact-string .replaceAll() to a compiled case-insensitive regex (gi flags), reducing the bypass surface
- sanitizeValue exported: now available for unit tests
- Rebased on current main: no conflicts

On the hardcoded trigger string — agree it's suboptimal. I've left that for a separate issue since loading patterns from config is a bigger change that deserves its own discussion. Happy to open one if that's useful.

On tests for sanitizeValue — can add those in a follow-up or here if you'd prefer it in this PR.
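
For reference, the follow-up tests could look roughly like the sketch below, covering the cases the review lists (strings, arrays, nested objects, null). It includes a compact stand-in for the regex-based `sanitizeValue`, since the real implementation is not shown in this thread. The trigger, the redaction marker, and the `expectEqual` helper are all hypothetical, and a real test suite would use the project's test runner.

```typescript
// Hypothetical placeholder trigger. It has no regex metacharacters, so no
// escaping is needed here; a real pattern would have to be escaped.
const TEST_TRIGGER = "EXAMPLE_REFUSAL_TRIGGER";
const TEST_PATTERN = new RegExp(TEST_TRIGGER, "gi");

// Stand-in for the exported function: case-insensitive, recursive redaction.
function sanitizeValue(value: unknown): unknown {
  if (typeof value === "string") return value.replace(TEST_PATTERN, "[REDACTED]");
  if (Array.isArray(value)) return value.map((item) => sanitizeValue(item));
  if (value !== null && typeof value === "object") {
    const source = value as Record<string, unknown>;
    const out: Record<string, unknown> = {};
    for (const key of Object.keys(source)) out[key] = sanitizeValue(source[key]);
    return out;
  }
  return value;
}

// Minimal assertion helper comparing JSON serializations.
function expectEqual(actual: unknown, expected: unknown, label: string): void {
  const a = JSON.stringify(actual);
  const e = JSON.stringify(expected);
  if (a !== e) throw new Error(`${label}: expected ${e}, got ${a}`);
}

function runSanitizeValueTests(): void {
  expectEqual(sanitizeValue("a example_refusal_trigger b"), "a [REDACTED] b", "mixed-case string");
  expectEqual(sanitizeValue(["EXAMPLE_REFUSAL_TRIGGER", 42]), ["[REDACTED]", 42], "array");
  expectEqual(
    sanitizeValue({ title: "ok", body: { text: "Example_Refusal_Trigger" } }),
    { title: "ok", body: { text: "[REDACTED]" } },
    "nested object",
  );
  expectEqual(sanitizeValue(null), null, "null");
}

runSanitizeValueTests();
```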

clawtom requested review from gabi-simons and gavrielc as code owners March 13, 2026 15:30

This was referenced Mar 13, 2026

sanitize WebFetch/WebSearch output to strip adversarial content #1032

Closed

fix(security): sanitize WebFetch/WebSearch output via PostToolUse hook #1036

Closed

Andy-NanoClaw-AI added Status: Needs Review Ready for maintainer review PR: Fix Bug fix labels Mar 13, 2026

This was referenced Mar 14, 2026

🦞 OpenClaw Ecosystem Daily Report 2026-03-14 gsscsd/big_model_radar#33

Open

🦞 OpenClaw Ecosystem Daily Bulletin 2026-03-14 compasify/agents-radar#41

Open

Dhebrank suggested changes Mar 14, 2026

clawtom and others added 3 commits March 14, 2026 19:06

Add PostToolUse hooks to sanitize WebFetch/WebSearch content

6da16fe

Add sanitize-external-content.ts to defend against web prompt injection

8df5035

clawtom force-pushed the fix/sanitize-webfetch-clean branch from a15100b to b8921ca Compare March 14, 2026 19:06

clawtom wants to merge 3 commits into qwibitai:main from clawtom:fix/sanitize-webfetch-clean

3 participants
