
fix: bound advanced pickle global extraction#700

Open
mldangelo wants to merge 3 commits into main from codex/propose-fix-for-opcode-exhaustion-vulnerability

Conversation


@mldangelo mldangelo commented Mar 13, 2026

Motivation

  • Prevent a denial-of-service risk: _extract_globals_advanced() materialized every opcode via list(pickletools.genops(data)) with no opcode-count or timeout bounds, so a crafted pickle could exhaust CPU/RAM before guarded scanning began.
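To illustrate the risk (a minimal sketch, not the project's code): every byte of a crafted stream can encode its own opcode, so materializing the full opcode list scales linearly with attacker-controlled input and holds all of it in memory at once.

```python
import pickletools

# Each b"N" is a one-byte NONE opcode and b"." is STOP, so the opcode
# count grows linearly with input size. list(...) materializes all of
# them up front, which is the pattern this fix removes.
crafted = b"N" * 100_000 + b"."
ops = list(pickletools.genops(crafted))  # unbounded materialization
print(len(ops))  # 100001 opcodes from ~100 KB of input
```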

Description

  • Enforce extraction budgets inside PickleScanner._extract_globals_advanced() by iterating pickletools.genops() incrementally and checking self.max_opcodes and the scan timeout on each opcode, stopping extraction gracefully when a budget is exceeded.
  • Preserve already-discovered globals when extraction is truncated and avoid raising on internal parser errors (maintains previous error-handling behavior).
  • Add regression tests in tests/scanners/test_pickle_scanner.py that assert advanced extraction exits when the opcode budget is exceeded and when the scan timeout budget is already exhausted.
  • Files changed: modelaudit/scanners/pickle_scanner.py, tests/scanners/test_pickle_scanner.py.

Testing

  • Ran formatting and linting: uv run ruff format modelaudit/ tests/ and uv run ruff check --fix modelaudit/ tests/ (passed).
  • Type checks: uv run mypy modelaudit/ (passed).
  • Full test suite: uv run pytest -n auto -m "not slow and not integration" --maxfail=1. One pre-existing, unrelated test failed (tests/utils/helpers/test_secure_hasher.py::TestErrorHandling::test_hash_permission_denied); this failure is not caused by this change, and all other tests ran.
  • Targeted tests: uv run pytest tests/scanners/test_pickle_scanner.py (47 passed) and uv run pytest tests/scanners/test_pickle_scanner.py -k "extract_globals_advanced_respects" (2 passed), confirming the regression tests and the bounded extraction behavior.

Codex Task

Summary by CodeRabbit

  • New Features

    • Pickle scanning now enforces opcode-count limits and scan timeouts, with early termination and partial-result reporting to avoid long-running or resource-heavy scans.
  • Tests

    • Added tests confirming opcode limit enforcement and timeout-based early termination during pickle scanning.

Stop unbounded opcode materialization in _extract_globals_advanced by enforcing max_opcodes and timeout during genops iteration. Add regression tests to verify extraction exits when opcode or timeout budgets are exceeded.

coderabbitai bot commented Mar 13, 2026

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: 3bd34981-eba7-45c9-9265-868408e66b1e

📥 Commits

Reviewing files that changed from the base of the PR and between 55e9cb5 and b4b8df4.

📒 Files selected for processing (2)
  • modelaudit/scanners/pickle_scanner.py
  • tests/scanners/test_pickle_scanner.py

Walkthrough

Added time- and opcode-bounded behavior to the pickle scanner's advanced global extraction: _extract_globals_advanced accepts an optional scan_start_time, tracks opcode count and effective start time, enforces max_opcodes and timeout with early termination and an extraction_truncated flag, and returns partial results when truncation occurs.

Changes

Pickle Scanner Resource Limits (modelaudit/scanners/pickle_scanner.py):
Changed the _extract_globals_advanced signature to accept an optional scan_start_time. Added tracking for extracted_opcodes, effective_scan_start_time, and extraction_truncated. Enforced max_opcodes and the timeout within opcode iteration, with early breaks and warning logs; returns partial globals on truncation.

Bounds Checking Tests (tests/scanners/test_pickle_scanner.py):
Added tests test_extract_globals_advanced_respects_max_opcodes and test_extract_globals_advanced_respects_scan_timeout that mock pickletools.genops and time.time to validate opcode-limiting and timeout behavior. Updated imports (Iterator, BytesIO) for the new tests.
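The testing approach described above can be sketched with a counting fake generator (illustrative only; the real tests live in tests/scanners/test_pickle_scanner.py and use pytest's monkeypatch rather than unittest.mock, and consume_bounded is a hypothetical stand-in for the scanner's extraction loop).

```python
import itertools
import pickletools
from unittest import mock

def consume_bounded(data: bytes, max_opcodes: int) -> int:
    """Stand-in for the scanner's loop: iterate genops incrementally."""
    processed = 0
    for processed, _op in enumerate(pickletools.genops(data), 1):
        if processed >= max_opcodes:
            break
    return processed

emitted = itertools.count()  # tracks how many ops the fake generator yields

def fake_genops(_data):
    for _ in range(1_000_000):  # far more ops than the budget allows
        next(emitted)
        yield (None, None, 0)   # (opcode, arg, pos) placeholder

with mock.patch("pickletools.genops", fake_genops):
    processed = consume_bounded(b"irrelevant", max_opcodes=3)

assert processed == 3       # opcode budget honored
assert next(emitted) == 3   # generator was not exhausted: only 3 ops pulled
```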

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

🐰 I nibble through opcodes, counting every hop,
A timeout bell rings — I gracefully stop.
Truncated but tidy, I carry a few,
Safe little globals, all cozy and new. 🥕

🚥 Pre-merge checks | ✅ 2 passed | ❌ 1 failed

❌ Failed checks (1 warning)
  • Docstring Coverage ⚠️ Warning: docstring coverage is 75.00%, below the required 80.00% threshold. Resolution: write docstrings for the functions missing them.

✅ Passed checks (2 passed)
  • Description Check ✅ Passed: check skipped because CodeRabbit's high-level summary is enabled.
  • Title Check ✅ Passed: the title 'fix: bound advanced pickle global extraction' directly summarizes the main change, adding bounds to the advanced pickle global extraction to prevent denial of service.



coderabbitai bot left a comment

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
modelaudit/scanners/pickle_scanner.py (1)

3837-3862: ⚠️ Potential issue | 🟠 Major

Timeout budget is not enforced during post-processing

At Line 3837, _build_symbolic_reference_maps(ops) and the subsequent full for loop still run even after timeout has already been exceeded, which can add significant O(n) work and blunt the timeout safeguard.

Proposed fix
-            stack_global_refs, _callable_refs = _build_symbolic_reference_maps(ops)
+            if (
+                effective_scan_start_time is not None
+                and (time.time() - effective_scan_start_time) > self.timeout
+            ):
+                logger.warning(
+                    f"Advanced global extraction post-processing stopped after exceeding timeout ({self.timeout}s)"
+                )
+                extraction_truncated = True
+                stack_global_refs = {}
+            else:
+                stack_global_refs, _callable_refs = _build_symbolic_reference_maps(ops)

             last_byte = data.read(1)
             if last_byte:
                 data.seek(-1, 1)

             for n, (opcode, arg, _pos) in enumerate(ops):
+                if (
+                    effective_scan_start_time is not None
+                    and (time.time() - effective_scan_start_time) > self.timeout
+                ):
+                    extraction_truncated = True
+                    break
                 op_name = opcode.name

As per coding guidelines: "Preserve or strengthen security detections; test both benign and malicious samples when adding scanner/feature changes."

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@modelaudit/scanners/pickle_scanner.py` around lines 3837 - 3862, The
post-processing currently always calls _build_symbolic_reference_maps(ops) and
iterates the entire for n,(opcode,arg,_pos) in enumerate(ops) loop even if the
scanner timeout has already been exceeded; modify the code to first check the
scanner timeout right after reading ops (and before calling
_build_symbolic_reference_maps) and again inside the for-loop (e.g., at loop
start) to break/return early if the timeout is exceeded, ensuring you avoid
calling expensive work when timed out; update references around
_build_symbolic_reference_maps, ops, the for-loop that computes globals_found,
and the STACK_GLOBAL handling (including logger.debug and adding
("unknown","unknown")) so behavior is preserved when not timed out and aborted
cleanly when it is.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@tests/scanners/test_pickle_scanner.py`:
- Around line 1213-1228: The test currently yields only NOP ops so it can't
prove max_opcodes truncation; modify the mocked genops used by
test_extract_globals_advanced_respects_max_opcodes to yield GLOBAL-like opcodes
(or ops whose handling in PickleScanner._extract_globals_advanced would add
names to the globals set) and include a side-effect consumption counter in the
mock (or a flag set when the generator is exhausted) so you can assert that only
up to 3 ops were processed: verify globals_found contains only the limited
subset expected from the first 3 GLOBAL-like yields and/or assert the generator
consumption counter is < total yielded to prove parsing stopped after
max_opcodes; keep the monkeypatch target
"modelaudit.scanners.pickle_scanner.pickletools.genops" and the test call to
PickleScanner._extract_globals_advanced.
- Around line 1230-1247: The test
test_extract_globals_advanced_respects_scan_timeout uses NOP-only _fake_genops
which never yields globals so the timeout behavior isn't exercised; modify
_fake_genops used in PickleScanner (scanner variable and its method
_extract_globals_advanced) to emit at least one opcode that would produce a
global (or a countered side-effect) while still iterating many times so the
timeout check is actually reached, and assert behavior based on that emission
(e.g., that the scanner stops/returns an empty set when time advances); also
move the BytesIO import out of the function to the module-level imports for
consistency with other tests.


ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: 04413eca-941a-4027-97ae-e96838cb8964

📥 Commits

Reviewing files that changed from the base of the PR and between 698859d and 55e9cb5.

📒 Files selected for processing (2)
  • modelaudit/scanners/pickle_scanner.py
  • tests/scanners/test_pickle_scanner.py
