Harden privacy-filter remote-code allowlist for 1.5.2 by maziyarpanahi · Pull Request #59 · maziyarpanahi/openmed

maziyarpanahi · 2026-05-27T08:46:15Z

Summary

This PR prepares the 1.5.2 security release branch after the merged MLX converter fix in #58.

It hardens the privacy-filter loading path so attacker-controlled Hugging Face repository names that merely contain privacy-filter no longer route through a loader path that can enable trust_remote_code=True.

What Changed

Added an explicit first-party remote-code allowlist for the privacy-filter family:
- openai/privacy-filter
- OpenMed/privacy-filter-multilingual
- OpenMed/privacy-filter-nemotron
Changed PrivacyFilterTorchPipeline so trust_remote_code defaults to False.
Updated create_privacy_filter_pipeline() to opt in to trust_remote_code=True only after resolving the actual fallback model and checking the allowlist.
Tightened privacy-filter identifier matching so arbitrary names such as attacker/foo-privacy-filter-bar no longer route through the privacy-filter dispatcher.
Added OPENMED_TRUSTED_REMOTE_CODE_MODELS as an operator escape hatch for controlled/private fine-tunes.
Added security regression coverage at both unit and HTTP-service levels.
Bumped public release/version surfaces to 1.5.2.
Updated CHANGELOG.md for the full 1.5.2 release, including the previously merged fix: add MLX weight remapping for openai_privacy_filter / nemotron architecture #58 MLX conversion fix.

Root Cause

The privacy-filter dispatcher previously identified privacy-filter models with a substring match. That meant any model name containing privacy-filter could reach the privacy-filter-specific path. The PyTorch privacy-filter wrapper also defaulted trust_remote_code=True, which is required for first-party OpenAI/OpenMed privacy-filter repos but unsafe for arbitrary repositories.

This PR separates two concerns:

routing: only first-party privacy-filter identifiers and local privacy-filter artifacts are routed as privacy-filter-family requests;
remote code execution: only allowlisted repositories or operator-controlled local/env-configured models may opt in to trust_remote_code=True.

Release Notes

1.5.2 now includes both:

the security hardening in this PR;
the merged fix: add MLX weight remapping for openai_privacy_filter / nemotron architecture #58 fix for raw HuggingFace-to-MLX conversion of the OpenAI Privacy Filter family, including BF16-to-float32 NumPy conversion, OPF/Nemotron weight remapping, QKV fusion, classifier-bias preservation, and weight key/shape validation.

Validation

python -m pytest
- 1194 passed, 1 skipped, 15 warnings
python scripts/release/check_repo_policy.py
- passed

The warnings are existing deprecation/span-validation warnings and are not introduced by this change.

Harden privacy-filter remote-code allowlist for 1.5.2

maziyarpanahi added 14 commits May 27, 2026 10:45

Harden privacy filter torch allowlist

ba816d0

Restrict privacy filter identifier matching

65201aa

Gate privacy filter remote code dispatch

28ebc3a

Add privacy filter security regression tests

41ac218

Update privacy filter routing expectations

9354102

Add service privacy filter security regressions

efc9e42

Bump package version to 1.5.2

f8e2949

Update README for 1.5.2

7b1e529

Update docs index for 1.5.2

b819c6c

Update examples docs for 1.5.2

0a46eaf

Update MLX backend docs for 1.5.2

e7fbecd

Update Swift OpenMedKit docs for 1.5.2

84f2d0e

Update website copy for 1.5.2

6ab46a4

Document 1.5.2 changes

f1b5e45

maziyarpanahi self-assigned this May 27, 2026

maziyarpanahi marked this pull request as ready for review May 27, 2026 09:28

maziyarpanahi merged commit 98724f6 into master May 27, 2026
13 checks passed

maziyarpanahi deleted the security/model-allowlist branch May 27, 2026 10:08

maziyarpanahi added a commit that referenced this pull request May 27, 2026

Merge pull request #59 from maziyarpanahi/security/model-allowlist

6d094a0

Harden privacy-filter remote-code allowlist for 1.5.2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Harden privacy-filter remote-code allowlist for 1.5.2#59

Harden privacy-filter remote-code allowlist for 1.5.2#59
maziyarpanahi merged 14 commits into
masterfrom
security/model-allowlist

maziyarpanahi commented May 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

maziyarpanahi commented May 27, 2026

Summary

What Changed

Root Cause

Release Notes

Validation

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant