[Multi-Agent Privacy] Detection tools implementation by JCHAVEROT · Pull Request #1 · JCHAVEROT/mmore

JCHAVEROT · 2026-05-12T16:05:06Z

Summary

Related issue: #292
Depending on: #285
Target branch: swiss-ai:mmore/v2

This PR adds a Personally Identifiable Information (PII) detection toolkit whose engines will later be exposed as tools to the agentic privacy system.

What this adds

A PII detection toolkit under mmore.privacy.detection with four interchangeable engines:
- GLiNER
- OpenAI privacy-filter
- Presidio
- LLM engine using DSPy for structured LLM output
Each engine takes a shared DetectionConfig and registers itself in a tool registry so agents can call it.
Model loading is lazy and shared across engines via a single model registry that is global to the pipeline, so each model loads once and can be reused across agents:
- LRU eviction with a memory budget that defaults to a fraction of device memory (CUDA/MPS/CPU) and can be overridden with MMORE_PRIVACY_MODEL_BUDGET_MB
- Caching can be disabled entirely with MMORE_PRIVACY_MODEL_CACHE=0
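
The caching behavior described above can be illustrated with a minimal sketch. It is not the PR's implementation: the class name `ModelRegistry`, its methods, the caller-supplied `size_mb`, and the fixed 4096 MB fallback (standing in for "a fraction of device memory") are all assumptions made for illustration. Only the two environment variable names come from the description.

```python
import os
import threading
from collections import OrderedDict
from typing import Any, Callable, List, Optional


class ModelRegistry:
    """Hypothetical lazy, thread-safe model cache with LRU eviction under a budget."""

    def __init__(self, budget_mb: Optional[float] = None) -> None:
        # MMORE_PRIVACY_MODEL_CACHE=0 disables caching entirely.
        self.enabled = os.environ.get("MMORE_PRIVACY_MODEL_CACHE", "1") != "0"
        # MMORE_PRIVACY_MODEL_BUDGET_MB overrides the default budget.
        env_budget = os.environ.get("MMORE_PRIVACY_MODEL_BUDGET_MB")
        if env_budget is not None:
            budget_mb = float(env_budget)
        # Assumption: a fixed fallback replaces the device-memory fraction.
        self.budget_mb = 4096.0 if budget_mb is None else budget_mb
        self._entries = OrderedDict()  # key -> (model, size_mb), oldest first
        self._lock = threading.Lock()

    def get(self, key: str, loader: Callable[[], Any], size_mb: float) -> Any:
        """Return the cached model for `key`, loading it on first use."""
        if not self.enabled:
            return loader()
        with self._lock:
            if key in self._entries:
                self._entries.move_to_end(key)  # mark as most recently used
                return self._entries[key][0]
            model = loader()
            self._entries[key] = (model, size_mb)
            # Evict least recently used entries until the budget is respected,
            # always keeping the model that was just loaded.
            while self._used_mb() > self.budget_mb and len(self._entries) > 1:
                self._entries.popitem(last=False)
            return model

    def _used_mb(self) -> float:
        return sum(size for _, size in self._entries.values())

    def cached_keys(self) -> List[str]:
        return list(self._entries)
```

In this sketch the caller reports each model's size. A real registry would more likely measure it, for example by comparing process or device memory before and after loading.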

Dependencies / CI

The privacy extra now has new dependencies (gliner, presidio, spacy, dspy, and psutil for memory measurements).
A new, separate extra privacy-openai-filter (transformers>=5, peft) is added, because there is currently a conflict with marker-pdf from the process extra (this will be resolved once #191 is closed).

Tests

Unit tests for all four engines that use mocks so they run without downloading models
These tests are intentionally temporary and will be replaced by end-to-end tests once the full privacy multi-agent system is in place
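
As an illustration of how a mock-based test can exercise an engine without downloading a model, here is a minimal sketch. `GlinerLikeEngine` and its injected `load_model` callable are hypothetical and are not the PR's classes. The mocked `predict_entities` call and its dict output are modeled on GLiNER's prediction output, which is an assumption about how the real engine consumes it.

```python
from unittest.mock import MagicMock


class GlinerLikeEngine:
    """Hypothetical engine: the model loader is injected so tests can fake it."""

    def __init__(self, load_model, threshold=0.4):
        self._load_model = load_model
        self._model = None
        self.threshold = threshold

    def detect(self, text):
        if self._model is None:  # lazy load on first use
            self._model = self._load_model()
        raw = self._model.predict_entities(text, ["PERSON"], threshold=self.threshold)
        return [(e["start"], e["end"], e["label"]) for e in raw]


def test_detect_uses_mocked_model():
    # The fake model returns a canned prediction, so nothing is downloaded.
    fake_model = MagicMock()
    fake_model.predict_entities.return_value = [
        {"start": 0, "end": 5, "label": "PERSON", "score": 0.99, "text": "Linda"}
    ]
    loader = MagicMock(return_value=fake_model)

    engine = GlinerLikeEngine(loader)
    assert engine.detect("Linda called.") == [(0, 5, "PERSON")]

    # A second call must reuse the already loaded model.
    engine.detect("Linda called.")
    loader.assert_called_once()
```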

Disclaimer: the large line counts in the diff mostly come from the new dependencies, i.e. changes to the uv.lock file.

Demo

Input note (AI generated)

# Progress Note - Internal Medicine

Pt is a 58 yo M, goes by "Bobby," seen this AM on rounds (bed 4B, Tower 3).
Known to our service from the 3/2 admission - see prior note by Dr. Garcia.
Wife (Linda, reachable on her cell 617-555-0148, or at the house on Linwood
Ave) was at bedside overnight and is the HCP. Hx obtained partly from pt,
partly from the daughter who flew in from Austin.

Pt c/o "the same chest thing as before," denies fevers. Says he stopped the
metoprolol ~2 wks ago bc he "ran out and couldn't get through to the office."
Smokes ~1 ppd, quit date keeps changing.

Of note, records faxed over from Dr. R. Lee's office (St. Mary's, the one off
123 Main) list a different DOB than what we have - pt says 4/23/65 but the
face sheet says 04/23/1955, needs reconciling. MRN on the wristband (12345678)
matches the chart; the outside packet had 1245-6788 which is probably a
transcription error. Insurance still showing the old BCBS plan, member id
BCXY 99-88-77, though pt thinks he switched in Jan. Front desk left a vm at
555 867 5309 re: policy AB1234567.

Pt mentioned he emailed photos of the rash to "the dermatology guy" at
jsmith@hosp-derm.org last week, unclear which provider. Asked us to call his
brother (no name given, "he's a nurse over at the VA"). SSN partially visible
on a scanned form in the chart (xxx-xx-4321) - flagged to HIM.

A/P: 58M w/ CP, likely demand ischemia i/s/o med noncompliance. Will discuss
code status with pt and Linda. F/u cardiology (Dr. Maria Garcia, pager 12345)
after d/c. Tentative d/c 4/1, ride being arranged. Pt verbalized understanding,
quote: "just don't call me at work, my boss doesn't know."

GLiNER (nvidia/gliner-PII)

15 spans at confidence_threshold = 0.4

start	end	label	score	text
63	68	PERSON	0.859	`Bobby`
143	146	DATE	0.990	`3/2`
195	200	PERSON	0.990	`Linda`
224	236	PHONE	0.782	`617-555-0148`
381	387	LOCATION	0.942	`Austin`
640	650	LOCATION	0.987	`St. Mary's`
723	730	DATE	0.959	`4/23/65`
755	765	DATE	0.974	`04/23/1955`
808	816	MRN	0.990	`12345678`
964	977	INSURANCE_ID	1.000	`BCXY 99-88-77`
1147	1167	EMAIL	1.000	`jsmith@hosp-derm.org`
1334	1345	SSN	0.963	`xxx-xx-4321`
1467	1472	PERSON	0.992	`Linda`
1494	1506	PERSON	0.637	`Maria Garcia`
1546	1549	DATE	0.789	`4/1`
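
For illustration, filtering spans by confidence threshold and entity type can be sketched as follows. The `Span` type and `filter_spans` function are hypothetical names. Only the threshold idea and the sample rows come from the table above.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class Span:
    start: int
    end: int
    label: str
    score: float
    text: str


def filter_spans(
    spans: Iterable[Span],
    threshold: float = 0.4,
    entity_types: Optional[Iterable[str]] = None,
) -> List[Span]:
    """Keep spans scoring at least `threshold`, optionally restricted to some labels."""
    allowed = set(entity_types) if entity_types is not None else None
    return [
        s for s in spans
        if s.score >= threshold and (allowed is None or s.label in allowed)
    ]


# Three rows copied from the GLiNER output above.
spans = [
    Span(63, 68, "PERSON", 0.859, "Bobby"),
    Span(224, 236, "PHONE", 0.782, "617-555-0148"),
    Span(1494, 1506, "PERSON", 0.637, "Maria Garcia"),
]
high_confidence = filter_spans(spans, threshold=0.7)
people_only = filter_spans(spans, entity_types=["PERSON"])
```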

openai/privacy-filter

78 spans at confidence_threshold = 0.4

start	end	label	score	text
63	64	B-private_person	0.999	`B`
64	68	E-private_person	0.999	`obby`
176	179	B-private_person	1.000	`Dr`
179	180	I-private_person	1.000	`.`
180	187	E-private_person	1.000	`Garcia`
195	200	S-private_person	0.999	`Linda`
224	227	B-private_phone	1.000	`617`
227	228	I-private_phone	1.000	`-`
228	231	I-private_phone	1.000	`555`
231	232	I-private_phone	1.000	`-`
232	235	I-private_phone	1.000	`014`
235	236	E-private_phone	1.000	`8`
256	260	B-private_address	0.999	`Lin`
260	264	I-private_address	0.997	`wood`
264	265	I-private_address	0.999
265	268	E-private_address	0.995	`Ave`
618	621	B-private_person	1.000	`Dr`
621	622	I-private_person	1.000	`.`
622	624	I-private_person	1.000	`R`
624	625	I-private_person	1.000	`.`
625	629	E-private_person	1.000	`Lee`
723	724	B-private_date	0.928	`4`
724	725	I-private_date	0.852	`/`
725	727	I-private_date	0.859	`23`
727	728	I-private_date	0.817	`/`
728	730	E-private_date	0.787	`65`
755	757	B-private_date	0.784	`04`
757	758	I-private_date	0.775	`/`
758	760	I-private_date	0.728	`23`
760	761	I-private_date	0.662	`/`
761	764	I-private_date	0.633	`195`
764	765	E-private_date	0.620	`5`
808	811	B-account_number	0.998	`123`
811	814	I-account_number	0.995	`456`
814	816	E-account_number	0.995	`78`
964	966	B-account_number	1.000	`BC`
966	968	I-account_number	0.999	`XY`
968	969	I-account_number	1.000
969	971	I-account_number	0.999	`99`
971	972	I-account_number	0.999	`-`
972	974	I-account_number	0.999	`88`
974	975	I-account_number	0.997	`-`
975	977	E-account_number	0.999	`77`
1010	1014	S-private_person	0.494	`Jan`
1040	1043	B-private_phone	0.954	`555`
1043	1044	I-private_phone	0.894
1044	1047	I-private_phone	0.880	`867`
1047	1048	I-private_phone	0.929
1048	1051	I-private_phone	0.971	`530`
1051	1052	E-private_phone	0.947	`9`
1063	1066	B-account_number	0.992	`AB`
1066	1069	I-account_number	0.998	`123`
1069	1072	I-account_number	0.996	`456`
1072	1073	E-account_number	0.998	`7`
1147	1149	B-private_email	1.000	`js`
1149	1153	I-private_email	1.000	`mith`
1153	1154	I-private_email	1.000	`@`
1154	1155	I-private_email	1.000	`h`
1155	1158	I-private_email	1.000	`osp`
1158	1159	I-private_email	1.000	`-`
1159	1162	I-private_email	1.000	`der`
1162	1163	I-private_email	1.000	`m`
1163	1167	E-private_email	1.000	`.org`
1334	1337	B-account_number	0.977	`xxx`
1337	1338	I-account_number	0.991	`-`
1338	1340	I-account_number	0.950	`xx`
1340	1341	I-account_number	0.892	`-`
1341	1344	I-account_number	0.750	`432`
1344	1345	E-account_number	0.851	`1`
1466	1472	S-private_person	0.561	`Linda`
1490	1492	B-private_person	1.000	`Dr`
1492	1493	I-private_person	1.000	`.`
1493	1499	I-private_person	1.000	`Maria`
1499	1506	E-private_person	1.000	`Garcia`
1507	1513	B-account_number	0.993	`pager`
1513	1514	I-account_number	0.978
1514	1517	I-account_number	0.945	`123`
1517	1519	E-account_number	0.978	`45`
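
The rows above are token-level BIOES tags (B = begin, I = inside, E = end, S = single). A downstream consumer will usually want entity-level spans instead. The sketch below shows one way to merge them. `merge_bioes` is a hypothetical helper, and taking the minimum token score as the entity score is an assumption, not something the PR specifies.

```python
from typing import List, Tuple

Token = Tuple[int, int, str, float]   # start, end, BIOES tag, score
Entity = Tuple[int, int, str, float]  # start, end, label, min token score


def merge_bioes(tokens: List[Token]) -> List[Entity]:
    """Merge BIOES-tagged tokens into entity-level spans."""
    entities: List[Entity] = []
    current = None  # [start, end, label, score] of the entity being built
    for start, end, tag, score in tokens:
        prefix, _, label = tag.partition("-")
        if prefix == "S":
            entities.append((start, end, label, score))
            current = None
        elif prefix == "B":
            current = [start, end, label, score]
        elif prefix in ("I", "E") and current is not None and current[2] == label:
            current[1] = end
            current[3] = min(current[3], score)
            if prefix == "E":
                entities.append((current[0], current[1], current[2], current[3]))
                current = None
        else:
            current = None  # malformed sequence: drop the partial entity
    return entities


# Rows copied from the table above: "Bobby", "Linda", and the first phone number.
tokens = [
    (63, 64, "B-private_person", 0.999),
    (64, 68, "E-private_person", 0.999),
    (195, 200, "S-private_person", 0.999),
    (224, 227, "B-private_phone", 1.000),
    (227, 228, "I-private_phone", 1.000),
    (228, 231, "I-private_phone", 1.000),
    (231, 232, "I-private_phone", 1.000),
    (232, 235, "I-private_phone", 1.000),
    (235, 236, "E-private_phone", 1.000),
]
merged = merge_bioes(tokens)
```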

Presidio + custom clinical recognizers

26 spans at confidence_threshold = 0.4

start	end	label	score	text
63	68	PERSON	0.850	`Bobby`
103	110	LOCATION	0.850	`Tower 3`
181	187	PERSON	0.850	`Garcia`
195	200	PERSON	0.850	`Linda`
224	236	PHONE_NUMBER	0.750	`617-555-0148`
381	387	LOCATION	0.850	`Austin`
480	487	DATE_TIME	0.850	`wks ago`
623	631	PERSON	0.850	`R. Lee's`
640	650	PERSON	0.850	`St. Mary's`
723	730	DATE_TIME	0.600	`4/23/65`
755	765	DATE_TIME	0.600	`04/23/1955`
755	765	HOSPITAL_DATE	0.600	`04/23/1955`
808	816	MRN	0.750	`12345678`
860	869	DATE_TIME	0.850	`1245-6788`
1011	1015	DATE_TIME	0.850	`Jan.`
1040	1052	PHONE_NUMBER	0.400	`555 867 5309`
1064	1073	INSURANCE_ID	1.000	`AB1234567`
1064	1073	US_DRIVER_LICENSE	0.650	`AB1234567`
1147	1167	EMAIL_ADDRESS	1.000	`jsmith@hosp-derm.org`
1154	1167	URL	0.500	`hosp-derm.org`
1168	1177	DATE_TIME	0.850	`last week`
1274	1276	LOCATION	0.850	`VA`
1467	1472	PERSON	0.850	`Linda`
1494	1506	PERSON	0.850	`Maria Garcia`
1514	1519	DATE_TIME	0.850	`12345`
1529	1544	PERSON	0.850	`c. Tentative d/`
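
Several rows above overlap, for example INSURANCE_ID and US_DRIVER_LICENSE on `AB1234567`, or EMAIL_ADDRESS and URL on the email domain. One possible post-processing step, sketched here as an assumption rather than as the PR's behavior, keeps the highest-scoring span in each overlapping group. `resolve_overlaps` is a hypothetical helper.

```python
from typing import List, Tuple

Span = Tuple[int, int, str, float]  # start, end, label, score


def resolve_overlaps(spans: List[Span]) -> List[Span]:
    """Greedily keep the best span among overlapping ones.

    Spans are ranked by score (descending), then by length (descending);
    a span is kept only if it does not overlap any span already kept.
    """
    ranked = sorted(spans, key=lambda s: (-s[3], -(s[1] - s[0]), s[0]))
    kept: List[Span] = []
    for span in ranked:
        if all(span[1] <= k[0] or span[0] >= k[1] for k in kept):
            kept.append(span)
    return sorted(kept, key=lambda s: s[0])


# Rows copied from the Presidio output above.
spans = [
    (1064, 1073, "INSURANCE_ID", 1.000),
    (1064, 1073, "US_DRIVER_LICENSE", 0.650),
    (1147, 1167, "EMAIL_ADDRESS", 1.000),
    (1154, 1167, "URL", 0.500),
]
resolved = resolve_overlaps(spans)
```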

LLM Qwen/Qwen2.5-7B-Instruct via DSPy

21 spans at confidence_threshold = 0.4

start	end	label	score	text
63	68	PERSON	0.800	`Bobby`
143	146	DATE	0.800	`3/2`
177	187	PERSON	0.800	`Dr. Garcia`
195	200	PERSON	0.800	`Linda`
224	236	PHONE	0.950	`617-555-0148`
466	476	MEDICATION	0.800	`metoprolol`
478	483	DURATION	0.800	`2 wks`
619	629	PERSON	0.800	`Dr. R. Lee`
640	650	LOCATION	0.800	`St. Mary's`
664	672	LOCATION	0.800	`123 Main`
808	816	MRN	0.950	`12345678`
860	869	MRN	0.950	`1245-6788`
964	977	INSURANCE_ID	0.950	`BCXY 99-88-77`
1040	1052	PHONE	0.950	`555 867 5309`
1147	1167	EMAIL	0.950	`jsmith@hosp-derm.org`
1274	1276	LOCATION	0.800	`VA`
1334	1345	SSN	0.950	`xxx-xx-4321`
1490	1506	PERSON	0.800	`Dr. Maria Garcia`
1508	1519	PHONE	0.800	`pager 12345`
1546	1549	DATE	0.800	`4/1`
1631	1635	LOCATION	0.800	`work`
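
An LLM typically returns text fragments rather than character offsets, so offsets have to be recovered by searching the input. A fragment such as `Linda` occurs more than once in the note. The sketch below finds every occurrence. `locate_fragment` is a hypothetical helper and is not necessarily how the PR's LLM engine maps fragments to offsets.

```python
import re
from typing import List, Tuple


def locate_fragment(text: str, fragment: str) -> List[Tuple[int, int]]:
    """Return (start, end) offsets of every non-overlapping occurrence of `fragment`."""
    if not fragment:
        return []
    # re.escape makes characters such as "." in "Dr. R. Lee" match literally.
    return [m.span() for m in re.finditer(re.escape(fragment), text)]


note = "Wife (Linda) was at bedside. Will discuss code status with pt and Linda."
occurrences = locate_fragment(note, "Linda")
```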

Copilot

Pull request overview

This PR introduces a PII detection toolkit under mmore.privacy.detection to be later consumed as agent tools by the upcoming multi-agent privacy system. It adds four interchangeable detection engines (GLiNER, OpenAI privacy-filter, Presidio with custom clinical recognizers, and a DSPy-driven LLM engine), all sharing a common DetectionConfig, DetectionEngine interface, and PIISpan output type, with module-level caches for the underlying models/pipelines. Each engine self-registers a default tool function in the global tool_registry so agents can resolve them by name.

Changes:

- Adds DetectionConfig, DetectionEngine/PIISpan base, and four engine implementations with lazy + thread-safe model/pipeline caching, plus dspy_llm.build_dspy_lm (with a LocalHFLM for local HF chat models).
- Wires each engine to the agent tool registry via @register_tool and exposes them through mmore.privacy.detection.__init__.
- Declares new optional privacy extras (gliner, presidio, spacy, dspy) and a separate privacy-openai-filter extra (transformers>=5, peft), with conflict declarations against process/all; adds mock-based unit tests covering all four engines.
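
The shared interface and the self-registration pattern summarized above might look roughly like the following sketch. The names `PIISpan`, `DetectionConfig`, `DetectionEngine`, `tool_registry`, and `register_tool` come from the overview. Their fields and signatures here are assumptions, and `EmailRegexEngine` is a toy stand-in for the real engines.

```python
import re
from abc import ABC, abstractmethod
from dataclasses import asdict, dataclass
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class PIISpan:
    # Assumed fields, mirroring the columns of the demo tables.
    start: int
    end: int
    label: str
    score: float
    text: str


@dataclass
class DetectionConfig:
    # Assumed fields; the real schema may differ.
    confidence_threshold: float = 0.4
    entity_types: Optional[List[str]] = None


class DetectionEngine(ABC):
    def __init__(self, config: DetectionConfig) -> None:
        self.config = config

    @abstractmethod
    def detect(self, text: str) -> List[PIISpan]:
        """Return the PII spans found in `text`."""


# Assumed shape of the registry: a plain name -> callable mapping.
tool_registry: Dict[str, Callable[[str], List[dict]]] = {}


def register_tool(name: str):
    def decorator(fn):
        tool_registry[name] = fn
        return fn
    return decorator


class EmailRegexEngine(DetectionEngine):
    """Toy engine used only to make the sketch runnable."""

    PATTERN = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")

    def detect(self, text: str) -> List[PIISpan]:
        return [
            PIISpan(m.start(), m.end(), "EMAIL", 1.0, m.group())
            for m in self.PATTERN.finditer(text)
        ]


@register_tool("detect_pii_email_regex")
def detect_pii_email_regex(text: str) -> List[dict]:
    engine = EmailRegexEngine(DetectionConfig())
    return [asdict(span) for span in engine.detect(text)]
```

An agent can then resolve the tool by name, for example `tool_registry["detect_pii_email_regex"]("mail jsmith@hosp-derm.org now")`.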

Reviewed changes

Copilot reviewed 11 out of 12 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
pyproject.toml	Adds new dependencies to `privacy` extra and a new `privacy-openai-filter` extra with conflict declarations against `process`/`all`.
src/mmore/privacy/detection/__init__.py	Public re-exports of engines, config, base types, and registered tool callables.
src/mmore/privacy/detection/base.py	Defines `PIISpan` dataclass and abstract `DetectionEngine` interface.
src/mmore/privacy/detection/config.py	`DetectionConfig` dataclass schema for the YAML privacy.detection block.
src/mmore/privacy/detection/defaults.py	Shared default thresholds, labels, model names, clinical regex patterns, and default LLMConfig.
src/mmore/privacy/detection/gliner_engine.py	GLiNER engine with thread-safe model cache and `detect_pii_gliner` tool.
src/mmore/privacy/detection/openai_filter_engine.py	HF token-classification engine over `openai/privacy-filter` with pipeline cache and `detect_pii_openai` tool.
src/mmore/privacy/detection/presidio_engine.py	Presidio engine extended with custom clinical recognizers (MRN, HOSPITAL_DATE, INSURANCE_ID) and `detect_pii_presidio` tool.
src/mmore/privacy/detection/llm_engine.py	DSPy-based LLM engine with typed signature, demo examples, error-tolerant span post-processing, and `detect_pii_llm` tool.
src/mmore/privacy/dspy_llm.py	`build_dspy_lm` factory and `LocalHFLM` (dspy.BaseLM) wrapper around a cached transformers chat pipeline.
tests/test_detection.py	Mock-based unit tests covering config loading, tool registration, engine behavior, caching, and threshold/entity filtering across all four engines.

Copilot

Pull request overview

Copilot reviewed 11 out of 12 changed files in this pull request and generated 4 comments.

fabnemEPFL · 2026-06-08T18:02:07Z

+class DetectionConfig:
+    """Schema for the ``privacy.detection`` block of a YAML config."""
+
+    engine: str


fabnemEPFL · Jun 8, 2026

The field engine seems to be used only in test assertions, so it sounds useless

fabnemEPFL · Jun 8, 2026

unless it's meant to be saved

JCHAVEROT · Jun 10, 2026

Yes, it is not used yet in this PR (it most likely will be in the PR where we wire the privacy layer into mmore's RAG pipeline), but I think we want to keep it: later, this parameter will let the user choose a specific detection engine (instead of falling back to the default one or having the Analyzer agent infer one for the task)

fabnemEPFL · Jun 11, 2026

then it could make sense to have an enum with the supported detection engines

JCHAVEROT · Jun 11, 2026

Done in 8f16202
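
The enum suggested in this thread could be sketched as follows. The member names, their string values, and the default are assumptions. The actual change is in commit 8f16202.

```python
from dataclasses import dataclass
from enum import Enum


class DetectionEngineType(str, Enum):
    # Hypothetical members, one per engine listed in the PR description.
    GLINER = "gliner"
    OPENAI_FILTER = "openai_filter"
    PRESIDIO = "presidio"
    LLM = "llm"


@dataclass
class DetectionConfig:
    """Schema for the ``privacy.detection`` block of a YAML config."""

    engine: DetectionEngineType = DetectionEngineType.GLINER

    def __post_init__(self) -> None:
        # Coerce raw YAML strings; unsupported names raise ValueError.
        self.engine = DetectionEngineType(self.engine)
```

With this shape, a misspelled engine name fails when the config is loaded rather than when the engine is first used.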

JCHAVEROT added 10 commits May 5, 2026 20:48

feat: add dependencies and skeletton for detection tools

794e23e

feat: add gliner detection tool

d8503b1

feat: add openai-filter detection tool

11205df

feat: add presidio detection tool

ec9126f

feat: add first version of LLM with dspy as detection tool

9797115

chores: update imported functions

8be4dc8

Merge remote-tracking branch 'me/feat/agent-framework-integration' into feat/detection-toolkit

58b8572

Merge branch 'feat/agent-framework-integration', remote-tracking branch 'me' into feat/detection-toolkit

66b0c77

Merge remote-tracking branch 'origin/master' into feat/agent-framework-integration

f1060f9

Merge branch 'feat/agent-framework-integration' into feat/detection-toolkit

5814294

JCHAVEROT force-pushed the feat/agent-framework-integration branch 2 times, most recently from be1f766 to 4374156 May 13, 2026 10:07

JCHAVEROT added 2 commits May 13, 2026 17:16

Merge remote-tracking branch 'me/feat/agent-framework-integration' into feat/detection-toolkit

d175fd4

refactor: create a file gathering default configurations

6a91ebb

JCHAVEROT self-assigned this May 15, 2026

JCHAVEROT added the enhancement New feature or request label May 15, 2026

JCHAVEROT added 3 commits May 15, 2026 18:45

tests: add unit tests on detection engines to be later replaced

3eae01f

refactor: extract DSPy llm so that it can be reused by sanitizer later

df24d81

refactor: extract prompts to clean llm_engine.py

825ff24

JCHAVEROT force-pushed the feat/detection-toolkit branch from 0968f9d to 825ff24 May 15, 2026 16:46

fabnemEPFL requested a review from Copilot May 15, 2026 17:20

Copilot started reviewing on behalf of fabnemEPFL May 15, 2026 17:20

Copilot AI reviewed May 15, 2026

Comment thread src/mmore/privacy/detection/gliner_engine.py Outdated

Comment thread src/mmore/privacy/detection/llm_engine.py Outdated

Comment thread src/mmore/privacy/detection/openai_filter_engine.py Outdated

JCHAVEROT added 3 commits May 18, 2026 12:59

review: apply changes

53e384b

Merge branch 'feat/agent-framework-integration' into feat/detection-toolkit

a050c18

Merge remote-tracking branch 'me/feat/agent-framework-integration' into feat/detection-toolkit

fd784c6

fabnemEPFL requested a review from Copilot May 27, 2026 09:55

Copilot started reviewing on behalf of fabnemEPFL May 27, 2026 09:55

Copilot AI reviewed May 27, 2026

Comment thread src/mmore/privacy/detection/llm_engine.py Outdated

Comment thread src/mmore/privacy/detection/presidio_engine.py Outdated

Comment thread src/mmore/privacy/detection/openai_filter_engine.py Outdated

Comment thread src/mmore/privacy/detection/__init__.py

Merge remote-tracking branch 'me/feat/agent-framework-integration' into feat/detection-toolkit

39d6a58

JCHAVEROT added 5 commits May 27, 2026 12:58

review: fix llm_engine in case the same fragment appear multiple times

ec8001e

review: fix openai_filter_engine as passed entity_types list should always be ignored

46bc5ab

tests: update dict keys

fd49131

feat: download spacy language model when missing

4c34eab

Merge remote-tracking branch 'me/feat/agent-framework-integration' into feat/detection-toolkit

07848bb

fabnemEPFL requested changes Jun 8, 2026

JCHAVEROT force-pushed the feat/agent-framework-integration branch from 7794813 to b42f4c1 June 9, 2026 09:32

JCHAVEROT added 6 commits June 9, 2026 18:54

Merge remote-tracking branch 'me/feat/agent-framework-integration' into feat/detection-toolkit

f2ae78f

review: fix types and other small changes

1d9c58e

fix: add device mapping for gliner engine

9e0b74d

fix: corrections in tests, dependencies, and fix runtime errors

e5723a5

feat: add a model registry to have efficient caching in privacy pipeline

52f4e8c

[Multi-Agent Privacy] Agent framework integration (swiss-ai#285)

a9b21bc

JCHAVEROT deleted the branch feat/agent-framework-integration June 11, 2026 09:18

JCHAVEROT closed this Jun 11, 2026

JCHAVEROT reopened this Jun 11, 2026

JCHAVEROT and others added 4 commits June 11, 2026 15:31

review: create an enum for the detection engine type

8f16202

Various typing fixes (EPFLiGHT#320)

570b013

Co-authored-by: Jérémy Chaverot <jeremy.chaverot@epfl.ch>

Bump version from 1.2.3 to 1.2.4

e3890d1

Fix torch import crash in Docker images (EPFLiGHT#322)

6281137

JCHAVEROT mentioned this pull request Jun 15, 2026

[Multi-Agent Privacy] Add privacy agents: Analyzer, Detector, Sanitizer #2

Open

JCHAVEROT added 3 commits June 20, 2026 14:14

RAG CLI: fix empty answers and improve UX (EPFLiGHT#323)

03f3426

Merge remote-tracking branch 'origin/master' into v2

88ee637

Merge remote-tracking branch 'origin/v2' into feat/detection-toolkit

42dbcdf
