Skip to content

Slim NLP Engine#1916

Open
SharonHart wants to merge 6 commits intomainfrom
shhart/test-slim-nlp
Open

Slim NLP Engine#1916
SharonHart wants to merge 6 commits intomainfrom
shhart/test-slim-nlp

Conversation

@SharonHart
Copy link
Copy Markdown
Contributor

@SharonHart SharonHart commented Mar 22, 2026

Add Slim NLP Engine with GLiNER as default recognizer
Introduces SlimSpacyNlpEngine — a lightweight NLP engine that provides only tokenization and lemmatization, with NER disabled. Entity extraction is delegated to self-contained recognizers like GLiNERRecognizer instead of the shared NLP pipeline.

What changed:

New SlimSpacyNlpEngine class with auto-download support for 18 language models
New slim.yaml / slim_nlp.yaml configs using GLiNER instead of SpacyRecognizer
Default configuration updated to use the slim engine
Original spaCy/Stanza/Transformers engines remain fully available

@github-actions
Copy link
Copy Markdown

Coverage report (presidio-anonymizer)

This PR does not seem to contain any modification to coverable code.

@github-actions
Copy link
Copy Markdown

Coverage report (presidio-structured)

This PR does not seem to contain any modification to coverable code.

@github-actions
Copy link
Copy Markdown

Coverage report (presidio-cli)

This PR does not seem to contain any modification to coverable code.

@SharonHart SharonHart marked this pull request as ready for review March 24, 2026 11:12
@SharonHart SharonHart changed the title test slim nlp Slim NLP Engine Mar 24, 2026
@github-actions
Copy link
Copy Markdown

Coverage report (presidio-image-redactor)

This PR does not seem to contain any modification to coverable code.

@github-actions
Copy link
Copy Markdown

Coverage report (presidio-analyzer)

Click to see where and how coverage changed

FileStatementsMissingCoverageCoverage
(new stmts)
Lines missing
  presidio-analyzer/presidio_analyzer/llm_utils
  config_loader.py
  presidio-analyzer/presidio_analyzer/nlp_engine
  __init__.py
  nlp_engine_provider.py
  slim_spacy_nlp_engine.py 161-163, 212
Project Total  

This report was generated by python-coverage-comment-action

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants