minor bug fixes #163

omri374 · 2025-12-07T20:16:49Z

This pull request updates the evaluation logic for span matching, focusing on improving the accuracy of character-based IoU calculations and adjusting the threshold for matches. It also adds new tests to ensure correct behavior, especially for split entities. The most important changes are grouped below:

Evaluation Logic Improvements

Changed the default iou_threshold in SpanEvaluator from 0.9 to 0.75 to allow more flexibility in span matching.
Fixed an off-by-one error in character span calculation within _calculate_combined_iou by removing the unnecessary + 1 in the range calls for both annotation and prediction spans. This ensures that character indices are properly exclusive at the end, improving the accuracy of IoU calculations. [1] [2]

Testing Enhancements

Added two new tests in test_span_evaluator.py:
- test_calculate_combined_iou_char_based_split_entity verifies that split predictions covering an annotation result in an IoU of 1.0, confirming the off-by-one bug is fixed.
- test_calculate_combined_iou_char_based_exact_match checks that exact matches yield an IoU of 1.0, ensuring no regression in the basic matching logic.

Copilot

Pull request overview

This pull request addresses character-based IoU calculation issues in span matching evaluation by fixing an off-by-one error and adjusting the matching threshold for more flexible span matching.

Fixed off-by-one error in character span calculation by removing unnecessary + 1 in range calls for both annotation and prediction end indices
Reduced the default IoU threshold from 0.9 to 0.75 to allow more flexibility in span matching
Added comprehensive tests to verify the fix for split entity scenarios and exact matches

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

File	Description
presidio_evaluator/evaluation/span_evaluator.py	Updated IoU threshold default to 0.75 and fixed off-by-one error in character-based IoU calculations by making end indices properly exclusive
tests/test_span_evaluator.py	Added two new test cases to verify character-based IoU calculations work correctly for split entities and exact matches
presidio_evaluator/evaluation/scorers.py	Removed obsolete scoring pipeline code that is no longer needed

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-12-07T20:20:22Z

presidio_evaluator/evaluation/span_evaluator.py

        generic_entities: Optional[List[str]] = None,
        skip_words: Optional[List] = None,
-        iou_threshold: float = 0.9,
+        iou_threshold: float = 0.75,


The documentation comment at line 35 states the default iou_threshold is 0.5, but the actual default has been changed to 0.75. The documentation should be updated to reflect this change:

:param iou_threshold: Minimum Intersection over Union (IoU) threshold for considering spans as matching. Value between 0 and 1, where higher values require more overlap (default: 0.75)

tests/test_span_evaluator.py

minor bug fixes and obsolete code removal

c6a5827

omri374 requested review from Copilot and negruber1 December 7, 2025 20:17

Copilot started reviewing on behalf of omri374 December 7, 2025 20:17 View session

Copilot AI reviewed Dec 7, 2025

View reviewed changes

omri374 removed the request for review from negruber1 December 7, 2025 21:30

omri374 added 4 commits December 10, 2025 15:43

additional bug fixes

a806ae4

minor updates to schema and entities

b06247f

changed test to SpanEvaluator instead of TokenEvaluator

83963a8

changed test to SpanEvaluator instead of TokenEvaluator

9418956

omri374 changed the title ~~minor bug fixes and obsolete code removal~~ minor bug fixes Dec 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

minor bug fixes #163

minor bug fixes #163

Uh oh!

omri374 commented Dec 7, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 7, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

minor bug fixes #163

Are you sure you want to change the base?

minor bug fixes #163

Uh oh!

Conversation

omri374 commented Dec 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Evaluation Logic Improvements

Testing Enhancements

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Dec 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

omri374 commented Dec 7, 2025 •

edited

Loading