[BUG] Fix Benchmarking duplicate estimator label collisions by ashum9 · Pull Request #299 · gc-os-ai/pyaptamer

ashum9 · 2026-04-05T20:17:19Z

Closes #297.

What this changes

disambiguates duplicate estimator class names in Benchmarking results with deterministic 1-based suffixes
preserves the original class name when there is only one estimator of that class
adds a regression test proving two DummyClassifier instances no longer overwrite one another
adds a regression test proving distinct estimator classes keep their original labels

Validation

pytest -q pyaptamer/benchmarking/tests/test_benchmarking_core.py

Copilot

Pull request overview

Fixes a Benchmarking result-collision bug where multiple estimators of the same class overwrote each other by generating deterministic, disambiguated estimator display names (with 1-based suffixes only when needed), and adds regression tests for both duplicate and unique estimator naming.

Changes:

Add _get_estimator_names() to generate stable estimator labels and suffix duplicates (e.g., DummyClassifier[1], DummyClassifier[2]).
Update Benchmarking.run() to key results by these stable labels instead of raw class names.
Add regression tests to ensure duplicate estimator classes stay distinct and unique classes keep original labels.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
`pyaptamer/benchmarking/_base.py`	Introduces stable estimator naming and uses it to prevent result-row overwrites for duplicate estimator classes.
`pyaptamer/benchmarking/tests/test_benchmarking_core.py`	Adds regression coverage for duplicate-name disambiguation and unique-name preservation.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

siddharth7113

left the comment to change the suffix structure, otherwise looks good.

siddharth7113 · 2026-04-07T15:05:59Z

+
+    summary = bench.run()
+
+    assert ("DummyClassifier[1]", "accuracy_score") in summary.index


this notation wouldn't be right, and could be confusing in downstream usage, user would have to make a call something like this df.loc["DummyClassifier[1]"] , I would suggest to index along with strategy something to make it either like DummyClassifer_1 or based on Pattern should be fine as well.

ashum9 · 2026-04-10T21:56:41Z

@siddharth7113 Thanks for the feedback. I updated the duplicate-estimator label format to use underscore suffixes instead of brackets.

Now duplicate estimators of the same class are labeled deterministically as DummyClassifier_1, DummyClassifier_2 (1-based, only when needed), while unique estimator classes keep their original class name.

Updated the regression tests accordingly; pytest -q pyaptamer/benchmarking/tests/test_benchmarking_core.py passes.

siddharth7113 · 2026-04-12T18:31:03Z

Please follow PR guidelines and make sure to install and run pre-commit , the current CI checks are failing.

siddharth7113 · 2026-04-20T13:01:33Z

cc @satvshr , for your feedback

satvshr · 2026-04-20T13:06:11Z

+    assert len(summary) == 2
+
+
+def test_benchmarking_preserves_unique_estimator_names():


Was this also a problem which you solved or are you adding a random test

Fix Benchmarking duplicate estimator labels

c2548c4

Copilot AI review requested due to automatic review settings April 5, 2026 20:17

Copilot started reviewing on behalf of ashum9 April 5, 2026 20:17 View session

Copilot AI reviewed Apr 5, 2026

View reviewed changes

Comment thread pyaptamer/benchmarking/_base.py Outdated

Comment thread pyaptamer/benchmarking/_base.py Outdated

MNT: tighten Benchmarking naming logic

b635024

ashum9 mentioned this pull request Apr 5, 2026

[BUG] Benchmarking collapses multiple estimators of the same class into one result row #297

Open

siddharth7113 marked this pull request as draft April 6, 2026 18:27

siddharth7113 requested changes Apr 7, 2026

View reviewed changes

MNT: use underscore suffix for duplicate estimator labels

45ff5b2

siddharth7113 marked this pull request as ready for review April 12, 2026 18:28

MNT: satisfy ruff pre-commit in Benchmarking

ea7cb09

siddharth7113 requested a review from satvshr April 20, 2026 13:00

satvshr reviewed Apr 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BUG] Fix Benchmarking duplicate estimator label collisions#299

[BUG] Fix Benchmarking duplicate estimator label collisions#299
ashum9 wants to merge 4 commits into
gc-os-ai:mainfrom
ashum9:codex/benchmarking-estimator-labels

ashum9 commented Apr 5, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

siddharth7113 left a comment

Uh oh!

siddharth7113 Apr 7, 2026

Uh oh!

ashum9 commented Apr 10, 2026

Uh oh!

siddharth7113 commented Apr 12, 2026

Uh oh!

siddharth7113 commented Apr 20, 2026

Uh oh!

satvshr Apr 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants


		summary = bench.run()

		assert ("DummyClassifier[1]", "accuracy_score") in summary.index

		assert len(summary) == 2


		def test_benchmarking_preserves_unique_estimator_names():

Uh oh!

Conversation

ashum9 commented Apr 5, 2026

What this changes

Validation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

siddharth7113 left a comment

Choose a reason for hiding this comment

Uh oh!

siddharth7113 Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

ashum9 commented Apr 10, 2026

Uh oh!

siddharth7113 commented Apr 12, 2026

Uh oh!

siddharth7113 commented Apr 20, 2026

Uh oh!

satvshr Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants