feat(EstimatorReport): Show permutation at different stages of a pipeline #1988

auguste-probabl · 2025-08-28T09:26:03Z

Added support for calculating permutation importance at different stages of a pipeline.
One can choose to compute feature importance either at the start of the pipeline or at the end.
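
To illustrate the idea outside of skore, here is a minimal sketch using only scikit-learn. The helper `permutation_at_step` is hypothetical and is not the implementation in this PR; its `at_step` argument merely mirrors the parameter name used in the commits. With `at_step=0` the raw input features are permuted and scored through the whole pipeline; with `at_step=-1` the features seen by the final estimator are permuted instead.

```python
from sklearn.datasets import make_regression
from sklearn.inspection import permutation_importance
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler

X, y = make_regression(n_samples=200, n_features=3, noise=0.1, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

pipeline = make_pipeline(
    StandardScaler(),
    PolynomialFeatures(degree=2, include_bias=False),
    Ridge(),
).fit(X_train, y_train)


def permutation_at_step(pipeline, X, y, at_step=0, **kwargs):
    """Hypothetical helper: permute the features entering step `at_step`."""
    if at_step == 0:
        # Permute the raw inputs and score the full pipeline.
        return permutation_importance(pipeline, X, y, **kwargs)
    # Transform X with the steps before `at_step`, then permute the
    # transformed features and score the remaining steps only.
    X_transformed = pipeline[:at_step].transform(X)
    return permutation_importance(pipeline[at_step:], X_transformed, y, **kwargs)


start = permutation_at_step(
    pipeline, X_test, y_test, at_step=0, n_repeats=5, random_state=0
)
end = permutation_at_step(
    pipeline, X_test, y_test, at_step=-1, n_repeats=5, random_state=0
)

# 3 raw features at the start; 9 polynomial features at the end.
print(start.importances_mean.shape, end.importances_mean.shape)
```

The two results are not directly comparable feature by feature: the second one is expressed in terms of the engineered features that the final estimator receives.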

Closes #1398

Supersedes #1888

Co-authored-by: waridrox [email protected]

github-actions · 2025-08-28T09:37:01Z

Coverage Report for skore/

File	Stmts	Miss	Cover	Missing
skore/src/skore
__init__.py	23	0	100%
_config.py	31	0	100%
exceptions.py	4	4	0%	4, 15, 19, 23
skore/src/skore/_sklearn
__init__.py	6	0	100%
_base.py	198	14	92%	45, 58, 127, 130, 183, 186–187, 189–192, 225, 228–229
find_ml_task.py	61	0	100%
types.py	27	1	96%	28
skore/src/skore/_sklearn/_comparison
__init__.py	7	0	100%
feature_importance_accessor.py	35	2	94%	88, 107
metrics_accessor.py	178	3	98%	173, 253, 1215
report.py	107	0	100%
utils.py	54	0	100%
skore/src/skore/_sklearn/_cross_validation
__init__.py	9	0	100%
data_accessor.py	45	3	93%	134, 137, 140
feature_importance_accessor.py	24	0	100%
metrics_accessor.py	182	1	99%	244
report.py	135	1	99%	487
skore/src/skore/_sklearn/_estimator
__init__.py	9	0	100%
data_accessor.py	66	1	98%	82
feature_importance_accessor.py	168	2	98%	251–252
metrics_accessor.py	356	8	97%	200, 202, 209, 300, 369, 373, 388, 423
report.py	165	2	98%	448–449
skore/src/skore/_sklearn/_plot
__init__.py	3	0	100%
base.py	98	6	93%	61–62, 224–226, 230
utils.py	77	0	100%
skore/src/skore/_sklearn/_plot/data
__init__.py	2	0	100%
table_report.py	185	1	99%	706
skore/src/skore/_sklearn/_plot/metrics
__init__.py	6	0	100%
confusion_matrix.py	70	4	94%	92, 100, 122, 230
feature_importance_display.py	67	21	68%	88, 121–122, 124, 142–146, 148–155, 158–160, 162
metrics_summary_display.py	8	0	100%
precision_recall_curve.py	281	5	98%	455, 555, 559, 619, 751
prediction_error.py	227	5	97%	179, 186, 422, 505, 705
roc_curve.py	294	8	97%	387, 510, 515, 616, 621, 625, 694, 834
skore/src/skore/_sklearn/train_test_split
__init__.py	0	0	100%
train_test_split.py	58	0	100%
skore/src/skore/_sklearn/train_test_split/warning
__init__.py	8	0	100%
high_class_imbalance_too_few_examples_warning.py	19	1	94%	83
high_class_imbalance_warning.py	20	0	100%
random_state_unset_warning.py	10	0	100%
shuffle_true_warning.py	9	0	100%
stratify_is_set_warning.py	10	0	100%
time_based_column_warning.py	21	0	100%
train_test_split_warning.py	3	0	100%
skore/src/skore/_utils
__init__.py	6	2	66%	8, 13
_accessor.py	90	3	96%	34, 146, 190
_environment.py	27	1	96%	40
_fixes.py	8	0	100%
_index.py	5	0	100%
_logger.py	22	4	81%	15–17, 19
_measure_time.py	10	0	100%
_parallel.py	38	3	92%	23, 33, 124
_patch.py	13	5	61%	21, 23–24, 35, 37
_progress_bar.py	46	0	100%
_repr_html.py	8	0	100%
_show_versions.py	38	0	100%
_testing.py	55	0	100%
skore/src/skore/project
__init__.py	2	0	100%
project.py	48	0	100%
summary.py	75	1	98%	120
widget.py	187	0	100%
TOTAL	4044	112	97%

Tests	Skipped	Failures	Errors	Time
1101	5 💤	0 ❌	0 🔥	4m 21s ⏱️

github-actions · 2025-08-28T14:20:21Z

Documentation preview @ 2cbc1f1

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py

glemaitre · 2025-08-30T12:25:11Z

Since we still don't have the permutation importance across the different reporters, there is no documentation to change in the user guide.

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py

glemaitre

It looks good. I would only suggest amending the example called plot_feature_importance.py, where we can demonstrate the feature. We have a Ridge model with some feature importance, and we can show that it can be computed with index 0 and -1.

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py

skore/tests/unit/reports/estimator/feature_importance/test_permutation_importance.py

auguste-probabl · 2025-10-15T14:57:52Z

> amend the example called plot_feature_importance.py where we can demonstrate the feature. We have a Ridge model with some feature importance and we can show that we can compute with index 0 and -1.

~~TODO: I found a bug while doing this, at a certain step the permutation computation fails because it doesn't like sparse matrices.~~ Fixed.

Now I need to figure out how to showcase the new feature in the example, hopefully in a way that doesn't disrupt the flow.
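
For context on the sparse-matrix issue mentioned above, the following sketch shows one possible workaround; it is an assumption for illustration and not necessarily the fix applied in this PR. A `OneHotEncoder` step produces a sparse matrix by default, so the intermediate data is converted to a dense array before it is passed to scikit-learn's `permutation_importance`.

```python
import numpy as np
from scipy import sparse
from sklearn.inspection import permutation_importance
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder

# Two categorical columns with three categories each, built deterministically.
idx = np.arange(60)
X = np.column_stack([idx % 3, (idx // 3) % 3])
rng = np.random.default_rng(0)
y = X[:, 0] + rng.normal(scale=0.1, size=60)

pipeline = make_pipeline(OneHotEncoder(), Ridge()).fit(X, y)

# Data as seen by the final estimator: sparse by default after encoding.
X_encoded = pipeline[:-1].transform(X)
was_sparse = sparse.issparse(X_encoded)
if was_sparse:
    # Assumed workaround: densify before permuting columns.
    X_encoded = X_encoded.toarray()

result = permutation_importance(
    pipeline[-1], X_encoded, y, n_repeats=5, random_state=0
)
print(was_sparse, result.importances_mean.shape)
```

Densifying is only reasonable when the encoded matrix fits in memory; for wide one-hot encodings another strategy would be needed.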

glemaitre

Here it comes ;). Sorry for the delay.

examples/use_cases/plot_feature_importance.py

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py

skore/tests/unit/reports/estimator/feature_importance/test_permutation_importance.py

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py

skore/tests/unit/reports/estimator/feature_importance/test_permutation_importance.py

glemaitre

With those changes, you are going to cover the two missing lines, and it is a possible pipeline that we should be supporting.

skore/tests/unit/reports/estimator/feature_importance/test_permutation_importance.py

examples/use_cases/plot_feature_importance.py

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py

auguste-probabl · 2025-10-30T10:47:21Z

Now it's line 632 of skore/skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py (at 612dee8),

`feature_names = estimator.feature_names_in_`

that is not covered

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py

glemaitre

Just the last comment. Otherwise, LGTM.

.pre-commit-config.yaml

Following #1988. Synchronize `pyproject.toml` and `.pre-commit-config.yaml` to let `mypy` work outside `pre-commit`.

github-actions bot assigned auguste-probabl Aug 28, 2025

auguste-probabl mentioned this pull request Aug 28, 2025

feat: Show permutation at different stages of a pipeline #1888

Closed

1 task

auguste-probabl force-pushed the push-szluysnvkxtt branch from d5ccc35 to cffb182 August 28, 2025 14:08

auguste-probabl requested a review from glemaitre August 28, 2025 14:51

glemaitre reviewed Aug 30, 2025

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py (outdated, resolved)

auguste-probabl force-pushed the push-szluysnvkxtt branch from cffb182 to f2381ec September 1, 2025 13:45

auguste-probabl requested a review from glemaitre September 1, 2025 15:18

jeremiedbb reviewed Sep 2, 2025

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py (resolved)

glemaitre reviewed Oct 9, 2025

auguste-probabl force-pushed the push-szluysnvkxtt branch from 37485da to 1d13bd8 October 16, 2025 15:46

glemaitre self-requested a review October 17, 2025 09:09

auguste-probabl force-pushed the push-szluysnvkxtt branch from 1d13bd8 to b90cbc4 October 17, 2025 15:55

glemaitre reviewed Oct 24, 2025

auguste-probabl and others added 13 commits October 29, 2025 16:05

add docs

22864ae

sphinx

fc9742e

rename stage to at_step

5696a4b

Make at_step 0 or -1

c4e9f63

Remove default argument from private method

e8f928c

To avoid copy-pasting all the time.

Clarify if-block

f594a2e

generalize at_step to a step index

b0d1cae

use n_features_in_

7421b69

remove redundant feature_names_source

da25d06

add constraint on at_step

9eaba4f

refine test descriptions

3fd7573

remove example

25067c9

auguste-probabl added 5 commits October 29, 2025 17:10

remove dead code

75fd69f

refactor: Put data in fixture

da3c847

test dataframes as well

1dbbc94

clean

a0c1f5e

mypy

d852a81

glemaitre self-requested a review October 30, 2025 09:24

glemaitre reviewed Oct 30, 2025

auguste-probabl added 4 commits October 30, 2025 11:33

Use non-sklearn regressor

228971f

also test at_step=0

085a41c

use skore train_test_split

8444567

reuse fixtures

612dee8

glemaitre reviewed Oct 30, 2025

examples/use_cases/plot_feature_importance.py (outdated, resolved)

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py (outdated, resolved)

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py (outdated, resolved)

auguste-probabl added 3 commits October 30, 2025 11:56

wrap

cb9ae1e

change error type

09d575e

refactor to _get_feature_names

ef9fbf7

glemaitre reviewed Oct 30, 2025

skore/src/skore/_sklearn/_estimator/feature_importance_accessor.py (outdated, resolved)

glemaitre approved these changes Oct 30, 2025

glemaitre previously approved these changes Oct 30, 2025

refactor

2cbc1f1

auguste-probabl dismissed glemaitre’s stale review via 2cbc1f1 October 31, 2025 11:01

glemaitre enabled auto-merge October 31, 2025 11:02

glemaitre approved these changes Oct 31, 2025

glemaitre added this pull request to the merge queue Oct 31, 2025

Merged via the queue into probabl-ai:main with commit 2cba17f Oct 31, 2025
32 checks passed

thomass-dev reviewed Oct 31, 2025

.pre-commit-config.yaml (resolved)

thomass-dev mentioned this pull request Oct 31, 2025

fix(skore): Add missing scipy-stubs to test dependencies #2119

Merged

github-merge-queue bot pushed a commit that referenced this pull request Oct 31, 2025

fix(skore): Add missing scipy-stubs to test dependencies (#2119)

da2bbe9

Following #1988. Synchronize `pyproject.toml` and `.pre-commit-config.yaml` to let `mypy` work outside `pre-commit`.

auguste-probabl deleted the push-szluysnvkxtt branch November 20, 2025 09:49
