Prevent zero-variance instability in BaseProbaRegressor.predict_proba by kindler-king · Pull Request #956 · sktime/skpro

kindler-king · 2026-03-16T20:31:46Z

Reference Issues/PRs
Fixes #955

What does this implement/fix?

This PR fixes a numerical instability in BaseProbaRegressor.predict_proba.

When predict_var returns 0, the fallback Normal distribution is constructed with sigma=0, which leads to divide-by-zero warnings and NaN values when evaluating pdf or log_pdf.

To prevent this, the predicted variance is clipped to machine epsilon before computing the standard deviation:

pred_var = np.clip(pred_var, np.finfo(float).eps, None)
This ensures the resulting Normal distribution always has a strictly positive scale while leaving normal model outputs effectively unchanged.

Does your contribution introduce a new dependency?
No.

What should a reviewer concentrate their feedback on?

Whether clipping variance at machine epsilon is the appropriate safeguard.
Consistency with the existing probabilistic regression design.

Did you add any tests for the change?
Yes.

A regression test was added that uses a mock regressor returning zero variance and verifies that:
predict_proba().pdf() and log_pdf() remain finite
no numerical warnings are raised

…nverter_store) In BaseProbaRegressor._check_C, the censoring indicator C was being converted using self._y_converter_store instead of the dedicated self._C_converter_store. This could silently corrupt inverse-transform state when y and C have different mtypes (e.g. pd.DataFrame vs ndarray), causing both to share the same converter dictionary. Also fixes the stale copy-paste comment that said 'convert y to y_inner_mtype' inside _check_C. Fixes: sktime#749

…_proba

fkiraly

I would say this is a hack. Instead of clipping it, I would instead return a Delta distribution if the variance is below machine epsilon (possibly times a factor).

Also, code formatting tests are failing. Please look at the dev guide, and pre-commit.

kindler-king and others added 5 commits February 23, 2026 17:40

Merge branch 'sktime:main' into main

9e7a372

Merge branch 'main' of https://github.com/sktime/skpro

08f1cf1

Merge branch 'main' of https://github.com/sktime/skpro

24619d8

[BUG] Prevent zero-variance instability in BaseProbaRegressor.predict…

8fc8178

…_proba

kindler-king requested review from felipeangelimvieira and fkiraly as code owners March 16, 2026 20:31

fkiraly added bug module:probability&simulation probability distributions and simulators module:regression probabilistic regression module and removed module:probability&simulation probability distributions and simulators labels Mar 21, 2026

fkiraly requested changes Mar 21, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prevent zero-variance instability in BaseProbaRegressor.predict_proba#956

Prevent zero-variance instability in BaseProbaRegressor.predict_proba#956
kindler-king wants to merge 5 commits intosktime:mainfrom
kindler-king:bugfix-zero-variance-predict-proba

kindler-king commented Mar 16, 2026

Uh oh!

fkiraly left a comment •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kindler-king commented Mar 16, 2026

Uh oh!

fkiraly left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fkiraly left a comment •

edited

Loading