refactoring noise_schedule and time schedule into base class #1736

Merged
manuelgloeckler merged 38 commits into main from psteinb-explicit_noise_schedules-1437 on Feb 4, 2026
Conversation

@janfb
Contributor

@janfb janfb commented Jan 22, 2026

This is the conflict-free version of #1481 by @psteinb. I merged main in, resolved the conflicts, and moved the work onto this new branch so that others can continue it.

@psteinb I hope this is fine for you - all your commits are still here and attributed to you.

  • created noise_schedule method to be overridden by derived classes
  • created times_schedule method to be overridden by derived classes
  • created test on times_schedule
  • improved docstrings
  • created test on noise_schedule

This addresses #1437

psteinb and others added 20 commits March 20, 2025 18:38
- created noise_schedule method to be overwritten by derivatives
- created times_schedules method to be overwritten by derivatives
- created test on times_schedules
- improved docstrings
in addition:
- added improved version to benchmarks (for later comparison)
- created new class ImprovedVPScoreEstimator to understand how the VE
  estimator is implemented
- without touching the forward function of ConditionalScoreEstimator
- benchmarks show that this leads to very long training time without any
  performance improvements
…e_schedules-1437

Resolved the merge conflicts by aligning the score estimator with the vector-field API
changes from main and accepting the deletions of legacy score/NPSE paths.

Details:
- Cleaned and reconciled ConditionalScoreEstimator imports/init and typing in sbi/neural_nets/estimators/score_estimator.py, keeping beta_min/beta_max and device tracking consistent with the new base.
- Dropped deleted legacy files to match main: sbi/inference/trainers/npse/npse.py, sbi/neural_nets/net_builders/score_nets.py, and tests/score_estimator_test.py.
@janfb
Contributor Author

janfb commented Jan 22, 2026

And here's a quick review by opencode using Codex 5.2:

Summary

The PR centralizes time/noise schedule logic in ConditionalScoreEstimator and routes
VP/SubVP drift/diffusion through the shared schedule. This aligns with the requested
feature. However, there are two correctness issues and one scope mismatch with the
issue requirements.

Blocking issues

  1. times_schedule returns torch.Tensor(sorted(times)), which recreates the tensor
    on CPU and drops device/dtype. This will silently move schedules off-device.

    • Fix: use torch.sort(times).values and return the tensor as-is.
  2. times_schedule relies on self.device, but self.device is not updated when the
    module moves devices via .to(). Schedules can be created on the wrong device.

    • Fix: use a buffer/device source such as self._mean_base.device or
      next(self.parameters()).device when constructing times.
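
For illustration, a minimal sketch combining both fixes (assuming the t_min/t_max attributes and the _mean_base buffer exist on the estimator, as suggested above):

import torch
from torch import Tensor

def times_schedule(self, num_samples: int) -> Tensor:
    # Sample on the same device as the estimator's buffers so that .to()
    # moves the schedule together with the module.
    device = self._mean_base.device
    times = (
        torch.rand(num_samples, device=device) * (self.t_max - self.t_min)
        + self.t_min
    )
    # Sort the tensor directly instead of rebuilding it on CPU.
    return torch.sort(times).values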

Non-blocking but important

  • The loss docstring says it uses times_schedule when times is None, but the
    implementation still samples directly with torch.rand. Either update the code to
    call self.times_schedule(...) or update the docstring. The issue explicitly asked
    for using the default training schedule, so the implementation should likely use the
    new method.
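
For example, the sampling branch could delegate to the new method (a sketch only; the exact loss signature is not shown in this thread):

# Inside loss(), replacing the direct torch.rand sampling:
if times is None:
    times = self.times_schedule(input.shape[0])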

Tests

The issue requests tests for both schedules, but this PR only changes
score_estimator.py. Please add tests for:

  • times_schedule: correct shape, monotonicity, and device
  • noise_schedule: correct shape and range [beta_min, beta_max]
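
For instance, a hedged pytest sketch (the estimator fixture and the attribute names are assumptions based on this thread, not the final test code):

import torch

def test_times_schedule(estimator):
    times = estimator.times_schedule(100)
    assert times.shape == (100,)
    assert torch.all(times[1:] >= times[:-1])  # monotonically non-decreasing
    assert times.device == estimator._mean_base.device

def test_noise_schedule(estimator):
    times = torch.linspace(estimator.t_min, estimator.t_max, 50)
    betas = estimator.noise_schedule(times)
    assert betas.shape == times.shape
    assert torch.all(betas >= estimator.beta_min)
    assert torch.all(betas <= estimator.beta_max)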

@codecov

codecov bot commented Jan 22, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 88.39%. Comparing base (649f5d3) to head (ecb295c).
⚠️ Report is 6 commits behind head on main.
✅ All tests successful. No failed tests found.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1736      +/-   ##
==========================================
- Coverage   88.51%   88.39%   -0.13%     
==========================================
  Files         137      137              
  Lines       11527    12188     +661     
==========================================
+ Hits        10203    10773     +570     
- Misses       1324     1415      +91     
Flag    Coverage Δ
fast    84.73% <100.00%> (?)

Flags with carried forward coverage won't be shown.

Files with missing lines Coverage Δ
sbi/inference/posteriors/vector_field_posterior.py 77.47% <ø> (+0.29%) ⬆️
sbi/inference/trainers/vfpe/base_vf_inference.py 92.81% <100.00%> (ø)
sbi/neural_nets/estimators/base.py 75.19% <100.00%> (+1.42%) ⬆️
sbi/neural_nets/estimators/score_estimator.py 93.96% <100.00%> (+0.42%) ⬆️

... and 9 files with indirect coverage changes

@janfb
Contributor Author

janfb commented Jan 22, 2026

When comparing to the plan in the original issue #1437, I noticed that there are still a couple of open todos, both for the general API ideas and for the schedule from the EDM paper in particular. Accordingly, here is a potential plan for finishing this PR:

  1. Add schedule APIs to ConditionalScoreEstimator

    • Add train_schedule(num_samples, t_min=None, t_max=None) that returns diffusion
      times used for training. Default behavior should be a simple uniform schedule so
      current results remain unchanged unless overridden by subclasses.
    • Add solve_schedule(num_steps, t_min=None, t_max=None) that returns a deterministic
      monotonic time grid (e.g., evenly spaced). This is the default time discretization
      for evaluation/sampling steps.
    • Keep noise_schedule(times) as the mapping from time to noise magnitude (beta/sigma)
      so schedules stay decoupled from the SDE implementation.
  2. Use schedule APIs in the loss

    • Update loss(..., times=None) to call self.train_schedule(...) when times is
      not provided, so the new schedule is actually used in training.
    • Ensure schedules are created on the estimator’s current device (e.g., via a buffer
      like self._mean_base.device) to avoid CPU/GPU mismatches.
  3. Wire solve schedule into vector-field trainer validation

    • In sbi/inference/trainers/vfpe/base_vf_inference.py, when validation_times is an
      integer, replace the uniform torch.linspace(...) with
      self._neural_net.solve_schedule(num_steps).
    • Preserve existing behavior when validation_times is already a tensor so users can
      pass custom schedules explicitly.
  4. Add tests

    • train_schedule: correct shape, correct bounds (t_min/t_max), device, and that
      the default is uniform.
    • solve_schedule: monotonic increasing, includes endpoints, correct device, and
      correct length.
    • Regression: verify loss uses train_schedule when times is None.
  5. Optional EDM schedule for VE (paper details)

    • Add optional VE-specific parameters (e.g., edm_sigma_min, edm_sigma_max,
      edm_rho, edm_p_mean, edm_p_std) and a switch such as schedule="edm" to opt in.
    • Training noise distribution: EDM samples noise levels from a log-normal
      distribution (Table 1):
      • ln(σ) ~ N(P_mean, P_std^2).
      • Example defaults used in the paper: P_mean = -1.2, P_std = 1.2 (CIFAR-10).
      • In code: sample σ = exp(P_mean + P_std * N(0,1)), then clamp to
        [σ_min, σ_max] if needed.
    • Solve schedule (deterministic grid): EDM uses a power-law schedule for
      discretizing σ (Eq. 5 in the paper):
      • Define σ_i for i = 0..N-1 as
        σ_i = (σ_max^(1/ρ) + i/(N-1) * (σ_min^(1/ρ) - σ_max^(1/ρ)))^ρ.
      • Set σ_N = 0 for the final step.
      • Larger ρ concentrates steps near low noise; the paper uses ρ = 7.
    • Mapping to time: with the EDM choice σ(t) = t and s(t) = 1, time and
      sigma are interchangeable, so t_i = σ_i.
    • Keep uniform schedules as the default so existing behavior is stable.
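
To make item 5 concrete, here is a standalone sketch of the two EDM pieces (the function names and the sigma_min/sigma_max defaults are illustrative, taken from the paper's setup, not from this PR):

import torch

def edm_train_sigmas(num_samples, p_mean=-1.2, p_std=1.2,
                     sigma_min=0.002, sigma_max=80.0):
    # Training noise levels: ln(sigma) ~ N(P_mean, P_std^2), clamped to
    # [sigma_min, sigma_max].
    log_sigma = p_mean + p_std * torch.randn(num_samples)
    return log_sigma.exp().clamp(sigma_min, sigma_max)

def edm_solve_sigmas(num_steps, sigma_min=0.002, sigma_max=80.0, rho=7.0):
    # Deterministic power-law grid (Eq. 5): interpolate between sigma_max and
    # sigma_min in sigma^(1/rho) space; larger rho concentrates steps near
    # low noise.
    i = torch.arange(num_steps)
    sigmas = (
        sigma_max ** (1 / rho)
        + i / (num_steps - 1) * (sigma_min ** (1 / rho) - sigma_max ** (1 / rho))
    ) ** rho
    # Append sigma_N = 0 for the final step; with sigma(t) = t and s(t) = 1,
    # the solve times are simply t_i = sigma_i.
    return torch.cat([sigmas, sigmas.new_zeros(1)])

With a schedule="edm" switch, train_schedule could return edm_train_sigmas(num_samples) and solve_schedule could return edm_solve_sigmas(num_steps), while the uniform defaults stay in place.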

@touronc touronc self-assigned this Jan 26, 2026
@janfb
Contributor Author

janfb commented Jan 30, 2026

Hi @touronc

Thank you for the updates here - looks very good. I tested this locally, found some issues, and suggest fixes below. I just saw that you pushed some fixes recently, so some of the comments below may already be obsolete.


1. solve_schedule must be deterministic

The implementation uses torch.rand() but the docstring says "deterministic monotonic time grid". Even with sorting, it's random on each call, which breaks ODE/SDE integration reproducibility. Use torch.linspace() instead:

def solve_schedule(self, num_steps, t_min=None, t_max=None):
    t_min = self.t_min if t_min is None else t_min
    t_max = self.t_max if t_max is None else t_max
    return torch.linspace(t_max, t_min, num_steps, device=self._mean_base.device)

2. Restore validation_times_nugget

The trainer change lost the nugget offset that avoids boundary instability at t=0 and t=t_max.

base_vf_inference.py: Pass nugget to solve_schedule():

if isinstance(validation_times, int):
    validation_times = self._neural_net.solve_schedule(
        validation_times,
        t_min=self._neural_net.t_min + validation_times_nugget,
        t_max=self._neural_net.t_max - validation_times_nugget,
    )

This in turn requires the base class ConditionalVectorFieldEstimator.solve_schedule() to accept t_min/t_max parameters:

def solve_schedule(
    self,
    steps: int,
    t_min: Optional[float] = None,
    t_max: Optional[float] = None,
) -> Tensor:
    t_min = self.t_min if t_min is None else t_min
    t_max = self.t_max if t_max is None else t_max
    return torch.linspace(t_max, t_min, steps, device=self._mean_base.device)

3. Device handling in loss()

Ensure the times tensor is on the correct device after calling train_schedule():

if times is None:
    times = self.train_schedule(input.shape[0])
times = times.to(input.device)

4. Remove self.device tracking

The manual device tracking in __init__ and loss() is fragile (doesn't follow .to() calls) and never actually read. Remove these lines:

# In __init__:
self.device = net.device if hasattr(net, "device") else torch.device("cpu")

# In loss():
self.device = input.device if self.device != input.device else self.device

The schedule methods already use self._mean_base.device which is a proper buffer.


5. train_schedule should use plain uniform sampling (no sorting?)

The current implementation sorts times and forces the endpoints; is there a specific reason for the sorting?

times[0, ...] = t_min
times[-1, ...] = t_max
return torch.sort(times).values

This differs from the main branch behavior, and I believe uniform random sampling would be fine here (or am I missing something?):

return (
    torch.rand(num_samples, device=self._mean_base.device) * (t_max - t_min)
    + t_min
)

6. VE test needs slightly more simulations

Locally, I noticed that with these changes, the VE test fails with 2500 simulations, probably because the overall random state changed. Increasing the simulation budget slightly fixes it for me:

# In tests/linearGaussian_vector_field_test.py
num_simulations = 2600 if vector_field_type == "ve" else 2500

7. Minor docstring fixes

  • train_schedule: the Returns section says "range [0,1]" but should say "[t_min, t_max]"
  • noise_schedule: References self.times_schedule which doesn't exist
  • vector_field_posterior.py (~line 309): ts parameter has broken indentation

Otherwise, this looks very good!

Regarding the EDM paper related changes, I suggest we move this into a follow-up PR.

Thanks @touronc

Contributor Author

@janfb janfb left a comment


Thanks @touronc for implementing this! 🙏 Very well done!

I added a couple of comments, mostly formatting.

Contributor Author

@janfb janfb left a comment


All fixes addressed, thank you @touronc! 🙏

I approve this PR (I cannot officially approve it because I created it).

Contributor

@manuelgloeckler manuelgloeckler left a comment


Great, love it. Thanks for finishing this up.

I think, e.g. for VE variants, it would be beneficial to use an EDM-like train and solve schedule, but that is another project which would need some benchmarking. So I will also approve this to get it merged.

@manuelgloeckler manuelgloeckler merged commit 1888205 into main on Feb 4, 2026
9 checks passed