More aggressive conv shape functions, docs, `maybe_`/`expect_` call patterns. #3709
Conversation
Codecov Report

❌ Your patch check has failed because the patch coverage (53.64%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.

```
@@            Coverage Diff             @@
##             main    #3709      +/-   ##
==========================================
+ Coverage   64.33%   64.35%   +0.01%
==========================================
  Files        1156     1156
  Lines      134463   134577     +114
==========================================
+ Hits        86508    86608     +100
- Misses      47955    47969      +14
```
Checking for invalid configurations is a good idea.
Regarding the API changes, I think a better approach would be to implement `output_shape(input_shape, weight_shape, options)` to get the full shape, e.g.:
```rust
/// Calculate the expected output shape `[batch_size, channels_out, spatial_dims, ..]` for a convolution.
pub fn calculate_conv_output_shape<const N: usize>(
    in_shape: &Shape,
    weight_shape: &Shape,
    options: &ConvOptions<N>,
) -> Shape {
    assert_eq!(weight_shape.num_dims(), N + 2);
    assert_eq!(in_shape.num_dims(), N + 2);
    let kernel_size = &weight_shape.dims[2..];
    let mut out_shape = in_shape.clone();
    // Spatial dims
    for (i, size_i) in out_shape.dims[2..].iter_mut().enumerate() {
        *size_i = calculate_conv_output_size(
            kernel_size[i],
            options.stride[i],
            options.padding[i],
            options.dilation[i],
            *size_i,
        );
    }
    // Output channels
    out_shape.dims[1] = weight_shape.dims[0];
    out_shape
}
```

Eventually, we could make the output shape explicit to the backend impl, e.g.:
```rust
pub trait ModuleOps<B: Backend> {
    fn conv2d(
        x: FloatTensor<B>,
        weight: FloatTensor<B>,
        bias: Option<FloatTensor<B>>,
        options: ConvOptions<2>,
        output_shape: Shape,
    ) -> FloatTensor<B>;
}
```

That way, there is a single source of truth for the output shape. We can easily compute the complete output shape before calling the backend op. But that would impact some other APIs at the moment.
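For reference, the per-dimension helper used above computes the standard dilated-convolution output size (the same formula documented for PyTorch's `Conv2d`). A minimal sketch, assuming this is what `calculate_conv_output_size` implements (the name and signature here are taken from the snippet above, not verified against the crate):

```rust
/// Standard output size for one spatial dimension of a dilated convolution.
/// Assumed equivalent to Burn's `calculate_conv_output_size` helper.
pub fn conv_output_size(
    kernel_size: usize,
    stride: usize,
    padding: usize,
    dilation: usize,
    size_in: usize,
) -> usize {
    // floor((size_in + 2*padding - dilation*(kernel_size - 1) - 1) / stride) + 1
    (size_in + 2 * padding - dilation * (kernel_size - 1) - 1) / stride + 1
}

fn main() {
    // 3x3 kernel, stride 1, padding 1: "same" spatial size.
    assert_eq!(conv_output_size(3, 1, 1, 1, 32), 32);
    // Stride 2 roughly halves the spatial size.
    assert_eq!(conv_output_size(3, 2, 1, 1, 32), 16);
}
```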
@laggui I'm down to provide an API for 2D + conv, but I don't want to lose the raw computation.
Force-pushed 75c32ba to a01c97f
Not sure I follow, what do you mean by raw computation?
@crutcher, might have missed a notification. Still valid?
@laggui, @antimora I did miss the update.
Force-pushed a01c97f to 63a8968

Force-pushed 63a8968 to 4984a39
That makes sense! We'd keep the

@crutcher see also my PR that introduces more
See #3705
Changes
Flush out conv shape calculations, with `maybe_` and `expect_` variants in 1 and N-D; include both `[usize; D]` and `&[usize]` variants.
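As a sketch of the `maybe_`/`expect_` call-pattern pair described above (the function names, signatures, and validity checks here are illustrative assumptions, not the PR's actual API): the `maybe_` variant returns `None` for an invalid configuration, and the `expect_` variant panics with a descriptive message.

```rust
/// Hypothetical `maybe_` variant: returns `None` when the configuration
/// cannot produce an output (zero stride, zero kernel, or the dilated
/// kernel does not fit in the padded input).
pub fn maybe_conv_output_size(
    kernel_size: usize,
    stride: usize,
    padding: usize,
    dilation: usize,
    size_in: usize,
) -> Option<usize> {
    let effective_kernel = dilation.checked_mul(kernel_size.checked_sub(1)?)? + 1;
    let padded = size_in.checked_add(2 * padding)?;
    if stride == 0 || padded < effective_kernel {
        return None;
    }
    Some((padded - effective_kernel) / stride + 1)
}

/// Hypothetical `expect_` variant: same computation, panics on invalid input.
pub fn expect_conv_output_size(
    kernel_size: usize,
    stride: usize,
    padding: usize,
    dilation: usize,
    size_in: usize,
) -> usize {
    maybe_conv_output_size(kernel_size, stride, padding, dilation, size_in)
        .expect("invalid convolution configuration")
}

fn main() {
    assert_eq!(expect_conv_output_size(3, 1, 1, 1, 32), 32);
    // 7-wide kernel on an unpadded 5-element input: no valid output.
    assert_eq!(maybe_conv_output_size(7, 1, 0, 1, 5), None);
}
```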