Add grid sampling#361

Merged
CarloLucibello merged 11 commits into FluxML:master from pxl-th:master
Nov 11, 2021

Conversation

@pxl-th
Member

@pxl-th pxl-th commented Nov 2, 2021

This PR adds grid sampling for the 4D case.

This is needed for things like self-supervised depth estimation, where a neural network learns a depth map D for the current image and a projection transformation P for the adjacent images, then "warps" those images using D and P.
This function performs the image sampling given the projected coordinates (a.k.a. the grid).

The implementation differs from PyTorch's version in that it assumes `align_corners = true` and supports only bilinear interpolation. It was tested to produce the same results as PyTorch's version.

PR with the CUDA kernels.
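For context, a minimal usage sketch of the new function, following the shapes in the PR's docstring: a `(W, H, C, N)` input and a `(2, W_out, H_out, N)` grid of normalized `(x, y)` coordinates. This is an illustration, not code from the PR, and keyword names may differ slightly:

```julia
using NNlib  # provides `grid_sample` after this PR

W, H = 4, 4
x = reshape(Float32.(1:W*H), W, H, 1, 1)  # (W, H, C, N) input

# Identity grid: (x, y) coordinates in [-1, 1] for every output pixel.
grid = Array{Float32}(undef, 2, W, H, 1)
for j in 1:H, i in 1:W
    grid[1, i, j, 1] = -1f0 + 2f0 * (i - 1) / (W - 1)  # x
    grid[2, i, j, 1] = -1f0 + 2f0 * (j - 1) / (H - 1)  # y
end

y = grid_sample(x, grid)
y ≈ x  # with align_corners = true, the identity grid reproduces the input
```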

@pxl-th
Member Author

pxl-th commented Nov 2, 2021

I've commented out the failing test for the depthwise conv for now. Should I bring it back?

@CarloLucibello
Member

They were failing intermittently, mostly in the multithreaded environment. Since we have a few multithreaded methods, it's better to have the multithreaded CI passing, so I'm OK with keeping those problematic tests out. We should open an issue for the depthwiseconv failures if there isn't one already.

I'll try to review in the next few days. Thanks for this contribution; at first glance it looks perfect already!

@ToucheSir
Member

@CarloLucibello RE CI failures, feel free to rename and re-purpose #359.

@pxl-th pxl-th changed the title from "Add grid samping" to "Add grid sampling" Nov 3, 2021
src/sampling.jl Outdated
Where for each `(W_out, H_out, N)` grid contains `(x, y)`
coordinates that specify sampling locations normalized by the `input` shape.

Therefore, it should have values mostly in `[-1, 1]` range.
Member

Suggested change
Therefore, it should have values mostly in `[-1, 1]` range.
Therefore, `x` and `y` should have values mostly in `[-1, 1]` range.

Also, why "mostly?"

Member Author

Also, why "mostly?"

It was added as a hint that the values can fall outside that range, but it's probably confusing.

Do not require wrapping symbols in Val.
Fix typos.
Remove `@thunk` on the gradient.
@pxl-th pxl-th requested a review from CarloLucibello November 8, 2021 17:20
@CarloLucibello CarloLucibello merged commit 4459d70 into FluxML:master Nov 11, 2021
@maxfreu
Contributor

maxfreu commented Nov 11, 2021

Hi! Just to leave this here, since I ported the tri/bi/linear upsampling code from PyTorch: 1) Setting `align_corners = true` is not the best choice, because the gradients change with the input image size; PyTorch's default is therefore `false`. That said, it doesn't change the results much. 2) The code you wrote reminds me a lot of what I wrote for upsampling. The kernels probably can't be shared, although they look quite similar, but I think the utility functions can be; that's maybe something for the future.
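For reference, the two conventions map a normalized coordinate to a (0-based) pixel position differently. A sketch of the formulas, mirroring PyTorch's definitions rather than NNlib's actual code:

```julia
# Map a normalized coordinate x ∈ [-1, 1] to a 0-based pixel position.
unnormalize_align(x, size)   = (x + 1) / 2 * (size - 1)    # align_corners = true
unnormalize_noalign(x, size) = ((x + 1) * size - 1) / 2    # align_corners = false

# With align_corners = true, -1 and 1 land on the centers of the edge pixels,
# so the effective sampling step (and hence the gradient) depends on `size`;
# with align_corners = false, -1 and 1 refer to the outer edges of those pixels.
```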

@gRox167
Contributor

gRox167 commented Jan 7, 2025

Just to confirm, is there any plan to add 5D support for grid_sampling?
This would be really helpful for 3D image registration, which is widely used in the neuro- and medical-imaging fields. Networks for medical image registration, for example VoxelMorph, use this function extensively to warp the moving image to a fixed image.

@CarloLucibello
Member

No active plans that I know of, but contributions are welcome. Maybe @maxfreu could provide some guidance.

@maxfreu
Contributor

maxfreu commented Jan 8, 2025

Hi @gRox167! As an imaging-science PhD student, this is the perfect task for you :D In principle it should be quite simple:

  1. Fork NNlib
  2. Come up with simple test cases for the forward and backward pass that you want to pass and write those first.
  3. Add specializations for `AbstractArray{T,4}` to the current implementation where needed.
  4. Copy the code for the 4D case, change the specialization to 5D.
  5. Add the depth dimension where sizes are checked etc.
  6. Change the sampling code in the kernel. You can use the PyTorch code as a reference, which you can find [here](https://github.com/pytorch/pytorch/blob/f7000350905be5073892e0b23df681c0281be0f0/aten/src/ATen/native/cuda/GridSampler.cu#L156) for the GPU and [here](https://github.com/pytorch/pytorch/blob/f7000350905be5073892e0b23df681c0281be0f0/aten/src/ATen/native/GridSampler.cpp#L41) for the CPU.
  7. Ping me or @pxl-th in case you hit bumps in the road :)
  8. Enjoy how little code remains of the C++ hell.

It should be pretty straightforward, more of a writing task than a thinking task. Optionally, you can simply use ChatGPT: show it the current Julia and PyTorch code and tell it to expand it; I guess that should work pretty well nowadays. The key to success in any case is good tests for the forward and backward pass.
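To make step 2 concrete, a forward-pass test for the 5D case might look something like the following. This is a hypothetical sketch: it assumes the eventual 5D method keeps the 4D conventions, i.e. a `(3, W_out, H_out, D_out, N)` grid of `(x, y, z)` coordinates and `align_corners = true`:

```julia
using Test

W, H, D = 3, 3, 3
x = reshape(Float32.(1:W*H*D), W, H, D, 1, 1)  # (W, H, D, C, N) input

# Identity grid: (x, y, z) coordinates in [-1, 1] at every output voxel.
grid = Array{Float32}(undef, 3, W, H, D, 1)
for k in 1:D, j in 1:H, i in 1:W
    grid[1, i, j, k, 1] = -1f0 + 2f0 * (i - 1) / (W - 1)
    grid[2, i, j, k, 1] = -1f0 + 2f0 * (j - 1) / (H - 1)
    grid[3, i, j, k, 1] = -1f0 + 2f0 * (k - 1) / (D - 1)
end

# With align_corners = true, an identity warp should reproduce the input exactly.
@test grid_sample(x, grid) ≈ x
```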

@gRox167
Contributor

gRox167 commented Jan 9, 2025

Thanks for your guidance. I think I can give it a try.

@gRox167
Contributor

gRox167 commented Jan 20, 2025

Please refer to PR #627.
