First of all, thank you for providing this library.
I want to move a 2D Swin image->image model to neighborhood attention. So far, I have been using the relative positional embeddings as in the original Swin repo.
Both in the issues and in the documentation of the fused attention, you mention that RPB will most likely never be implemented in the fused kernels, and that there are better alternatives.
... Could you maybe give me some pointers to techniques that, in your experience, work well with neighborhood attention?
Cheers
Felix