Open
Description
I couldn't think of a better place to put this, but it would be nice to have a list of papers/blog posts/repos to keep track of potential optimization opportunities. To that end, here is a first contribution:
- Lu G, Zhang W, Wang Z. Optimizing Depthwise Separable Convolution Operations on GPUs. IEEE Transactions on Parallel and Distributed Systems. 2021 May 28. (link here)