PyTorch implementation of the paper *CoSwin: Convolution Enhanced Hierarchical Shifted Window Attention For Small-Scale Vision*.
- We have released the source code and model weights.
Clone the CoSwin GitHub repository:

```shell
git clone https://github.com/puskal-khadka/coswin
cd coswin
```
Install the required dependencies:

```shell
pip install -r requirements.txt
```
To train the CoSwin model on a benchmark dataset:

```shell
python train.py --model coswin --dataset <dataset-name>
```

where `<dataset-name>` is the name of the dataset, e.g. CIFAR-10, CIFAR-100, MNIST, SVHN, or TINY-IMAGENET.
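The command-line surface used above can be sketched with `argparse`; this is a hypothetical re-creation of the flags shown in this README (`--model`, `--dataset`), not the actual contents of `train.py`, which may define additional options:

```python
import argparse

# Hypothetical sketch of the CLI flags this README invokes; the real
# train.py may accept more options (epochs, batch size, etc.).
parser = argparse.ArgumentParser(description="Train CoSwin on a benchmark dataset")
parser.add_argument("--model", default="coswin", help="model architecture name")
parser.add_argument("--dataset", required=True,
                    help="dataset name, e.g. CIFAR-10 or TINY-IMAGENET")

# Parse a sample invocation (mirrors the command shown above).
args = parser.parse_args(["--model", "coswin", "--dataset", "TINY-IMAGENET"])
print(args.model, args.dataset)
```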
To perform Grad-CAM visualization, place the images you want to visualize in a separate directory, then run:

```shell
python gradcam.py --model coswin --dataset TINY-IMAGENET --model_path /path/to/your/model
```
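As background, the core computation Grad-CAM performs can be sketched in plain Python. This is an illustrative, list-based sketch of the standard Grad-CAM formula (channel weights from global-average-pooled gradients, followed by a weighted sum and ReLU); the function name and data layout are ours, and the real `gradcam.py` operates on the trained CoSwin model's feature maps:

```python
def grad_cam(activations, gradients):
    """Standard Grad-CAM core on plain nested lists.

    activations, gradients: [channels][h][w] floats from a target layer.
    Returns an h x w class-activation map.
    """
    h, w = len(activations[0]), len(activations[0][0])
    cam = [[0.0] * w for _ in range(h)]
    for A_k, g_k in zip(activations, gradients):
        # Channel weight = global average pool of that channel's gradient map.
        alpha = sum(sum(row) for row in g_k) / (h * w)
        for i in range(h):
            for j in range(w):
                cam[i][j] += alpha * A_k[i][j]
    # ReLU: keep only features with a positive influence on the target class.
    return [[max(0.0, v) for v in row] for row in cam]
```

The resulting map is typically upsampled to the input resolution and overlaid on the image as a heatmap.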
Results of CoSwin on small-scale benchmarks:

| Dataset | Training Size | Resolution | Accuracy@1 (%) | Model Weight |
|---|---|---|---|---|
| CIFAR-10 | 50,000 | 32x32 | 96.63 | ckpt |
| CIFAR-100 | 50,000 | 32x32 | 81.64 | ckpt |
| MNIST | 60,000 | 28x28 | 99.60 | ckpt |
| SVHN | 73,257 | 32x32 | 98.07 | ckpt |
| TINY-IMAGENET | 100,000 | 64x64 | 65.06 | ckpt |
CoSwin demonstrates strong performance on small dataset challenges compared to existing state-of-the-art models.
If you find this work useful, please cite:

```bibtex
@article{khadka2025coswin,
  title={CoSwin: Convolution Enhanced Hierarchical Shifted Window Attention For Small-Scale Vision},
  author={Puskal Khadka and Rodrigue Rizk and Longwei Wang and KC Santosh},
  journal={arXiv preprint arXiv:2509.08959},
  year={2025}
}
```

This repo is based on Timm and Swin Transformer. Thanks for their amazing work.
