This notebook demonstrates a self-supervised learning pipeline using the SimCLR framework and a ResNet-18 backbone. The aim is to pretrain a model on unlabeled data using contrastive learning and then transfer the learned representation to a supervised downstream classification task.
- Import necessary libraries:
  - PyTorch and torchvision for model and data utilities
  - Hugging Face `datasets` to load and process image datasets
  - SimCLR implementation (defined in a class within the notebook)
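A minimal import block consistent with these steps (a sketch; the notebook may import more):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision import models, transforms
from torchvision.models import ResNet18_Weights
from datasets import load_dataset
```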
- Set up GPU and load dataset:
  - Automatically detects GPU/CPU (`device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')`)
  - Loads the image dataset using Hugging Face's `load_dataset()` and applies SimCLR-style augmentations (see the sketch below)
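A minimal sketch of this setup, assuming CIFAR-10 as the dataset (the actual dataset is not fixed by this summary) and a standard SimCLR augmentation pipeline:

```python
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

# Hypothetical dataset choice; substitute the dataset used in the notebook.
dataset = load_dataset('cifar10', split='train')

# SimCLR-style augmentations: random crop, flip, color jitter, grayscale.
simclr_transform = transforms.Compose([
    transforms.RandomResizedCrop(32),
    transforms.RandomHorizontalFlip(),
    transforms.RandomApply([transforms.ColorJitter(0.4, 0.4, 0.4, 0.1)], p=0.8),
    transforms.RandomGrayscale(p=0.2),
    transforms.ToTensor(),
])

def two_views(batch):
    # Each image yields two independently augmented views for the contrastive pairs.
    # The 'img' column name is CIFAR-10-specific.
    batch['view1'] = [simclr_transform(img) for img in batch['img']]
    batch['view2'] = [simclr_transform(img) for img in batch['img']]
    return batch

dataset.set_transform(two_views)
```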
- Define SimCLR model:
  - Constructs a custom `SimCLR` model with a ResNet-18 backbone
  - Projection head includes a two-layer MLP for contrastive representation
- Train SimCLR (optional):
  - Includes a training loop for pretraining using contrastive loss (NT-Xent)
  - In this version, training is skipped and a pretrained model is loaded instead (a sketch of such a loop follows)
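For reference, the skipped pretraining loop follows the usual contrastive pattern. A sketch, assuming a `pretrain_loader` that yields two augmented views per batch (`num_epochs` and the learning rate are placeholder values) and the `contrastive_loss` function summarized later:

```python
optimizer = torch.optim.Adam(model.parameters(), lr=3e-4)
num_epochs = 100  # placeholder; SimCLR typically benefits from long pretraining

model.train()
for epoch in range(num_epochs):
    for view1, view2 in pretrain_loader:
        view1, view2 = view1.to(device), view2.to(device)
        z_i = model(view1)   # projected embeddings for view 1
        z_j = model(view2)   # projected embeddings for view 2
        loss = contrastive_loss(z_i, z_j, temperature=0.1)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```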
- Load Pretrained SimCLR Model:

```python
backbone = models.resnet18(weights=ResNet18_Weights.IMAGENET1K_V1)
model = SimCLR(backbone=backbone, tau=0.1).to(device)
model.load_state_dict(torch.load(PATH, weights_only=True))
```
- Extract Backbone and Fine-tune on Downstream Task:
  - Backbone's final `fc` layer is replaced with `nn.Identity()`
  - A custom `ClassificationModel` is defined to use the frozen backbone and a new classification head (extraction sketched below)
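A minimal sketch of the extraction step, assuming the trained SimCLR model exposes its encoder as `model.backbone`:

```python
encoder = model.backbone    # encoder only, without the projection head
encoder.fc = nn.Identity()  # drop ResNet's classification layer, if not already done

# Freeze the pretrained encoder so only the new classification head is trained.
for param in encoder.parameters():
    param.requires_grad = False
```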
- Train and Evaluate Classification Model:
  - Uses cross-entropy loss and Adam optimizer
  - Evaluates using accuracy on a validation set (a sketch follows)
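A sketch of the downstream loop; `clf_model`, `train_loader`, and `val_loader` are placeholder names for the fine-tuning model and data loaders:

```python
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(clf_model.parameters(), lr=1e-3)

for epoch in range(num_epochs):
    clf_model.train()
    for images, labels in train_loader:
        images, labels = images.to(device), labels.to(device)
        loss = criterion(clf_model(images), labels)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

    # Validation accuracy.
    clf_model.eval()
    correct = total = 0
    with torch.no_grad():
        for images, labels in val_loader:
            images, labels = images.to(device), labels.to(device)
            preds = clf_model(images).argmax(dim=1)
            correct += (preds == labels).sum().item()
            total += labels.size(0)
    print(f'epoch {epoch}: val accuracy = {correct / total:.3f}')
```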
```python
class SimCLR(nn.Module):
    def __init__(self, backbone, projection_dim=256, tau=0.1):
        ...
```

- Combines ResNet-18 feature extractor with an MLP projection head
- Uses cosine similarity and temperature-scaled NT-Xent loss
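One way to fill in this class (a sketch, not the notebook's exact code; the two-layer MLP head follows the SimCLR paper):

```python
class SimCLR(nn.Module):
    def __init__(self, backbone, projection_dim=256, tau=0.1):
        super().__init__()
        self.tau = tau
        feature_dim = backbone.fc.in_features   # 512 for ResNet-18
        backbone.fc = nn.Identity()             # strip the ImageNet classifier
        self.backbone = backbone
        # Two-layer MLP projection head for the contrastive embedding space.
        self.projection_head = nn.Sequential(
            nn.Linear(feature_dim, feature_dim),
            nn.ReLU(inplace=True),
            nn.Linear(feature_dim, projection_dim),
        )

    def forward(self, x):
        h = self.backbone(x)           # representation reused downstream
        z = self.projection_head(h)    # embedding fed to the contrastive loss
        return z
```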
```python
def contrastive_loss(z_i, z_j, temperature):
    ...
```

- Pairs of augmented views of the same image are encouraged to be close
- Other pairs are pushed apart in representation space
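A self-contained NT-Xent implementation matching this description (a sketch; indexing conventions vary between implementations):

```python
def contrastive_loss(z_i, z_j, temperature):
    # Normalize and compute all pairwise cosine similarities across both views.
    batch_size = z_i.size(0)
    z = F.normalize(torch.cat([z_i, z_j], dim=0), dim=1)   # (2N, d)
    sim = z @ z.t() / temperature                          # (2N, 2N)
    # Mask self-similarity so it never counts as a candidate.
    sim.fill_diagonal_(float('-inf'))
    # The positive for each sample is its counterpart in the other view.
    targets = torch.cat([
        torch.arange(batch_size, 2 * batch_size),
        torch.arange(0, batch_size),
    ]).to(z.device)
    # Cross-entropy over similarities gives the temperature-scaled NT-Xent loss.
    return F.cross_entropy(sim, targets)
```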
```python
class ClassificationModel(nn.Module):
    ...
```

- Wraps the ResNet backbone with a classification head
- Handles the case where `fc` is replaced by `nn.Identity()`
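A sketch of this wrapper, with the feature dimension passed in explicitly (how to infer it once `fc` is gone is covered in the troubleshooting notes below):

```python
class ClassificationModel(nn.Module):
    def __init__(self, backbone, feature_dim, num_classes):
        super().__init__()
        self.backbone = backbone   # frozen encoder; its fc is nn.Identity()
        self.head = nn.Linear(feature_dim, num_classes)

    def forward(self, x):
        features = self.backbone(x)   # (batch, feature_dim)
        return self.head(features)    # class logits
```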
- Problem: The final `fc` layer in ResNet was replaced by `nn.Identity()` during SimCLR pretraining, causing an error when trying to access `fc.in_features`.
- Solution: Used a dummy input to the backbone to infer the output feature size dynamically for the classifier layer (see the sketch after this list).
- Problem: Needed to ensure the classifier didn't accidentally include the projection head from SimCLR.
- Solution: Separated and extracted the encoder-only part (`backbone`) from the SimCLR model.
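The dummy-input trick from the first fix can be as short as this (a sketch; the input resolution and `num_classes=10` are assumptions):

```python
# fc is nn.Identity(), so fc.in_features no longer exists. Instead, push one
# dummy batch through the encoder and read the output width directly.
encoder.eval()
with torch.no_grad():
    dummy = torch.zeros(1, 3, 224, 224, device=device)   # assumed input size
    feature_dim = encoder(dummy).shape[1]                 # 512 for ResNet-18

clf_model = ClassificationModel(encoder, feature_dim, num_classes=10)
```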
- The fine-tuned classification model successfully used the pretrained encoder to achieve reasonable accuracy on the downstream task.
- The modular design allows for experimenting with different datasets, backbones, and projection heads.
- Self-supervised contrastive learning like SimCLR enables strong feature extractors without requiring labels.
- Downstream classification is efficient and effective when pretrained representations are properly leveraged.
- It's crucial to manage model internals like `fc` layers and projection heads to avoid errors during fine-tuning.
- SimCLR works best with strong augmentations and a well-structured projection head.