FGDCC: Fine-Grained Deep Cluster Categorization - An Architecture for Intra-Class Variability Problems in FGVC Tasks
FGDCC is a method developed to tackle intra-class variability problems in fine-grained visual categorization (FGVC) tasks. It leverages the original dataset labels to uncover latent degrees of intra-class variability via class-wise clustering: we then train the model to predict the cluster prototypes obtained from the K-Means assignments.
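As an illustration of the class-wise clustering step, the sketch below runs a separate K-Means per class over extracted features and returns each sample's sub-cluster assignment, which can be paired with the original class label to form prediction targets. The function and parameter names (`classwise_kmeans`, `k`) are illustrative assumptions rather than this repository's actual API; Faiss is used since it is listed among the dependencies.

```python
import numpy as np
import faiss


def classwise_kmeans(features, labels, k=5, seed=0):
    """Run a separate K-Means per class and return each sample's
    cluster assignment within its own class (a pseudo sub-class label)."""
    features = np.ascontiguousarray(features, dtype="float32")
    labels = np.asarray(labels)
    assignments = np.zeros(len(labels), dtype=np.int64)
    for c in np.unique(labels):
        idx = np.where(labels == c)[0]
        k_c = min(k, len(idx))  # guard against classes with very few samples
        kmeans = faiss.Kmeans(features.shape[1], k_c, niter=20, seed=seed)
        kmeans.train(features[idx])
        # Assign each sample of class c to its nearest centroid (prototype).
        _, cluster_ids = kmeans.index.search(features[idx], 1)
        assignments[idx] = cluster_ids.ravel()
    return assignments  # pair with `labels` to form (class, sub-cluster) targets
```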
As opposed to many deep clustering pipelines that use autoencoder bottleneck features for clustering, we use VICReg regularization to enforce an L2-norm-friendly space in which we can naturally perform K-Means on higher-dimensional features. Concretely, we sample images that belong to the same class, run them through the network to extract representations, and compute the VICReg loss between these representations. This constrains the model to learn classification features while preserving the Euclidean distances between samples of the same class.
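For reference, here is a minimal sketch of the VICReg objective (invariance, variance, and covariance terms, following Bardes et al., 2022) applied to two batches of same-class representations. The function name and loss coefficients are illustrative defaults and may differ from what this repository actually uses.

```python
import torch
import torch.nn.functional as F


def vicreg_loss(z1, z2, sim_w=25.0, var_w=25.0, cov_w=1.0, eps=1e-4):
    """VICReg objective between two batches of representations,
    e.g. features of two images drawn from the same class."""
    n, d = z1.shape
    # Invariance: pull same-class representations together in the L2 sense.
    inv = F.mse_loss(z1, z2)
    # Variance: hinge loss keeping each dimension's std above 1 (avoids collapse).
    std1 = torch.sqrt(z1.var(dim=0) + eps)
    std2 = torch.sqrt(z2.var(dim=0) + eps)
    var = F.relu(1.0 - std1).mean() + F.relu(1.0 - std2).mean()
    # Covariance: decorrelate feature dimensions (penalize off-diagonal terms).
    z1c, z2c = z1 - z1.mean(dim=0), z2 - z2.mean(dim=0)
    cov1 = (z1c.T @ z1c) / (n - 1)
    cov2 = (z2c.T @ z2c) / (n - 1)

    def off_diagonal(m):
        return m - torch.diag(torch.diag(m))

    cov = off_diagonal(cov1).pow(2).sum() / d + off_diagonal(cov2).pow(2).sum() / d
    return sim_w * inv + var_w * var + cov_w * cov
```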
The choice of VICReg regularization follows both an insight from the work of Ravid Shwartz-Ziv, which demonstrates that SSL naturally induces clusters aligned with semantic labels, and an empirical validation: when applying our method (without any regularization) on top of ImageNet-based pre-training, the K-Means loss increased over the epochs, indicating that the Euclidean distances between representations were growing throughout fine-tuning.
Requirements:
- Python 3.8 (or newer)
- PyTorch 2.0
- torchvision
- Faiss
- Other dependencies: pyyaml, numpy, opencv, submitit, timm
This repository is built upon the I-JEPA repository.
See the LICENSE file for details about the license under which this code is made available.
If you find this repository useful in your research, please consider giving it a star ⭐ and a citation.

