CIFAR-10 Vision Quest: Exploring Image Classification with Deep Learning

Overview

This project focuses on classifying images from the CIFAR-10 dataset using a variety of machine learning models. The models explored include:

Logistic regression
Decision trees
Convolutional neural networks (CNNs)
Transformers

Training and Evaluation

The models were trained and evaluated to compare their performance using standard metrics such as accuracy. This approach allows for a comprehensive comparison of different machine learning techniques in the context of image classification.

Key Findings

Our results demonstrate the superior effectiveness of deep learning models in handling complex image classification tasks. Specifically:

CNNs showed exceptional performance
Transformers also exhibited high effectiveness

These findings highlight the power of advanced neural network architectures in processing and classifying visual data from the CIFAR-10 dataset.

Project Structure

root
├── README.md
├── project_report.pdf
├── EDA_ML_demo.ipynb
├── ResNet
    ├── Resnet18_train.ipynb
    ├── demo.ipynb
    ├── cifar10-resnet18.ckpt
├── CCT
    ├── code            
    │   ├── Configure.py          
    │   ├── DataLoader.py         
    │   ├── ImageUtils.py 
    |   ├── loss.py
    |   ├── main.py 
    |   ├── Model.py
    |   ├── Network.py
    |   ├── Readme.md
    |   ├── requirements.txt           
    └── saved_models
    |   ├── checkpoint.pth
├── multi_modal
    ├── demo.ipynb

Running Exploratory Data Analysis and Machine Learning models

cd root/ Open the python notebook EDA_ML_demo.ipynb to run the file

Running the ResNet model

cd root/ResNet/

--- Training the model ---

Open Resnet18_train.ipynb

--- Testing the model ---

Open demo.ipynb

Notes

Ensure that the cifar10-resnet18.ckpt is present while evaluating the model.

Running the Compact Convolutional Transformer model

cd root/CCT/code/

--- Training the model ---

python main.py "train" ./data

--- Testing the model ---

python main.py "test" ./data

Notes

Please note that CIFAR-10 data is downloaded into ./data folder for the first time.
Ensure the GPU mode is enabled as it takes lot of time in CPU mode.

Running the Multi-modal model

cd root/multi_modal

Open demo.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CIFAR-10 Vision Quest: Exploring Image Classification with Deep Learning

Overview

Training and Evaluation

Key Findings

Project Structure

Running Exploratory Data Analysis and Machine Learning models

Running the ResNet model

Notes

Running the Compact Convolutional Transformer model

Notes

Running the Multi-modal model

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
CCT		CCT
ResNet		ResNet
multi_modal		multi_modal
EDA_ML_demo.ipynb		EDA_ML_demo.ipynb
README.md		README.md
project_report.pdf		project_report.pdf

aakashdeepsil/CIFAR-10-Vision-Quest-Exploring-Image-Classification-with-Deep-Learning

Folders and files

Latest commit

History

Repository files navigation

CIFAR-10 Vision Quest: Exploring Image Classification with Deep Learning

Overview

Training and Evaluation

Key Findings

Project Structure

Running Exploratory Data Analysis and Machine Learning models

Running the ResNet model

Notes

Running the Compact Convolutional Transformer model

Notes

Running the Multi-modal model

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages