This project is a multi-modal product search engine that lets users enter a text query or an image query and retrieves the top matching fashion product images from a dataset using deep learning and vector similarity.
It uses CLIP (Contrastive Language–Image Pre-training) for creating embeddings from both images and text, and FAISS for efficient nearest neighbor search.
Real-world applications:
- E-Commerce Product Search
- Visual Search (Search by Image or Text)
- Digital Asset Management (DAM)
- Content Moderation & Compliance
- Game Asset or 3D Model Search
We use a subset of the Fashion Product Images (Small) dataset from Kaggle, consisting of:
- ~15,000 product images
- styles.csv: CSV file mapping each product to its metadata
- styles/: folder containing per-image JSON files with category, subcategory, and description fields
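Loading the metadata might look like the sketch below. The two sample rows and the column names (id, masterCategory, subCategory, productDisplayName) are assumptions about the Kaggle file's shape; real code would simply call pd.read_csv("styles.csv").

```python
import io
import pandas as pd

# Hypothetical two-row sample in the shape of styles.csv; the real
# dataset is read directly from disk. Column names are assumptions.
sample = io.StringIO(
    "id,masterCategory,subCategory,productDisplayName\n"
    "15970,Apparel,Topwear,Turtle Check Men Navy Blue Shirt\n"
    "39386,Apparel,Bottomwear,Peter England Men Party Blue Jeans\n"
)
df = pd.read_csv(sample)

# Derive the image filename for each product so embeddings can be
# joined back to their metadata after indexing.
df["image_path"] = df["id"].astype(str) + ".jpg"
print(df[["image_path", "masterCategory"]].iloc[0].tolist())
# -> ['15970.jpg', 'Apparel']
```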


