🛒 Retail AI Pipeline – Product Detection & Grouping

📌 Overview

This project implements a production-style AI pipeline to detect and group similar retail products from shelf images.

It is built using a microservice architecture with clear separation of responsibilities:

Detector Service → detects product bounding boxes
Grouping Service → groups visually + spatially similar products
Gateway Service → orchestrates services and provides UI

🏗️ Architecture


Client (UI)
↓
Gateway Service (Flask)
↓
Detector Service (YOLO)
↓
Grouping Service (ResNet + Clustering)
↓
Final Output (Image + JSON)

⚙️ Services

1️⃣ Gateway Service

Handles image upload
Orchestrates detector → grouping
Returns final JSON + image
Serves output images

2️⃣ Detector Service

Uses YOLOv8
Implements:
- Sliding window detection
- Low confidence threshold
- Area filtering
- Non-Max Suppression (NMS)
- Aspect ratio filtering
- Duplicate removal
Outputs:
- Bounding boxes
- Cropped images

3️⃣ Grouping Service

Uses ResNet18 embeddings
Combines:
- Visual features
- Spatial features
Applies:
- Feature normalization
- Agglomerative clustering
Outputs:
- Group IDs
- Annotated image

🧠 Key Design Decisions

Microservices → modular & scalable
Sliding window detection → improves recall
Spatial + visual fusion → better grouping
Config-driven system → no hardcoding
Logging → easier debugging

🚀 How to Run

Dev Mode

🔹 1. Create Virtual Environment (Recommended)

Windows

python -m venv venv
venv\Scripts\activate

Mac/Linux

python3 -m venv venv
source venv/bin/activate

🔹 2. Install dependencies

pip install -r requirements/requirements-dev.txt

🔹 3. Start services

Option A (recommended)

python run_all.py

Option B (manual)

# Terminal 1
cd detector_service
python app.py

# Terminal 2
cd grouping_service
python app.py

# Terminal 3
cd gateway
python app.py

🔹 4. Open UI

http://127.0.0.1:5000

🐳 Docker Deployment

🔹 Run with Docker

cd docker
docker-compose up --build

🔹 Access UI

http://localhost:5000

🔹 Service Ports

Service	Port
Gateway	5000
Detector	8001
Grouping	8002

🧪 API Flow

Upload image → Gateway
Gateway → Detector
Gateway → Grouping
Final response returned

Example Response

{
  "request_id": "...",
  "output_image": "/outputs/result_xxx.jpg",
  "results": [
    {
      "bbox": [x1, y1, x2, y2],
      "group_id": 0
    }
  ]
}

📂 Project Structure

project/
│
├── notebooks/
│   └── model.ipynb
├── gateway/
├── detector_service/
├── grouping_service/
├── logs/
├── docs/
├── models/
├── outputs/
├── docker/
│   └── docker-compose.yml
├── run_all.py
├── requirements.txt
└── README.md

📊 Notebook (Model Development)

The notebook (notebooks/model.ipynb) was used during the experimentation phase.

🔍 Purpose

Prototyping detection pipeline
Testing slicing strategy
Developing filtering logic
Validating grouping approach
Analyzing clustering behavior

🔄 Transition to Production

Notebook	Production
Inline code	Microservices
Hardcoded values	Config-driven
Sequential flow	API-based pipeline

🎯 Key Learnings Applied

Sliding window improves recall
Area filtering removes noise
Spatial + visual features improve grouping
Normalization stabilizes clustering

🔧 Configuration & Tuning

All parameters are configurable via config.py.

🎯 Detector Config

SLICE_SIZE = 512
OVERLAP = 0.4
IOU_THRESHOLD = 0.4
MIN_AREA_RATIO = 0.0005
MAX_AREA_RATIO = 0.03

🎯 Grouping Config

DISTANCE_THRESHOLD = 0.6
SPATIAL_WEIGHT = 0.1
IMAGE_SIZE = 224
CLUSTERING_METRIC = "euclidean"
CLUSTERING_LINKAGE = "average"

🔍 Parameter Insights

DISTANCE_THRESHOLD
- lower → more groups
- higher → fewer groups
SPATIAL_WEIGHT
- 0 → visual only
- 0.1 → balanced
- higher → spatial bias

🧪 Experiments

Increase DISTANCE_THRESHOLD → merge clusters
Reduce SPATIAL_WEIGHT → visual grouping
Increase OVERLAP → better detection

⚙️ Environment Variables

DISTANCE_THRESHOLD=0.7
SPATIAL_WEIGHT=0.2

🖼️ Example Output

Below is an example demonstrating detection and grouping results from the pipeline.

📥 Input Image

📤 Output (Grouped Products)

📊 Sample JSON Output

{
  "request_id": "example-id",
  "output_image": "/outputs/result_example.jpg",
  "results": [
    {
      "bbox": [100, 200, 300, 400],
      "group_id": 2
    },
    {
      "bbox": [320, 210, 500, 390],
      "group_id": 2
    }
  ]
}

⚡ Features

End-to-end ML pipeline
Microservice architecture
UI + API integration
Config-driven design
Logging support

🔮 Future Improvements

Fine-tuned embeddings
Better clustering (DBSCAN / metric learning)
Independent service scaling

🏁 Conclusion

This project demonstrates the transition from:

Notebook → Production-ready ML system

Combining:

Machine Learning
Backend Engineering
System Design

👤 Author

Aman Gupta

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
common		common
detector_service		detector_service
docker		docker
docs		docs
gateway		gateway
grouping_service		grouping_service
models		models
notebooks		notebooks
requirements		requirements
sample_images		sample_images
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
run_all.py		run_all.py

Folders and files

Latest commit

History

Repository files navigation

🛒 Retail AI Pipeline – Product Detection & Grouping

📌 Overview

🏗️ Architecture

⚙️ Services

1️⃣ Gateway Service

2️⃣ Detector Service

3️⃣ Grouping Service

🧠 Key Design Decisions

🚀 How to Run

Dev Mode

🔹 1. Create Virtual Environment (Recommended)

Windows

Mac/Linux

🔹 2. Install dependencies

🔹 3. Start services

Option A (recommended)

Option B (manual)

🔹 4. Open UI

🐳 Docker Deployment

🔹 Run with Docker

🔹 Access UI

🔹 Service Ports

🧪 API Flow

Example Response

📂 Project Structure

📊 Notebook (Model Development)

🔍 Purpose

🔄 Transition to Production

🎯 Key Learnings Applied

🔧 Configuration & Tuning

🎯 Detector Config

🎯 Grouping Config

🔍 Parameter Insights

🧪 Experiments

⚙️ Environment Variables

🖼️ Example Output

📥 Input Image

📤 Output (Grouped Products)

📊 Sample JSON Output

⚡ Features

🔮 Future Improvements

🏁 Conclusion

👤 Author

About

Topics

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages