ViewMate: AI-Based Convenience Store Assistant for the Visually Impaired

ViewMate is an AI-powered assistance system designed to help visually impaired users shop independently in convenience stores.
The system supports real-time corner recognition and product information reading by combining object detection, OCR, and text-to-speech technologies.

🚀 Overview

ViewMate enables users to:

Identify convenience store corners (snacks, beverages, ramen, instant meals) using real-time vision detection
Read product information such as name, nutrition facts, and expiration date
Receive all information through clear audio guidance

The goal is to improve accessibility, independence, and usability during everyday shopping.

🌟 Key Features

1. Corner Recognition Mode

Detects store corners and informs the user with audio feedback.

How it works:

The user points the camera toward store shelves (auto or manual trigger).
YOLOv8n detects corner categories.
The system analyzes bounding box positions to determine left / right / front.
TTS provides real-time guidance.
- “Left: Beverage corner. Right: Snack corner.”

2. Product Information Reading Mode

Extracts product information while the user rotates the item.

Process:

User activates product mode via voice command or button.
System prompts: “Please rotate the product slowly.”
YOLOv8n detects product tag, nutrition facts, and date regions.
Detected boxes are cropped and processed using Google Vision OCR.
TTS outputs the extracted information.

Example Output:
“Product: Chilsung Cider, 250ml, 120kcal. Expiration date: December 28, 2024.”

🧠 Model Architecture

YOLOv8n — real-time object detection
Google Cloud Vision API — OCR for text extraction
Text-to-Speech (TTS) — audio guidance
Custom data pipeline featuring:
- In-store image data
- Augmentation to enhance robustness
- Balanced training/validation splits

Performance:

Average F1-score 90%+
Evaluation metrics:
- mAP, class loss, box loss, DFL loss
- Confusion matrix
- F1-score per class

📐 System Workflow

Corner Recognition

Camera → YOLOv8n → Corner Classification → Position Analysis → Audio Output

Product Information Reading

Camera → YOLOv8n (tag/nutrition/date) → Crop → OCR → Text Parsing → Audio Output

🎥 Demo Video

The demo includes:

Corner recognition in a simulated convenience-store environment
Product information reading with real packaged goods
Robust performance under diverse lighting and angles

(Insert video link here)

🛠️ Tech Stack

Python
YOLOv8n (Ultralytics)
Google Cloud Vision API
TTS (Text-To-Speech)
OpenCV
Custom data collection & preprocessing pipeline

Project Duration

October 2024 – December 2024

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
11_29/annotation		11_29/annotation
data		data
OBD_OCR_svc.ipynb		OBD_OCR_svc.ipynb
OCR_best frame.ipynb		OCR_best frame.ipynb
README.md		README.md
config.yaml		config.yaml
data_augmentation.ipynb		data_augmentation.ipynb
data_random_split.ipynb		data_random_split.ipynb
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ViewMate: AI-Based Convenience Store Assistant for the Visually Impaired

🚀 Overview

🌟 Key Features

1. Corner Recognition Mode

2. Product Information Reading Mode

🧠 Model Architecture

📐 System Workflow

Corner Recognition

Product Information Reading

🎥 Demo Video

🛠️ Tech Stack

Project Duration

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ViewMate: AI-Based Convenience Store Assistant for the Visually Impaired

🚀 Overview

🌟 Key Features

1. Corner Recognition Mode

2. Product Information Reading Mode

🧠 Model Architecture

📐 System Workflow

Corner Recognition

Product Information Reading

🎥 Demo Video

🛠️ Tech Stack

Project Duration

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages