-
Notifications
You must be signed in to change notification settings - Fork 181
Description
OCR • Translation • Text-to-Speech Activities
A set of three interconnected Sugar Labs activities designed to improve accessibility, multilingual education, and inclusive learning.
The activities work independently and together, forming a seamless learning pipeline from real-world text to spoken audio.
🌟 Overview
The suite consists of:
- SugarScan – OCR for printed and handwritten text
- SugarTranslate – Multilingual text translation
- SugarSpeak – Text-to-Speech conversion
Together, these activities enable learners to:
- Digitize handwritten or printed text
- Translate content into multiple languages
- Listen to text through speech output
All activities are connected using the Sugar Journal, following Sugar OS design principles.
🧠 Activities Description
1️⃣ SugarScan – OCR Activity
📌 Description
SugarScan converts printed and handwritten text from images into editable digital text.
✨ Features
-
Capture image using camera or upload image
-
OCR support for:
- Printed text (books, worksheets)
- Handwritten text (notes, board writing)
-
Multi-language OCR support
-
Editable text output
-
Save extracted text to Sugar Journal
-
Export text as
.txtor.pdf
🎓 Educational Value
- Digitizes learning materials
- Helps preserve handwritten notes
- Improves accessibility
- Encourages digital literacy
🔗 Integration
- Saves OCR output to Sugar Journal
- Adds metadata tag:
ocr-text - Output can be used by SugarTranslate and SugarSpeak
2️⃣ SugarTranslate – Translation Activity
📌 Description
SugarTranslate allows learners to translate text between multiple languages, supporting multilingual and regional education.
✨ Features
- Manual text input or import from SugarScan
- Supports multiple language pairs (e.g., English ↔ Hindi)
- Side-by-side view of original and translated text
- Vocabulary and word highlighting
- Offline translation for basic languages
- Save translated text to Sugar Journal
🎓 Educational Value
- Enhances language learning
- Supports mother-tongue education
- Encourages cross-cultural understanding
🔗 Integration
- Reads OCR text from Sugar Journal
- Saves translated text with language metadata
- Output can be passed to SugarSpeak
3️⃣ SugarSpeak – Text-to-Speech Activity
📌 Description
SugarSpeak converts text into spoken audio, making learning more accessible and interactive.
✨ Features
- Text-to-Speech conversion
- Multiple voices and accents
- Adjustable speed and pitch
- Multi-language speech support
- Text highlighting during playback
- Save generated audio to Sugar Journal
🎓 Educational Value
- Helps early readers
- Supports dyslexic and visually impaired learners
- Improves pronunciation and listening skills
🔗 Integration
- Reads text from SugarScan or SugarTranslate
- Saves audio output to Journal
- Can be replayed or shared
🔄 How the Activities Are Connected
The activities are loosely coupled and connected via the Sugar Journal, allowing smooth transitions without tight dependencies.
📈 Learning Flow
📷 Image (Printed / Handwritten)
↓
🧠 SugarScan (OCR)
↓
📄 Digital Text (Journal)
↓
🌍 SugarTranslate
↓
🈯 Translated Text (Journal)
↓
🔊 SugarSpeak
↓
🎧 Spoken Audio
🗂️ Technical Integration
Shared Components
-
Sugar Journal as the central data store
-
Metadata tags:
ocrlanguagetranslatedaudio
-
Activities read and write Journal entries
Design Benefits
- Modular and maintainable
- Activities work independently
- Easy content reuse
- Scalable for future extensions
🎓 Learning Outcomes
| Skill Area | Supported By |
|---|---|
| Reading | OCR + Text-to-Speech |
| Writing | OCR Editing |
| Listening | Text-to-Speech |
| Language Learning | Translation |
| Accessibility | OCR + TTS |
| Multilingual Education | All Activities |
🌱 Why This Project Fits Sugar Labs
- Follows learning-by-doing philosophy
- Strong focus on accessibility and inclusion
- Open-source friendly
- Suitable for classrooms and self-learning
- Ideal for Sugar Activities and GSoC projects
🚀 Future Enhancements
- Text summarization
- Braille output support
- Sign-language video generation
- AI-based handwriting improvement
- Offline-first models