Image to text & voice

A simple but tool that converts text from images into audible speech.

Overview

ImageTo_Text_n_Voice extracts text content from images using Optical Character Recognition (OCR) technology and converts the extracted text into speech. This tool is perfect for accessibility purposes, content consumption on the go, or processing text-based images into audio format.

Functionality

Image Processing: Upload and process various image formats containing text
Text Extraction: Uses OCR to accurately extract text content from images
Speech Synthesis: Converts extracted text into natural-sounding speech
Multi-language Support: Works with multiple languages for both text recognition and speech output
Batch Processing: Process multiple images in a single operation

Tools & Technologies

Tesseract OCR: For optical character recognition
Python: Primary programming language
PIL/Pillow: For image preprocessing
gTTS (Google Text-to-Speech): For text-to-speech conversion
Flask: Web framework for the interface (if applicable)

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
Arabic_OCR.png		Arabic_OCR.png
Arabic_imageToText&Voice.ipynb		Arabic_imageToText&Voice.ipynb
English_imageToText&voice.ipynb		English_imageToText&voice.ipynb
LICENSE		LICENSE
README.md		README.md
english_OCR.png		english_OCR.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Image to text & voice

Overview

Functionality

Tools & Technologies

About

Uh oh!

Releases

Packages

Languages

License

lenni991/imageTo_Text_n_Voice

Folders and files

Latest commit

History

Repository files navigation

Image to text & voice

Overview

Functionality

Tools & Technologies

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages