Skip to content

lenni991/imageTo_Text_n_Voice

Repository files navigation

Image to text & voice

A simple but tool that converts text from images into audible speech.

Overview

ImageTo_Text_n_Voice extracts text content from images using Optical Character Recognition (OCR) technology and converts the extracted text into speech. This tool is perfect for accessibility purposes, content consumption on the go, or processing text-based images into audio format.

Functionality

  • Image Processing: Upload and process various image formats containing text
  • Text Extraction: Uses OCR to accurately extract text content from images
  • Speech Synthesis: Converts extracted text into natural-sounding speech
  • Multi-language Support: Works with multiple languages for both text recognition and speech output
  • Batch Processing: Process multiple images in a single operation

Tools & Technologies

  • Tesseract OCR: For optical character recognition
  • Python: Primary programming language
  • PIL/Pillow: For image preprocessing
  • gTTS (Google Text-to-Speech): For text-to-speech conversion
  • Flask: Web framework for the interface (if applicable)

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published