Skip to content

Conversation

@OsamaRab3
Copy link
Owner

Description

This PR adds OCR (Optical Character Recognition) functionality to Filto, enabling users to extract text from images and scanned documents directly from the command line.

Key Features

  • Added new ocr command for text extraction from images
  • Integrated Tesseract OCR engine for accurate text recognition
  • Supports multiple languages through Tesseract language packs
  • Works with common image formats (PNG, JPG, etc.)

Usage

# Basic usage
filto ocr input.png -l eng -o output.txt

# Example with Arabic text
filto ocr documento.png -l ara -o texto_extraido.txt

@OsamaRab3 OsamaRab3 merged commit 55fb7ad into main Nov 27, 2025
0 of 6 checks passed
@OsamaRab3 OsamaRab3 deleted the ocr branch November 27, 2025 19:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants