Skip to content

Yosiefeyob/smart-doc-scanner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Smart Document Scanner (with OCR)

This is a small project I built to practice using FastAPI, OpenCV, and Tesseract OCR.
The app lets you upload a photo of a document (like class notes), fixes the perspective, makes it look like a clean scan, and then extracts the text. It also tries to auto-tag the document based on the words it finds.

Why I made it

At university it’s common to take photos of whiteboards, slides, or notes, but they’re often messy to read later. I wanted a tool that could clean them up and make the text searchable.

Features

  • Upload an image and get back a cleaned “scanned” version
  • Extract text using Tesseract OCR
  • Auto-generate simple tags from the text
  • Basic web interface built with HTML + FastAPI backend

Tech stack

  • Python (FastAPI, OpenCV, scikit-learn, pytesseract)
  • Frontend: plain HTML/JS (no framework, kept simple)
  • OCR: Tesseract

How to run (Windows)

  1. Install Tesseract OCR.
  2. Clone this repo: powershell git clone https://github.com//smart-doc-scanner.git cd smart-doc-scanner
  3. Create a virtual environment: py -m venv .venv ..venv\Scripts\Activate
  4. Install requirements: pip install -r requirements.txt
  5. Run the server: uvicorn app.main:app --reload
  6. Open http://127.0.0.1:8000 in web browser

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors