Smart Document Scanner (with OCR)

This is a small project I built to practice using FastAPI, OpenCV, and Tesseract OCR.
The app lets you upload a photo of a document (like class notes), fixes the perspective, makes it look like a clean scan, and then extracts the text. It also tries to auto-tag the document based on the words it finds.

Why I made it

At university it’s common to take photos of whiteboards, slides, or notes, but they’re often messy to read later. I wanted a tool that could clean them up and make the text searchable.

Features

Upload an image and get back a cleaned “scanned” version
Extract text using Tesseract OCR
Auto-generate simple tags from the text
Basic web interface built with HTML + FastAPI backend

Tech stack

Python (FastAPI, OpenCV, scikit-learn, pytesseract)
Frontend: plain HTML/JS (no framework, kept simple)
OCR: Tesseract

How to run (Windows)

Install Tesseract OCR.
Clone this repo: powershell git clone https://github.com//smart-doc-scanner.git cd smart-doc-scanner
Create a virtual environment: py -m venv .venv ..venv\Scripts\Activate
Install requirements: pip install -r requirements.txt
Run the server: uvicorn app.main:app --reload
Open http://127.0.0.1:8000 in web browser

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
app		app
static		static
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Smart Document Scanner (with OCR)

Why I made it

Features

Tech stack

How to run (Windows)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Smart Document Scanner (with OCR)

Why I made it

Features

Tech stack

How to run (Windows)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages