GitHub - naravid19/typhoon-ocr: Windows-focused fork of Typhoon OCR featuring a modern Next.js web app. Supports multi-page PDF/image OCR to Markdown/HTML, interactive preview, and URL import.

Typhoon OCR

A powerful OCR application for extracting structured markdown from images and PDFs
Explore the docs »

View Demo · Report Bug · Request Feature

Table of Contents

About The Project
- Built With
Getting Started
- Prerequisites
- Installation
Usage
Features
Roadmap
Contributing
License
Changelog
Contact
Acknowledgments

About The Project

Typhoon OCR is a model for extracting structured markdown from images or PDFs. It supports document layout analysis and table extraction, returning results in markdown or HTML.

This fork provides a modern Next.js web application alongside the original Gradio demo, featuring:

🚀 Modern UI with dark theme and premium aesthetics
📄 Multi-page PDF support with interactive page selection
🔗 URL import for loading documents directly from the web
📊 Real-time progress indicators during OCR processing
🎨 Compare mode to view original and extracted text side-by-side

This fork focuses on Windows 10/11. For macOS/Linux setup, please refer to the official Typhoon OCR repository.

📝 See CHANGELOG.md for latest updates.

(back to top)

Built With

(back to top)

Getting Started

To get a local copy up and running follow these steps.

Prerequisites

Windows 10/11 with Python 3.10+
Node.js 18+ with npm
Poppler (for PDF processing)

Install Poppler using PowerShell:

iwr -useb https://github.com/oschwartz10612/poppler-windows/releases/download/v25.07.0-0/Release-25.07.0-0.zip -OutFile $env:TEMP\poppler.zip; rm C:\poppler -Recurse -Force -ErrorAction SilentlyContinue; Expand-Archive $env:TEMP\poppler.zip C:\poppler -Force; $bin=(Get-ChildItem C:\poppler -Recurse -Filter pdfinfo.exe | Select-Object -First 1).DirectoryName; if(-not $bin){throw "pdfinfo.exe not found under C:\poppler"}; $u=[Environment]::GetEnvironmentVariable('Path','User'); if([string]::IsNullOrEmpty($u)){$u=''}; if($u -notlike "*$bin*"){[Environment]::SetEnvironmentVariable('Path', ($u.TrimEnd(';')+';'+$bin).Trim(';'), 'User')}; $env:Path+=';'+$bin; pdfinfo -v

Verify installation:

pdfinfo -v
pdftoppm -v

Installation

Clone the repo

git clone https://github.com/naravid19/typhoon-ocr.git
cd typhoon-ocr

Configure environment

Create a .env file in the project root:

TYPHOON_BASE_URL=https://api.opentyphoon.ai/v1
TYPHOON_API_KEY=YOUR_API_KEY
TYPHOON_OCR_MODEL=typhoon-ocr

Set up Backend (Python)

python -m venv venv
.\venv\Scripts\activate
pip install -r backend/requirements.txt

Set up Frontend (Next.js)
```
cd frontend
npm install
```
Run the application

Option A: One-Click Start (Recommended)

Simply double-click the start_app.bat file in the project root.

The script automatically detects your virtual environment and opens the browser for you.

Option B: Manual Start

Terminal 1 - Backend:
```
python -m uvicorn backend.main:app --reload --port 8000
```
Terminal 2 - Frontend:
```
cd frontend
npm run dev
```
Open in browser

Navigate to http://localhost:3000/ocr

(back to top)

Usage

Upload a document - Drag & drop or click to upload PDF/images
Import from URL - Paste a URL to load documents directly from the web
Select pages - For multi-page PDFs, select specific pages or use quick actions (Select All, Odd/Even, Range)
Configure parameters - Adjust temperature, top_p, and other OCR settings
Run OCR - Click "Run OCR" and monitor progress
View results - Switch between Combined and Compare views

(back to top)

Features

✅ Upload PDF or images (PNG, JPG, WebP)
✅ Multi-page PDF selection with visual grid preview
✅ Import documents from URL (with CORS proxy)
✅ Shift-click for range selection
✅ Quick actions: Select All, Odd/Even pages, Custom range
✅ Two task types: default (Markdown) and structure (HTML tables)
✅ Real-time progress indicator
✅ Compare mode: Original image vs. extracted text
✅ Copy extracted text with one click
✅ Code generator for API integration

(back to top)

Roadmap

See the open issues for a full list of proposed features (and known issues).

(back to top)

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

Fork the Project
Create your Feature Branch (git checkout -b feature/AmazingFeature)
Commit your Changes (git commit -m 'Add some AmazingFeature')
Push to the Branch (git push origin feature/AmazingFeature)
Open a Pull Request

(back to top)

License

Distributed under the Apache 2.0 License. See LICENSE for more information.

(back to top)

Contact

Project Link: https://github.com/naravid19/typhoon-ocr

(back to top)

Acknowledgments

SCB10X Typhoon OCR - Original project
OpenAI - API compatibility
Best-README-Template - README template
Shields.io - Badges

(back to top)

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
backend		backend
examples		examples
frontend		frontend
images		images
packages/typhoon_ocr		packages/typhoon_ocr
.env.template		.env.template
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
README.th.md		README.th.md
app.py		app.py
requirements.txt		requirements.txt
start_app.bat		start_app.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Typhoon OCR

About The Project

Built With

Getting Started

Prerequisites

Installation

Option A: One-Click Start (Recommended)

Option B: Manual Start

Usage

Features

Roadmap

Contributing

License

Contact

Acknowledgments

About

Uh oh!

Releases

Packages

Languages

License

naravid19/typhoon-ocr

Folders and files

Latest commit

History

Repository files navigation

Typhoon OCR

About The Project

Built With

Getting Started

Prerequisites

Installation

Option A: One-Click Start (Recommended)

Option B: Manual Start

Usage

Features

Roadmap

Contributing

License

Contact

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages