A document viewer for viewing PDF documents with parsed content overlays. Runs entirely on your machine with no external dependencies.
- View PDF Documents: Display PDFs with zoom controls and page navigation
- View Parsed Data: See extracted content with bounding boxes overlaid on the PDF
- Multiple Sections: Browse documents in Parsed Documents, My Documents, and Word-Bbox sections
- Auto-Discovery: Documents are automatically discovered from folders
Check if Python is installed:
python3 --versionIf not installed, download from python.org/downloads (Python 3.10+).
cd backend
pip install -r requirements.txtpython -m uvicorn app.main:app --port 8001 --reloadGo to http://localhost:8001 in your browser.
Drop files into the appropriate folders:
| Section | PDF Location | Parsed JSON Location |
|---|---|---|
| PARSED DOCUMENTS | backend/files/parsed_documents/pdfs/ |
backend/files/parsed_documents/parsed/ |
| MY DOCUMENTS | backend/files/sectioned_documents/pdfs/ |
backend/files/runs/{doc-id}/ |
| WORD-BBOX | backend/files/matched_documents/pdfs/ |
backend/files/matched_documents/parsed/ |
Refresh the browser after adding files.
backend/files/
├── parsed_documents/
│ ├── pdfs/ # PDF files
│ └── parsed/ # Matching JSON files
├── sectioned_documents/
│ └── pdfs/ # PDF files
├── matched_documents/
│ ├── pdfs/ # PDF files
│ └── parsed/ # Matching JSON files
└── runs/
└── {doc-id}/ # JSON files for MY DOCUMENTS section
| Problem | Solution |
|---|---|
| "python not found" | Install Python or use python3 |
| "Port already in use" | Use different port: --port 8002 |
| "No module named fastapi" | Run pip install -r requirements.txt in backend folder |
| Documents don't appear | Check folder structure, refresh browser |
- Swagger UI: http://localhost:8001/docs
- ReDoc: http://localhost:8001/redoc