Skip to content

Commit 83195ae

Browse files
authored
Update README.md
1 parent 534c8bc commit 83195ae

File tree

1 file changed

+0
-9
lines changed

1 file changed

+0
-9
lines changed

README.md

Lines changed: 0 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -13,15 +13,6 @@ Includes improvements from 0.2:
1313
- Parallel job execution with a live status board.
1414
- 24h caching with optional `--no-cache` and cost reporting.
1515

16-
## What's New in 0.2 (2025)
17-
18-
- OCR integration via Tesseract to reliably handle low‑text/scan PDFs.
19-
- LiteLLM integration to open up multiple APIs and local models (OpenAI tested).
20-
- Parallelized job execution (OCR and AI) with a live status board and correct dependencies (images before text when both are enabled).
21-
- 24h caching for OCR and AI calls with optional `--no-cache` and spent vs saved cost reporting.
22-
- Systematic test integration: expanded unit and integration tests across OCR, LLM client, image selection, CLI, and pipeline.
23-
24-
2516
## What It Is
2617

2718
autoPDFtagger is a CLI for semi‑automatic classification, sorting, and tagging of PDF documents. It enriches PDFs with standard metadata using OCR + AI (text and images) and is explicitly built to handle difficult inputs like low‑quality scans and image‑heavy files (e.g., presentations). Your archive remains plain files and folders (no lock‑in), with optional JSON export for review and integration.

0 commit comments

Comments
 (0)