Skip to content

Latest commit

 

History

History
58 lines (42 loc) · 2.91 KB

File metadata and controls

58 lines (42 loc) · 2.91 KB

Backend Tests

The Smart Document Router is an open source document processing data layer.

  • It ingests unstructured docs through REST APIs and integrations from faxes, emails, and ERPs.
  • It processes documents at scale with OCR and LLMs
  • And it chunks, embeds, and organizes documents into queriable knowledge bases

The Document Router is designed to work standalone or with a human-in-the-loop, and can process medical, insurance, financial, supply chain, and legal documents.

It acts as a system of record for the extraction schemas and prompts, and it is portable over all major clouds and LLM providers.

A Document Agent is available to configure prompts and extractions, and to review processed results.

Quick start (Docker)

Requires Docker on Linux, macOS, or Windows (WSL).

curl -fsSL https://raw.githubusercontent.com/analytiq-hub/doc-router/main/tools/run-doc-router-docker.sh | bash -s -- up
curl -fsSL https://raw.githubusercontent.com/analytiq-hub/doc-router/main/tools/run-doc-router-docker.sh | bash -s -- down

The script prints URLs and sign-in credentials after up. More detail: Docker setup, docrouter_docker, and tools/run-doc-router-docker.sh.

Tech stack

  • NextJS, NextAuth, MaterialUI, TailwindCSS
  • FastAPI
  • MongoDB
  • Pydantic
  • LiteLLM
  • OpenAI, Anthropic, Gemini, Vertex AI for GCP, AWS Bedrock, xAI, OpenRouter...

PyData Boston DocRouter Slides (Feb '24) have more details about tech stack, and how Cursor AI was used to build the DocRouter.

User Experience

Document List Document Agent

Example Deployment

Smart Document Router

Presentations

Docs