BookGraph turns your reading list and research papers into a dynamic knowledge globe and an AI-powered discovery engine.
It ingests books and documents, extracts concepts using LLM-backed agents, builds structural relationships in Neo4j, and lets you interact with your knowledge through a real-time neural map and streaming AI chat.
- Multi-Modal Ingestion: Add items from Open Library, Google Books, arXiv, or upload local PDFs.
- Automated Enrichment: LLM agents "read" metadata/PDF text to extract core concepts, fields, and bibliographic data.
- Strategic Graphing: Automatically builds relationships between new and existing items (e.g., Influenced By, Contradicts, Expands).
- Knowledge Globe: Visualize your entire intellectual landscape as an interactive, high-performance "galaxy" of nodes.
- Queryable Intelligence: Talk to your library using real-time streaming chat that can perform complex Cypher reasoning over the graph's structure.
- Knowledge Ingestion: Seamless addition of books and academic papers with automated metadata extraction.
- Neural Globe: High-performance canvas-based visualization of your knowledge base with organic physics.
- Structural Reasoning: AI Chat that writes Cypher queries to answer structural questions (e.g., "Find authors who wrote about both Physics and Philosophy").
- Real-time Streaming: Token-by-token AI responses for a modern messaging experience.
- Discovery Engine: Automated background detection of thematic clusters and reading paths.
- Resource Management: Easily curate your graph with a "Recently Ingested" dashboard and node deletion.
- Frontend: Next.js + react-force-graph-2d
- Backend: FastAPI + python-multipart + PyPDF2
- Graph DB: Neo4j
- LLM providers: OpenAI, OpenRouter, or Ollama
```
bookgraph/
├── backend/
│   ├── app/
│   │   ├── agents/        # LLM agent logic (Chat, Metadata, Relationship)
│   │   ├── api/           # FastAPI routes and schemas
│   │   ├── enrichment/    # Concept extraction logic
│   │   ├── graph/         # Neo4j repository layer
│   │   ├── ingestion/     # API clients (Google Books, arXiv, Open Library)
│   │   └── services/      # Business logic orchestration
│   ├── main.py
│   └── requirements.txt
├── frontend/
│   ├── app/               # Next.js App Router (Ingestion, Chat, Globe)
│   ├── components/        # Shared UI and graph canvas
│   ├── public/            # Static assets (favicon)
│   └── lib/               # API utilities
├── docker/
│   └── docker-compose.yml
└── README.md
```
```bash
cd docker
docker compose up --build
```

- Frontend: http://localhost:3000
- Backend: http://localhost:8000
- API docs: http://localhost:8000/docs
- Neo4j Browser: http://localhost:7474 (neo4j/bookgraph)
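For orientation, a minimal sketch of what a Compose file for these three services could look like. The service names, images, and build paths below are illustrative assumptions; the project's actual configuration lives in `docker/docker-compose.yml`.

```yaml
# Illustrative sketch only; see docker/docker-compose.yml for the real file.
services:
  neo4j:
    image: neo4j:5
    ports: ["7474:7474", "7687:7687"]
    environment:
      NEO4J_AUTH: neo4j/bookgraph   # matches the Neo4j Browser login above
  backend:
    build: ../backend
    ports: ["8000:8000"]
    depends_on: [neo4j]
  frontend:
    build: ../frontend
    ports: ["3000:3000"]
    depends_on: [backend]
```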
Backend:

```bash
cd backend
python3 -m venv .venv
source .venv/bin/activate  # or .venv\Scripts\activate on Windows
pip install -r requirements.txt
uvicorn main:app --reload --port 8000
```

Frontend:

```bash
cd frontend
npm install
npm run dev
```

API endpoints:

- `POST /books` | `POST /google-books` | `POST /papers`: Ingest resources
- `POST /pdf`: Upload and extract metadata from a local PDF
- `GET /graph`: Fetch the global snapshot for the Globe view
- `DELETE /graph/nodes/{node_id}`: Remove specific items/nodes
- `POST /chat/stream`: Streaming AI chat with graph context
- `GET /discoveries`: View automated graph insights
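A client consumes `POST /chat/stream` token by token. The sketch below shows one way to assemble such a stream; the newline-delimited `{"token": ...}` framing and the payload shape are assumptions for illustration, not the project's documented wire format (check the routes in `backend/app/api` for the real contract).

```python
import json
from typing import Iterable, Iterator

def iter_tokens(lines: Iterable[bytes]) -> Iterator[str]:
    """Decode newline-delimited JSON chunks shaped like {"token": "..."}."""
    for raw in lines:
        if raw:  # skip keep-alive blank lines
            yield json.loads(raw)["token"]

def collect_reply(lines: Iterable[bytes]) -> str:
    """Join streamed tokens into the full reply (a UI would render each live)."""
    return "".join(iter_tokens(lines))

# Simulated wire data; with `requests` you would instead pass
# requests.post(url, json={"message": ...}, stream=True).iter_lines().
wire = [b'{"token": "Hello"}', b'{"token": " world"}']
print(collect_reply(wire))  # → Hello world
```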
- Nodes: `Book`, `Paper`, `Author`, `Concept`, `Field`
- Relationships: `WRITTEN_BY`, `MENTIONS`, `BELONGS_TO`, `RELATED_TO`, `INFLUENCED_BY`, `CONTRADICTS`, `EXPANDS`
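Against this schema, a structural question like "find authors who wrote about both Physics and Philosophy" could compile to Cypher along these lines. This is a hedged sketch: the relationship directions and the `name` property are assumptions, and the chat agent's generated query may differ.

```cypher
// Illustrative query; actual agent-generated Cypher may differ.
MATCH (a:Author)<-[:WRITTEN_BY]-(:Book)-[:BELONGS_TO]->(:Field {name: "Physics"}),
      (a)<-[:WRITTEN_BY]-(:Book)-[:BELONGS_TO]->(:Field {name: "Philosophy"})
RETURN DISTINCT a.name
```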
Set the provider in the backend `.env`:

- OpenAI: `MODEL_PROVIDER=openai`
- OpenRouter: `MODEL_PROVIDER=openrouter`
- Ollama: `MODEL_PROVIDER=ollama`
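A minimal sketch of how the backend might resolve this setting at startup. The three provider names come from this README; the function and default below are illustrative, not the project's actual implementation.

```python
SUPPORTED = {"openai", "openrouter", "ollama"}

def resolve_provider(env: dict) -> str:
    """Return the configured LLM provider, defaulting to 'openai' (assumed)."""
    provider = env.get("MODEL_PROVIDER", "openai").lower()
    if provider not in SUPPORTED:
        raise ValueError(f"Unknown MODEL_PROVIDER: {provider}")
    return provider

print(resolve_provider({"MODEL_PROVIDER": "ollama"}))  # → ollama
```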
```mermaid
flowchart TD
    UI["Next.js Frontend"] <--> API["FastAPI Backend"]
    API --> INGEST["Multi-modal Ingestion (Books/PDF/arXiv)"]
    INGEST --> AGENTS["AI Enrichment Agents"]
    AGENTS <--> LLM["Pluggable LLM (Streaming)"]
    AGENTS --> NEO["Neo4j Graph DB"]
    API <--> NEO
    NEO --> DISC["Discovery & Exploration Jobs"]
```
- PDF Full-Text Search: Vector indexing of entire document contents.
- Author Influence Mapping: Deep-dive into author citation networks.
- YouTube/Podcasts: Transcribing and graphing audio-visual knowledge.
- Browser Extension: One-click ingestion from Amazon or arXiv.