A Python-based CLI application for intelligent document retrieval using advanced RAG (Retrieval-Augmented Generation) techniques with support for both vector and graph-based search.
- Docker and Docker Compose
- Python 3.11+
- Poetry
- Task (https://taskfile.dev)
# Start all infrastructure services
task up
# Install dependencies
task install
# Run migrations
task migrate-up
# Verify everything works
task health-check
The following services will be available:
- PostgreSQL: localhost:5432
- Qdrant UI: http://localhost:6334
- Neo4j Browser: http://localhost:7474
- MinIO Console: http://localhost:9001
- Ollama API: http://localhost:11434
- Redis: localhost:6379
InsightHub provides a comprehensive command-line interface following a simple, discoverable pattern.
All commands follow this consistent pattern:
task cli -- <resource> <action> [arguments]
Where:
- `<resource>` is what you're working with: workspace, document, chat, state, rag-options, default-rag-config
- `<action>` is what you want to do: list, create, show, select, upload, send, etc.
- `[arguments]` are specific to each action (like IDs or file paths)
The CLI is self-documenting. Use --help at any level:
# See all available resources
task cli -- --help
# See what you can do with a resource
task cli -- workspace --help
# See details for a specific action
task cli -- workspace create --help
Pro tip: whenever you're stuck or unsure, just add --help to discover what's available!
Check your current state:
task cli -- state show
Shows your currently selected workspace and chat session.
Discover available RAG algorithms:
task cli -- rag-options list
Lists vector, graph, and hybrid search methods with descriptions.
Get help on any resource:
task cli -- <resource> --help
Replace `<resource>` with workspace, document, chat, etc.
Pattern 1: List -> Show -> Select
# List all items
task cli -- workspace list
# Show details of one
task cli -- workspace show 1
# Select/activate it
task cli -- workspace select 1
Pattern 2: Create -> Verify -> Use
# Create (interactive prompts)
task cli -- workspace create
# Verify it was created
task cli -- workspace list
# Select it for use
task cli -- workspace select <id>
Pattern 3: Check State -> Act -> Verify
# Check current context
task cli -- state show
# Perform action
task cli -- document add file.pdf
# Verify the result
task cli -- document list
Workspace management:
task cli -- workspace create # Create a workspace (interactive)
task cli -- workspace list # List all workspaces
task cli -- workspace select 1 # Select active workspace
task cli -- workspace show 1 # Show workspace details
Document management:
task cli -- document add file.pdf # Add a document
task cli -- document list # List documents in workspace
task cli -- document show 1 # Show document details
task cli -- document remove file.pdf # Remove a document
Chat operations:
task cli -- chat create 1 # Create new chat session in workspace
task cli -- chat list 1 # List sessions in workspace
task cli -- chat select 1 # Select active session
task cli -- chat send "Your message" # Send message to session
task cli -- chat history # Show message history
Configuration and state:
task cli -- state show # Show selected workspace/session
task cli -- rag-options list # List all RAG algorithms
task cli -- default-rag-config show # Show default RAG settings
task cli -- default-rag-config create # Configure RAG settings (interactive)
For comprehensive tutorials and guides, see the tutorials directory:
Each tutorial includes CLI tips, help examples, and best practices.
InsightHub uses a domain-driven design architecture with clean separation of concerns:
- CLI Interface: Task-based command-line interface for all operations
- Domain Layer: Business logic organized by bounded contexts (workspace, document, chat, state, rag_options)
- Infrastructure Layer: Generic utilities, RAG workflows, LLM providers, caching, and a RAG store manager
- Data Layer: PostgreSQL with pgvector, Qdrant, Neo4j, Redis cache
Each domain contains its own models, repositories, services, orchestrators, validation, and CLI commands, ensuring true domain isolation and maintainability.
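The layering described above can be sketched as follows. This is a minimal, illustrative example of the repository/service split within a domain; all class and method names here are hypothetical and are not the project's actual API.

```python
from dataclasses import dataclass


@dataclass
class Workspace:
    id: int
    name: str


class WorkspaceRepository:
    """Data persistence (repositories.py): storage access only."""

    def __init__(self) -> None:
        self._rows: dict[int, Workspace] = {}

    def save(self, name: str) -> Workspace:
        ws = Workspace(id=len(self._rows) + 1, name=name)
        self._rows[ws.id] = ws
        return ws

    def list_all(self) -> list[Workspace]:
        return list(self._rows.values())


class WorkspaceService:
    """Business logic (service.py): validation and rules, no storage details."""

    def __init__(self, repo: WorkspaceRepository) -> None:
        self.repo = repo

    def create(self, name: str) -> Workspace:
        if not name.strip():
            raise ValueError("workspace name must not be empty")
        return self.repo.save(name.strip())
```

Keeping validation in the service and persistence in the repository is what lets each layer be swapped or unit-tested in isolation.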
insighthub/
+-- src/
| +-- cli.py # CLI entry point
| +-- config.py # Configuration management
| +-- context.py # DI container
| +-- domains/ # Business domains (DDD)
| | +-- default_rag_config/ # Default RAG configuration
| | | +-- models.py # Domain entities
| | | +-- repositories.py # Data persistence
| | | +-- service.py # Business logic
| | | +-- orchestrator.py # Workflow coordination
| | | +-- validation.py # Input validation
| | | +-- mappers.py # DTO mappings
| | | +-- dtos.py # Request/Response DTOs
| | | \-- commands.py # CLI commands
| | +-- rag_options/ # RAG algorithm options (read-only)
| | | +-- service.py # Query available algorithms
| | | +-- orchestrator.py # Orchestration
| | | +-- dtos.py # Response DTOs
| | | \-- commands.py # CLI commands
| | +-- state/ # Application state
| | | +-- models.py # State model
| | | +-- repositories.py # State repository
| | | +-- service.py # State queries
| | | +-- orchestrator.py # Orchestration
| | | +-- dtos.py # Response DTOs
| | | \-- commands.py # CLI commands
| | \-- workspace/ # Workspace domain
| | +-- models.py # Domain entities
| | +-- repositories.py # Data persistence
| | +-- data_access.py # Cache + repository coordination
| | +-- service.py # Business logic
| | +-- orchestrator.py # Orchestration
| | +-- validation.py # Input validation
| | +-- mappers.py # DTO mappings
| | +-- dtos.py # DTOs
| | +-- commands.py # CLI commands
| | +-- chat/ # Chat subdomain
| | | +-- message/ # Messages
| | | | +-- models.py
| | | | +-- repositories.py
| | | | +-- service.py
| | | | \-- ...
| | | \-- session/ # Sessions
| | | +-- models.py
| | | +-- repositories.py
| | | +-- service.py
| | | \-- ...
| | \-- document/ # Document subdomain
| | +-- models.py
| | +-- repositories.py
| | +-- data_access.py # Cache + repo coordination
| | +-- service.py
| | \-- ...
| \-- infrastructure/ # Infrastructure layer
| +-- cache/ # Caching implementations
| | +-- cache.py # Abstract cache interface
| | +-- redis_cache.py # Redis implementation
| | \-- in_memory_cache.py # In-memory implementation
| +-- validation/ # Generic validation utilities
| | \-- utils.py # Reusable validators
| +-- mappers/ # Generic mapper utilities
| | \-- utils.py # Date formatting, etc.
| +-- cli_io.py # CLI I/O wrappers
| +-- rag/ # RAG implementation
| | +-- options.py # RAG algorithm registry
| | +-- steps/ # RAG processing steps
| | | +-- general/ # Parsing, chunking
| | | +-- graph_rag/ # Graph RAG
| | | \-- vector_rag/ # Vector RAG, embedding, reranking
| | \-- workflows/ # End-to-end workflows
| +-- llm/ # LLM provider abstractions
| +-- storage/ # Blob storage (FS/S3)
| \-- sql_database.py # Database connection
+-- migrations/ # SQL migration scripts
+-- tests/ # Test suite
| +-- unit/ # Unit tests
| +-- integration/ # Integration tests
| \-- e2e/ # End-to-end tests
+-- Taskfile.yml # Task automation
+-- docker-compose.yml # Infrastructure services
\-- pyproject.toml # Python dependencies
- Upload and parse documents (PDF, HTML, TXT)
- Automatic chunking with semantic/sliding window strategies
- Metadata enrichment for better retrieval
- Support for both filesystem and S3-compatible storage
- Document versioning and tracking per workspace
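The sliding-window strategy mentioned above can be sketched like this. This is a conceptual illustration, not the project's actual chunker (which lives under src/infrastructure/rag/steps/general/); the function name and parameters are illustrative.

```python
def chunk_text(text: str, size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into fixed-size chunks whose windows overlap.

    Overlap keeps sentences that straddle a chunk boundary retrievable
    from both neighboring chunks.
    """
    if overlap >= size:
        raise ValueError("overlap must be smaller than chunk size")
    step = size - overlap
    # Advance by (size - overlap) so each chunk repeats the previous tail.
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]
```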
Vector RAG:
- Semantic search using Qdrant vector database
- Sentence Transformers embeddings
- Similarity-based retrieval with configurable top-k
- Result reranking for improved accuracy
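At its core, similarity-based retrieval with configurable top-k reduces to ranking stored chunk embeddings by cosine similarity to the query embedding. A self-contained sketch of that idea (in production this ranking is done by Qdrant, not in Python):

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def top_k(query: list[float], chunks: dict[str, list[float]], k: int = 3) -> list[str]:
    """Return the ids of the k chunks most similar to the query embedding."""
    ranked = sorted(chunks, key=lambda cid: cosine(query, chunks[cid]), reverse=True)
    return ranked[:k]
```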
Graph RAG:
- Knowledge graph construction in Neo4j
- Relationship-based retrieval
- Entity and concept extraction
- Graph traversal for context discovery
Processing Pipeline:
- Multi-format document parsing (PyPDF, BeautifulSoup)
- Intelligent text chunking with overlap
- Batch embedding for efficiency
- Configurable RAG parameters per workspace
- Multi-session support with history
- Streaming LLM responses
- Contextual retrieval from documents
- Session management and persistence
- Organize documents and chats by workspace
- Isolated configurations per workspace
- Easy workspace switching via CLI
- Redis or in-memory caching
- Embedding cache to avoid recomputation
- Query result caching
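The embedding cache above can be sketched as a memoizing wrapper around an embed function. This is an illustration of the idea only; the project's real cache interface lives in src/infrastructure/cache/ and may look different.

```python
from typing import Callable


class EmbeddingCache:
    """Memoize embeddings so repeated texts are not re-encoded."""

    def __init__(self, embed: Callable[[str], list[float]]) -> None:
        self._embed = embed
        self._store: dict[str, list[float]] = {}
        self.misses = 0

    def get(self, text: str) -> list[float]:
        if text not in self._store:
            self.misses += 1  # only a miss triggers the expensive embed call
            self._store[text] = self._embed(text)
        return self._store[text]
```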
# Code quality
task compile # Check Python syntax compilation
task format # Format code (Black + isort)
task lint # Lint code (Ruff)
task type-check # Type checking (mypy)
task check # Run all quality checks (compile, format, lint, type-check, test)
# Testing
task unit-test # Run unit tests
task integration-test # Run integration tests
task e2e-test # Run e2e tests
task test # Run all tests
# Database
task migrate-up # Apply migrations
task migrate-down # Rollback migrations
# Infrastructure
task up # Start Docker services
task down # Stop services
task reset # Full reset with migrations
task health-check # Verify all services
# Run all tests with coverage
task test
# Run specific test categories
poetry run pytest tests/unit
poetry run pytest tests/integration
poetry run pytest tests/e2e
# Run with verbose output
poetry run pytest -v
# Run specific test file
poetry run pytest tests/unit/test_chunking.py
# Run CLI directly
poetry run python -m src.cli workspace list
# Or use task runner
task cli -- workspace list
task cli -- document add my-file.pdf
task cli -- chat send "Hello!"
Environment variables and configuration options (see src/config.py):
Database:
- `DATABASE_URL` - PostgreSQL connection string (default: `postgresql://insighthub:insighthub@localhost:5432/insighthub`)
Vector Store:
- `QDRANT_HOST` - Qdrant host (default: `localhost`)
- `QDRANT_PORT` - Qdrant port (default: `6333`)
Graph Database:
- `NEO4J_URI` - Neo4j connection URI
- `NEO4J_USERNAME` - Neo4j username
- `NEO4J_PASSWORD` - Neo4j password
LLM Provider:
- `LLM_PROVIDER` - Provider choice: `ollama`, `openai`, or `anthropic`
- `OLLAMA_BASE_URL` - Ollama API endpoint (default: `http://localhost:11434`)
- `OPENAI_API_KEY` - OpenAI API key (if using OpenAI)
- `ANTHROPIC_API_KEY` - Anthropic API key (if using Claude)
- `LLM_MODEL` - Model name for the selected provider
- `EMBEDDING_MODEL` - Embedding model name
Cache:
- `CACHE_TYPE` - Cache backend: `memory` or `redis` (default: `memory`)
- `REDIS_URL` - Redis connection URL (if using the Redis cache)
Storage:
- `STORAGE_TYPE` - Storage backend: `filesystem` or `s3` (default: `filesystem`)
- `STORAGE_PATH` - Filesystem storage path (default: `./storage`)
- `S3_ENDPOINT_URL` - S3-compatible endpoint (e.g., MinIO)
- `S3_BUCKET` - S3 bucket name
- `S3_ACCESS_KEY` - S3 access key
- `S3_SECRET_KEY` - S3 secret key
Logging:
- `LOG_LEVEL` - Logging level (default: `INFO`)
The docker-compose.yml defines 7 services:
| Service | Description | Ports |
|---|---|---|
| `postgres` | PostgreSQL 16 with pgvector extension | 5432 |
| `qdrant` | Vector database for semantic search | 6333, 6334 (UI) |
| `neo4j` | Graph database for knowledge graphs | 7474 (UI), 7687 |
| `ollama` | Local LLM service | 11434 |
| `ollama-setup` | Initializes Ollama models | - |
| `minio` | S3-compatible object storage | 9000, 9001 (console) |
| `redis` | Caching layer | 6379 |
- Parse document (extract text from PDF/HTML/TXT)
- Chunk text with configured strategy
- Generate embeddings for chunks
- Store vectors in Qdrant
- Optionally build knowledge graph in Neo4j
- Store metadata in PostgreSQL
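The ingestion steps above compose into a simple pipeline. In the sketch below every component (`parse`, `chunk`, `embed`, `vector_store`, `metadata_store`) is a hypothetical stand-in injected as a function, not the project's actual interface:

```python
def ingest(raw: bytes, *, parse, chunk, embed, vector_store, metadata_store):
    """Run a document through the ingestion steps with injected components."""
    text = parse(raw)                       # 1. extract text from PDF/HTML/TXT
    chunks = chunk(text)                    # 2. chunk with the configured strategy
    vectors = [embed(c) for c in chunks]    # 3. generate embeddings per chunk
    ids = vector_store(chunks, vectors)     # 4. store vectors (Qdrant)
    metadata_store(ids)                     # 6. persist metadata (PostgreSQL)
    return ids
```

Step 5 (optional knowledge-graph construction) would slot in between steps 4 and 6 with the same injected-component pattern.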
- Embed user query
- Retrieve similar chunks from vector store
- Optionally traverse knowledge graph
- Rerank results for relevance
- Build context for LLM
- Generate streaming response
- Cache results
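The query flow above can be sketched the same way, with each stage (`embed`, `retrieve`, `rerank`, `generate`) as a hypothetical injected component and the prompt format purely illustrative:

```python
def answer(query: str, *, embed, retrieve, rerank, generate) -> str:
    """Run one query through the retrieval flow with injected components."""
    qvec = embed(query)                  # 1. embed the user query
    candidates = retrieve(qvec)          # 2-3. fetch similar chunks (and graph context)
    context = rerank(query, candidates)  # 4. rerank for relevance
    # 5. build the LLM context from the top-ranked chunks
    prompt = "Context:\n" + "\n".join(context) + f"\n\nQuestion: {query}"
    return generate(prompt)              # 6. generate the response
```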
- Delete vectors from Qdrant
- Remove graph nodes from Neo4j
- Delete metadata from PostgreSQL
- Clean up storage
Unit Tests (tests/unit/):
- Domain logic testing
- Service layer testing
- Isolated component tests
Integration Tests (tests/integration/):
- Database integration
- Vector store integration
- LLM provider integration
- Workflow testing
E2E Tests (tests/e2e/):
- Complete CLI command testing
- Full RAG pipeline testing
- Multi-service integration
- Fork the repository
- Create a feature branch
- Make your changes
- Run quality checks: `task check`
- Run tests: `task test`
- Submit a pull request
GPL-3.0