
Argentic Search Lab


Production-oriented local research stack with:

  • Modern web UI (AppAgent.html)
  • SearXNG as internal search backend
  • MCP server for tool-calling in agent/chat/code workflows
  • Dockerized runtime (UI + SearXNG + MCP + Redis)

Android Support (Termux)

Yes. Android is supported via Termux with a proot Debian/Ubuntu distro, using the Node.js runtime branch.

  • Node.js branch: codex/app-nodejs-runtime
  • Android setup details: node-runtime/README.md

The system is designed to run locally with zero mandatory cloud cost.

Live UI Screenshots

UI screenshots 1–4.

Why It Is Truly Free

  • No required subscription to run the core stack.
  • Runs on your local machine (localhost) with local providers (LM Studio / Ollama).
  • No mandatory per-query payment wall for research flows.
  • You control performance/cost by model choice and hardware.

Compared to hosted research products, this project can run fully local-first with no recurring usage fee.

Local-First Responsibility Notes

  • Quality depends on the model you choose.
  • Recommended default for this project: non-thinking/non-reasoning models for best speed and stability.
  • Thinking/reasoning models are optional and currently more variable in formatting/latency.
  • Smaller models are faster/cheaper but may reason less deeply.
  • Larger models generally improve grounding, critique, and synthesis quality.
  • Always validate high-stakes outputs (medical/legal/financial) before acting.

Quick Start (1 minute)

Option A: Docker (full stack)

bash <(curl -fsSL https://raw.githubusercontent.com/zvspuentus-rgb/Argentic-Search-Lab/main/scripts/bootstrap.sh)

Manual:

git clone https://github.com/zvspuentus-rgb/Argentic-Search-Lab.git
cd Argentic-Search-Lab
cp .env.example .env
docker compose up -d --build

Open:

  • UI: http://localhost:8093
  • MCP endpoint: http://localhost:8193/mcp
  • SearXNG JSON: http://localhost:8393/search?q=test&format=json
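To verify the stack is up, you can hit each endpoint from the shell. A minimal sketch; the tools/list call assumes the server implements the standard MCP JSON-RPC method of that name:

# SearXNG JSON API: should return a JSON document with results
curl -s 'http://localhost:8393/search?q=test&format=json' | head -c 300

# MCP endpoint: list available tools via JSON-RPC 2.0
curl -s http://localhost:8193/mcp \
  -H 'Content-Type: application/json' \
  -d '{"jsonrpc":"2.0","id":1,"method":"tools/list"}'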

Option B: Node.js runtime (auto setup)

One-command install (Node.js + local SearXNG via Python venv, no Docker):

bash ./scripts/bootstrap-node-runtime.sh

Or from fresh clone:

git clone https://github.com/zvspuentus-rgb/Argentic-Search-Lab.git
cd Argentic-Search-Lab
bash ./scripts/bootstrap-node-runtime.sh

Run:

argentic up

Open:

  • UI: http://localhost:3093
  • MCP endpoint: http://localhost:3093/mcp
  • SearXNG JSON: http://localhost:8394/search?q=test&format=json

Android Support (Termux + proot Debian/Ubuntu)

You can run the Node runtime on Android through Termux with a proot distro (Debian/Ubuntu).

Recommended:

  • Use distro Python from proot (/usr/bin/python3.x), not Termux Python (/data/data/com.termux/...).
  • Keep Node.js + npm + git installed in the same environment.

Example:

export PYTHON_BIN=/usr/bin/python3.11
bash ./scripts/bootstrap-node-runtime.sh
cd node-runtime && npx argentic up

CLI:

  • argentic status
  • argentic down
  • argentic up runs in the foreground; press Ctrl+C to stop the Node server and clean up the services started by this run.

Why This Project

Argentic Search Lab gives you two research speeds in one interface:

  • Quick Search: fast answer path with minimal orchestration
  • Deep Research: full multi-agent pipeline for higher quality and coverage

You can use it as:

  • A standalone research UI
  • An MCP tool provider for external agents/IDE chat tools

Core Capabilities

  • Discovery feed + one-click run
  • Session history with local restore (localStorage)
  • Prompt enhancement with configurable output language
  • Side media context (image/video)
  • Selection-to-Ask: highlight text inside Analysis, then send a contextual follow-up query
  • Deep research timeline, source management, follow-up suggestions
  • Research Thread cards render markdown formatting (headings, lists, emphasis) for prior runs
  • Export to PDF/JSON and share flows
  • Provider-ready architecture (LM Studio default, plus Ollama/OpenAI/Anthropic/Gemini settings)
  • Zero mandatory API cost in local mode (LM Studio + local models)
  • Repo-grounded MCP retrieval (GitHub URL scoped context with strict_repo_only)

Search Modes

1) Quick Search

Goal: lowest latency with useful answer quality.

Behavior:

  • Minimal planning and fewer steps
  • Lower search fan-out
  • Faster synthesis
  • Best for follow-ups, short factual checks, and iterative chat

2) Deep Research

Goal: maximum coverage, quality, and confidence.

Pipeline:

  1. Analyzer
  2. Planner
  3. Refiner
  4. Multi-lane Search
  5. Critic / quality pass
  6. Synthesis (with citations)
  7. Copilot follow-ups

Best for:

  • Complex technical investigations
  • Multi-source comparisons
  • Higher-stakes answers where evidence quality matters

Pipeline Diagram

Diagrams: Pipeline Overview, Pipeline Workflow (SVG), MCP Workflow (SVG)

flowchart TB
    A["User Query"] --> B{"Mode Resolver"}
    B -->|"Quick"| Q1["Quick Planner"]
    Q1 --> Q2["Targeted Search"]
    Q2 --> Q3["Fast Synthesis"]
    Q3 --> Q4["Answer + Suggestions"]
    B -->|"Deep"| D1["Analyzer"]
    D1 --> D2["Planner"]
    D2 --> D3["Refiner"]
    D3 --> D4["Multi-lane Search"]
    D4 --> D5["Critic"]
    D5 --> D6["Synthesis + Citations"]
    D6 --> D7["Copilot Follow-ups"]
    classDef entry fill:#123047,stroke:#5fa8ff,color:#e8f4ff,stroke-width:1px;
    classDef quick fill:#153a2e,stroke:#43d3a8,color:#eafff6,stroke-width:1px;
    classDef deep fill:#3a1d12,stroke:#ffb067,color:#fff2e8,stroke-width:1px;
    classDef output fill:#2f2248,stroke:#b690ff,color:#f2eaff,stroke-width:1px;
    class A,B entry;
    class Q1,Q2,Q3 quick;
    class D1,D2,D3,D4,D5,D6 deep;
    class Q4,D7 output;

3) Auto

  • Chooses mode from query intent
  • If query explicitly asks for deep/research/analysis => Deep
  • Otherwise => Quick
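The routing heuristic amounts to a keyword check on the query. A minimal sketch; the function name and keyword list are illustrative assumptions, not the actual implementation:

resolve_mode() {
  # Route to Deep when the query explicitly asks for deep work;
  # everything else takes the Quick path.
  if printf '%s' "$1" | grep -qiE 'deep|research|analysis'; then
    echo "deep"
  else
    echo "quick"
  fi
}

resolve_mode "deep analysis of MCP transports"   # -> deep
resolve_mode "capital of France"                 # -> quick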

Prompt Enhancement Language

Enhance Prompt now has a dedicated setting:

  • Default: English
  • Optional: Auto (same language as user prompt)
  • Optional: specific language (Hebrew, Spanish, French, etc.)

This setting is persisted in localStorage and restored after refresh.

Architecture

.
├── AppAgent.html
├── assets/
│   ├── css/
│   │   ├── base.css
│   │   └── components.css
│   └── js/
│       ├── app-core.js
│       ├── app-state.js
│       ├── app-utils.js
│       ├── app-research.js
│       └── app-ui.js
├── Dockerfile
├── docker-compose.yml
├── server.js
├── mcp-service/
│   ├── app.py
│   ├── Dockerfile
│   └── requirements.txt
├── docs/
│   ├── logo.svg
│   └── pipeline.png
├── scripts/
│   └── bootstrap.sh
├── searxng/
│   └── settings.yml
└── MCP_INTEGRATION.md

MCP Integration (Tool Provider)

The MCP service is exposed via JSON-RPC 2.0, plus backward-compatible plain HTTP endpoints.

Reference: MCP_INTEGRATION.md

Available tools:

  • search_quick
  • search_deep
  • fetch_url_context
  • fetch_url_context_smart (MCP2 smart multi-link crawl + merged context)

MCP deep/quick now support URL-aware research:

  • Detect URLs embedded inside query text
  • Accept explicit urls argument
  • Return context_items (cleaned page extracts) and urls_detected
  • For GitHub repo URLs: enforce repo-scoped retrieval and pull key file context from inside the repository
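Any HTTP client can drive these tools by POSTing JSON-RPC to the MCP endpoint. A minimal curl sketch calling search_quick (the query string is illustrative):

curl -s http://localhost:8193/mcp \
  -H 'Content-Type: application/json' \
  -d '{
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {
      "name": "search_quick",
      "arguments": { "query": "what is SearXNG", "include_context": true }
    }
  }'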

Use cases:

  • Agentic IDE coding assistants
  • Chat agents that need optional web search
  • Tool calls from orchestrators with explicit search delegation

Service Topology


flowchart TB
    UI["AppAgent UI (Web)"] --> APP["App Server (Node)"]
    APP --> SX["SearXNG (JSON Search)"]
    MCP["MCP Server (Tool Provider)"] --> SX
    EXT["External Agent / IDE"] --> MCP
    classDef ui fill:#123047,stroke:#5fa8ff,color:#e8f4ff,stroke-width:1px;
    classDef infra fill:#1b2d22,stroke:#43d3a8,color:#eafff6,stroke-width:1px;
    classDef ext fill:#3a1d12,stroke:#ffb067,color:#fff2e8,stroke-width:1px;
    class UI ui;
    class APP,SX,MCP infra;
    class EXT ext;

GitHub Repo Context (Model Grounding)

When a user provides a GitHub repo URL, MCP now applies strict repo scope and pulls file-level context from inside that repository.

What happens automatically:

  • URL is detected from query text
  • Repo scope is enforced (github.com/<owner>/<repo>)
  • Unrelated GitHub results are filtered out
  • Key files are fetched for context (README, Docker/config/build files, relevant source files)
  • Response includes repo_scope_enforced, urls_detected, and context_items

Quick example (focused, fast):

{
  "jsonrpc": "2.0",
  "id": 10,
  "method": "tools/call",
  "params": {
    "name": "search_quick",
    "arguments": {
      "query": "inspect this repository https://github.com/zvspuentus-rgb/Argentic-Search-Lab/tree/main",
      "limit": 6,
      "include_context": true,
      "context_max_urls": 3
    }
  }
}

Deep example (maximum coverage on the same repo):

{
  "jsonrpc": "2.0",
  "id": 11,
  "method": "tools/call",
  "params": {
    "name": "search_deep",
    "arguments": {
      "query": "analyze architecture and pipeline in this repo https://github.com/zvspuentus-rgb/Argentic-Search-Lab/tree/main",
      "limit": 10,
      "include_context": true,
      "context_max_urls": 8,
      "strict_repo_only": true
    }
  }
}

Interpretation tips for agents:

  • If repo_scope_enforced=true: trust this as repo-grounded retrieval
  • Prefer context_items for synthesis over generic web snippets
  • Use fetch_url_context only for extra targeted URLs not already covered
  • Use fetch_url_context_smart when one URL is not enough and deeper page traversal is needed
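An agent can gate on these fields mechanically. A small jq sketch over a saved response (response.json is a hypothetical file name, and it assumes the fields sit under the standard JSON-RPC result envelope):

# Check grounding before trusting the extracts
jq '{grounded: .result.repo_scope_enforced, urls: .result.urls_detected}' response.json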

MCP2 Smart URL Context (new)

fetch_url_context_smart is the advanced URL tool for deeper grounding:

  • Starts from a seed URL
  • Discovers additional links (optionally same-domain only)
  • Fetches and cleans multiple pages
  • Returns merged context for higher-quality synthesis

Returned fields:

  • mode (smart)
  • urls_visited
  • context_items
  • merged_context

Example:

{
  "jsonrpc": "2.0",
  "id": 13,
  "method": "tools/call",
  "params": {
    "name": "fetch_url_context_smart",
    "arguments": {
      "url": "https://github.com/zvspuentus-rgb/Argentic-Search-Lab/tree/main",
      "max_urls": 5,
      "max_chars_per_url": 2000,
      "same_domain_only": true
    }
  }
}

Visual Workflow (Repo URL Path)

flowchart LR
    U["User sends GitHub URL"] --> D["URL Detector"]
    D --> S["Repo Scope Enforcer"]
    S --> Q["Scoped Query Builder"]
    Q --> X["SearXNG Retrieval"]
    S --> G["GitHub File Context Fetcher"]
    X --> M["Merge + Deduplicate"]
    G --> M
    M --> O["Grounded Output for Model"]
    classDef step fill:#132f44,stroke:#5fa8ff,color:#e8f4ff,stroke-width:1px;
    classDef guard fill:#23351d,stroke:#8bdc65,color:#f2ffe8,stroke-width:1px;
    classDef out fill:#3d1f3f,stroke:#d59cff,color:#ffeefe,stroke-width:1px;
    class U,D,Q,X,G,M step;
    class S guard;
    class O out;

MCP client config snippet

Add inside your MCP client config (mcpServers):

"appagent": {
  "url": "http://localhost:8193/mcp"
}

MCP config for clients that require command/args/env

Some MCP clients support url directly, while others require a stdio command.

Use one of these patterns:

  1. Direct HTTP (preferred when client supports url)
{
  "mcpServers": {
    "appagent": {
      "url": "http://localhost:8193/mcp"
    }
  }
}
  2. Stdio bridge via mcp-remote (for command-only clients)
{
  "mcpServers": {
    "appagent": {
      "command": "npx",
      "args": [
        "-y",
        "mcp-remote",
        "http://localhost:8193/mcp",
        "--transport",
        "http-only",
        "--allow-http"
      ],
      "env": {}
    }
  }
}

Notes:

  • env can stay {} if you do not need secrets/custom headers.
  • Not all clients require all fields; if url works in your client, prefer it.
  • Keep your server endpoint as http://localhost:8193/mcp unless you changed ports.

Local Model Runtime and Cost

  • Default setup is local-first.
  • Works with small models for lightweight tasks (for example ~1B–3B class models).
  • Larger models generally improve planning, critique, and synthesis quality.
  • No mandatory paid API required when running local providers.

Docker Quick Start

  1. Copy env:
cp .env.example .env
  2. Start stack:
docker compose up -d --build
  3. Open:
  • UI: http://localhost:8093
  • MCP: http://localhost:8193/mcp
  • SearXNG direct: http://localhost:8393/search?q=test&format=json

Ports and Env

Configured via .env:

  • APP_PORT=8093
  • MCP_PORT=8193
  • SEARX_PORT=8393
  • LMSTUDIO_BASE=http://host.docker.internal:1234
  • OLLAMA_BASE=http://host.docker.internal:11434
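Changing a value is a normal edit-and-rebuild cycle. For example, to move the UI off 8093 (an illustrative override, not a recommendation):

# Edit APP_PORT in .env, then rebuild
# (GNU sed shown; on macOS use: sed -i '' ...)
sed -i 's/^APP_PORT=.*/APP_PORT=9093/' .env
docker compose up -d --build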

Live Demo quota mode (for Hugging Face/hosted demos)

Optional server-side cookie quota (does not affect local installs unless enabled):

  • LIVE_DEMO_MODE=true
  • LIVE_DEMO_QUERY_LIMIT=2

Behavior:

  • Each user cookie can run up to LIVE_DEMO_QUERY_LIMIT search executions.
  • After the limit is reached, new runs are blocked with a clear status message.
  • Local/self-host users remain unaffected when LIVE_DEMO_MODE is not enabled.
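For a hosted demo, the two variables go into .env like any other setting (values shown are the documented defaults; omit both lines entirely for local installs):

# .env additions for a hosted demo
LIVE_DEMO_MODE=true
LIVE_DEMO_QUERY_LIMIT=2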

Provider Routing Notes

The UI server proxies local providers to avoid browser CORS issues:

  • /lmstudio/* -> ${LMSTUDIO_BASE}
  • /ollama/* -> ${OLLAMA_BASE}

Default UI values:

  • LM Studio base: /lmstudio/v1
  • Ollama base: /ollama/v1
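Because of the proxy, the browser only ever talks to the app origin. A quick sketch checking the LM Studio route (LM Studio serves the OpenAI-compatible /v1/models endpoint, so the proxied and direct calls should return the same list):

# Through the UI server's proxy (no CORS involved)
curl -s http://localhost:8093/lmstudio/v1/models

# Direct call, from the host machine, that the proxy forwards to
curl -s http://localhost:1234/v1/models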

Persistence

Persisted in browser localStorage:

  • UI/app settings
  • Sessions/history
  • Mode/provider/toggles/language preferences

All of these restore automatically on refresh.

Useful Commands

# status
docker compose ps

# logs
docker compose logs -f app mcp searxng

# restart MCP only
docker compose up -d --build mcp

# stop
docker compose down

Security Notes

  • Do not commit real API keys.
  • Keep .env local.
  • If exposing publicly, put authentication/reverse-proxy in front.

License

MIT (LICENSE).

GitHub Pages Demo Site

This repository includes a ready-to-serve demo site in docs/.

Enable it:

  1. Open repository Settings -> Pages
  2. Under Build and deployment, choose Deploy from a branch
  3. Select branch main and folder /docs
  4. Save and wait for deployment

Expected URL:

  • https://zvspuentus-rgb.github.io/Argentic-Search-Lab/

Local preview (optional):

cd docs
python3 -m http.server 8088
# open http://localhost:8088

Hugging Face Space (Prepared)

A ready-to-deploy Docker Space bundle is included in hf-space/ for hosted demo deployment.

What is included:

  • UI server + MCP service in one container
  • Demo quota mode enabled (LIVE_DEMO_QUERY_LIMIT=2)
  • Docker-ready Space metadata (hf-space/README.md)