GitHub - at264939-ctrl/Heisenberg

██╗  ██╗███████╗██╗███████╗███████╗███╗   ██╗██████╗ ███████╗██████╗  ██████╗
██║  ██║██╔════╝██║██╔════╝██╔════╝████╗  ██║██╔══██╗██╔════╝██╔══██╗██╔════╝
███████║█████╗  ██║███████╗█████╗  ██╔██╗ ██║██████╔╝█████╗  ██████╔╝██║  ███╗
██╔══██║██╔══╝  ██║╚════██║██╔══╝  ██║╚██╗██║██╔══██╗██╔══╝  ██╔══██╗██║   ██║
██║  ██║███████╗██║███████║███████╗██║ ╚████║██████╔╝███████╗██║  ██║╚██████╔╝
╚═╝  ╚═╝╚══════╝╚═╝╚══════╝╚══════╝╚═╝  ╚═══╝╚═════╝ ╚══════╝╚═╝  ╚═╝ ╚═════╝

"I am the one who knocks."

A fully local, production-grade autonomous AI agent. Runs 9B-class GGUF models on constrained hardware using adaptive TurboQuant strategies — all within a strict 3 GB RAM budget, no cloud required.

🚀 Quick Start · 🏗️ Architecture · 🧠 TurboQuant · 📁 Structure · 🔒 Security · ☕ Support

🌟 What Is Heisenberg?

Heisenberg is not just another AI wrapper. It is a production-oriented autonomous agent built from the ground up for developers who refuse to compromise on privacy, performance, or control.

Feature	Detail
🧱 Core Language	Rust — zero-cost abstractions, memory safety, raw speed
🤖 Inference Engine	llama-cpp via FFI/subprocess (GGUF native)
💾 Memory Model	Adaptive 3 GB hard cap with graceful degradation
🐚 Execution Layer	Bash-first, auditable, sandboxed via `ulimit`
🧬 Long-term Memory	PostgreSQL (`mike` module)
🌐 Browser Automation	Sandboxed via `the_rv` + `scripts/drive.sh`
☁️ Cloud Dependency	Zero — fully offline, fully private
📦 Model Format	GGUF (drop `.gguf` in `models/` and go)

🏗️ System Architecture

High-Level Flow

flowchart TD
    U(["👤 User Terminal"]) -->|natural language input| H

    subgraph HEISENBERG ["🧪 Heisenberg Orchestrator (Rust)"]
        H["🎯 REPL / Action Loop"]
        H -->|structured JSON commands| J["🔧 Jesse\nBash Executor"]
        H -->|inference requests| L["⚗️ The Lab\nllama-cpp + GGUF Loader"]
        H -->|memory ops| M["🗄️ Mike\nMemory Manager"]
        H -->|schedule tasks| G["⏱️ Gus\nScheduler"]
        H -.->|self-patch| B["🔵 Blue Sky\nSelf-Modification"]
        H -.->|screen events| K["👁️ Hank\nScreen Observer"]
        H -.->|browser tasks| RV["🌐 The RV\nBrowser Sandbox"]
    end

    J -->|shell result JSON| H
    L -->|streamed tokens| H
    M <-->|read/write| DB[("🐘 PostgreSQL\nLong-term Memory")]
    RV -->|page data| H

    style HEISENBERG fill:#1a1a2e,stroke:#e94560,color:#fff
    style U fill:#16213e,stroke:#0f3460,color:#fff
    style DB fill:#0f3460,stroke:#533483,color:#fff

IPC Sequence: Rust ↔ Bash

sequenceDiagram
    autonumber
    participant U  as 👤 User
    participant R  as 🎯 Rust Orchestrator
    participant CF as 📨 CMD_FIFO
    participant BW as 🔧 Bash Worker (Jesse)
    participant RF as 📩 RESP_FIFO
    participant DB as 🗄️ PostgreSQL (Mike)

    U->>R: "Analyze logs and summarize"
    R->>CF: write {cmd: "run", script: "cook.sh", args: [...]}
    CF->>BW: Execute script under ulimit sandbox
    BW->>BW: Run, collect stdout/stderr
    BW->>RF: write {status: "ok", pid: 1234, output: "..."}
    RF->>R: Read response
    R->>DB: Persist summary to long-term memory
    DB-->>R: ACK
    R-->>U: Stream result tokens

TurboQuant Memory Planning Pipeline

flowchart LR
    A["📂 GGUF Header Scan\n(layers, heads, dim, tokenizer)"]
    --> B["📐 KV Cache Estimator\n(context × layers × heads × dtype)"]
    --> C["📊 Memory Planner\n(weights + KV + OS overhead)"]

    C --> D{"> 3 GB?"}
    D -- YES --> E["⚡ Quantization Selector\nQ8_0 → Q5_0 → Q4_0"]
    D -- YES --> F["📉 Context Reducer\n2048 → 1024 → 512"]
    D -- NO  --> G["✅ Launch as-is"]
    E --> H["🚀 cook.sh Launcher\nulimit -v 3072 MB"]
    F --> H
    G --> H

    H --> I["🔍 Mike Live Monitor\n(RSS polling)"]
    I -->|pressure detected| J["🔄 Runtime Mitigation\n(shrink ctx / cancel tasks)"]
    J --> I

    style D fill:#e94560,stroke:#c23152,color:#fff
    style H fill:#0f3460,stroke:#533483,color:#fff
    style I fill:#16213e,stroke:#e94560,color:#fff

🔬 Module Reference

Each module in src/ has a named role inspired by Breaking Bad — intentional, consistent, and memorable.

mindmap
  root((🧪 Heisenberg))
    🎯 heisenberg
      REPL loop
      Action dispatcher
      Agent state machine
    ⚗️ the_lab
      GGUF loader
      llama-cpp FFI
      TurboQuant Planner
      Token streamer
    🔧 jesse
      Bash executor
      FIFO IPC
      ulimit sandbox
      Script runner
    ⏱️ gus
      Task scheduler
      Priority queue
      Retry logic
    🗄️ mike
      Memory manager
      PostgreSQL glue
      Context compaction
      Pressure monitor
    ⚙️ saul
      Config parser
      CLI interface
      Argument validation
    👁️ hank
      Screen observer
      Event capture
      Display hooks
    🌐 the_rv
      Browser sandbox
      Automation layer
      Allow-list engine
    🔵 blue_sky
      Self-modification
      Patch generator
      Safe apply

📁 Project Structure

Heisenberg/
│
├── 📦 src/
│   ├── heisenberg/      # 🎯 Orchestrator — REPL, action loop, agent state machine
│   ├── the_lab/         # ⚗️  Inference engine — GGUF loader, llama-cpp, TurboQuant
│   ├── jesse/           # 🔧 Bash executor — FIFO IPC, sandboxed shell worker
│   ├── gus/             # ⏱️  Scheduler — task queue, priority, retries
│   ├── mike/            # 🗄️  Memory manager — PostgreSQL glue, context compaction
│   ├── saul/            # ⚙️  Config & CLI — argument parsing, env vars
│   ├── hank/            # 👁️  Screen observer — display hooks, event capture
│   ├── the_rv/          # 🌐 Browser sandbox — automation, allow-list control
│   └── blue_sky/        # 🔵 Self-modification — patch generator, safe apply
│
├── 🛠️ scripts/
│   ├── build.sh         # Full build (release mode)
│   ├── say-my-name.sh   # Launch agent with shell UI
│   ├── cook.sh          # Inference wrapper — ulimit enforcement
│   └── drive.sh         # Browser automation entry point
│
├── 🤖 models/           # Drop your .gguf files here (do NOT commit large files)
│   └── *.gguf
│
├── 🗃️ sql/              # PostgreSQL schema — long-term memory tables
│
└── 📋 Cargo.toml        # Rust workspace manifest

🚀 Quick Start

Prerequisites

Requirement	Version	Check
Rust toolchain	1.70+	`rustc --version`
Cargo	Latest stable	`cargo --version`
PostgreSQL	14+ (for Mike memory)	`psql --version`
A GGUF model	9B-class recommended	—

Step-by-step

# 1. Clone the repository
git clone https://github.com/at264939-ctrl/Heisenberg.git
cd Heisenberg

# 2. Place your GGUF model
cp /path/to/your/model.gguf models/
# Recommended: Qwen3.5-9B-Uncensored-HauhauCS-Aggressive-Q4_K_M.gguf

# 3. Build (release mode, optimized)
./scripts/build.sh

# 4. Launch the agent
Heisenberg
# — or —
./scripts/say-my-name.sh chat

# 5. Check system status
Heisenberg status

💡 Tip: For large models (>2 GB), use Git LFS if you plan to track them in version control.

🧠 TurboQuant Engine

The secret weapon that makes 9B-class models behave like small models.

TurboQuant is Heisenberg's adaptive runtime strategy layer. It solves a fundamental problem: 9B parameter models have massive KV caches that expand with context length and easily exceed available RAM.

The Core Problem

9B model weights (Q4_K_M) ≈ ~5.5 GB on disk
But RAM usage during inference = weights + KV cache + OS overhead
KV cache alone at context=2048 can push total usage well over 3 GB

TurboQuant's 5-Stage Response

Stage 1 → SCAN         Read GGUF header: layers, heads, hidden_dim, tokenizer
Stage 2 → ESTIMATE     Calculate KV cache size for requested context window
Stage 3 → PLAN         Aggregate: KV + mmap'd weights + tokenizer + OS overhead
Stage 4 → ADAPT        If predicted > 3 GB: downgrade KV quant and/or reduce context
Stage 5 → CONFINE      Launch via cook.sh with ulimit -v enforced at OS level

Live Example: Qwen3.5-9B on 3 GB RAM

Step	Action	Value
Request	User asks for generation	`context = 2048`
Scan	Read GGUF metadata	layers, heads, hidden_dim extracted
Estimate	Compute KV cache	~X MB for context=2048
Plan	Total predicted usage	Exceeds 3 GB cap ⚠️
Adapt	Select stronger KV quant	Q8_0 → Q5_0 → Q4_0
Adapt	Reduce context if needed	2048 → 1024
Launch	`cook.sh --mem-mb 3072`	ulimit enforced at OS level ✅
Monitor	`mike` polls RSS live	Mitigation triggered if needed

🔒 Security & Privacy

┌─────────────────────────────────────────────────────────┐
│                   SECURITY PRINCIPLES                   │
│                                                         │
│  ✅  100% local — zero external API calls               │
│  ✅  Zero telemetry — no usage data ever leaves         │
│  ✅  Bash scripts are auditable first-class citizens    │
│  ✅  Browser automation is allow-listed by default      │
│  ✅  ulimit enforced at OS level for all subprocesses   │
│  ✅  FIFO-based IPC — no shared memory exploits         │
│                                                         │
│  ⚠️  Review scripts/ before granting execution perms   │
└─────────────────────────────────────────────────────────┘

⚙️ Memory & Resource Strategy

Heisenberg enforces a 3 GB hard memory cap via sysinfo monitoring. When RAM pressure is detected, the orchestrator responds progressively:

Level 1 — YELLOW  ▶  Shrink inference context + reduce KV cache size
Level 2 — ORANGE  ▶  Lower concurrency → single-thread fallback for heavy ops
Level 3 — RED     ▶  Persist + compact long-term memory to PostgreSQL
Level 4 — CRITICAL▶  Drop nonessential buffers, disable expensive observers

Design philosophy: Stay responsive under pressure, never crash.

🔧 Extensibility

Add a new inference adaptor

src/the_lab/
├── your_adaptor.rs    # Implement the InferenceAdaptor trait
└── mod.rs             # Register your adaptor

Preferred: FFI to llama-cpp. Fallback: isolated subprocess.

Add a new shell action

src/jesse/
├── your_action.rs     # Define the action and its JSON schema
└── mod.rs             # Register via FIFO IPC

🌍 شرح TurboQuant بالعربية

نقطة القوة الحقيقية في Heisenberg هي قدرة TurboQuant Planner على تشغيل موديل 9B محليًا بذاكرة 3 جيجابايت فقط.

كيف يشتغل؟

المسح: نقرأ header ملف الـ GGUF ونستخرج مواصفات النموذج (عدد الطبقات، الأبعاد، عدد الـ heads)
التقدير: نحسب حجم الـ KV cache المطلوب بناءً على طول السياق المطلوب
التخطيط: نجمع: KV cache + الأوزان المحملة + overhead النظام ونقارن بالسقف
التكيّف: لو التوقعات تجاوزت 3GB، نقلّل جودة الـ KV cache تلقائيًا أو نقلّص نافذة السياق
العزل: نشغّل الـ inference داخل cook.sh مع ulimit -v لمنع أي تجاوز في الذاكرة

النتيجة: تقدر تشغّل Qwen3.5-9B-Q4_K_M.gguf على جهاز عادي بدون سحاب، بدون خصوصية مكشوفة، وبأداء متدرج بدلًا من الانهيار.

☕ Support Me

If Heisenberg helped you cook something great — consider buying me a coffee.

Building and maintaining a production-grade local AI agent takes serious time and effort. If this project saved you hours of work or sparked something useful in your own projects, your support means the world.

Every contribution — big or small — goes directly toward:

🖥️ Better hardware for testing on constrained environments
🧪 Experimenting with new GGUF models and quantization strategies
📚 Documentation, tutorials, and community support
⚡ Continued active development of new features

📬 Contact

Have questions, ideas, or want to collaborate? Reach out!

Channel	Info
📧 Email	ibrahimtarek1245@gmail.com
📱 Phone / WhatsApp	+201030553763
🐙 GitHub	at264939-ctrl/Heisenberg

Feel free to open an Issue or Discussion directly on GitHub for bugs, feature requests, or general feedback.

📄 License

MIT License — Copyright (c) Ibrahim Tarek

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software...

See the full LICENSE file for details.

Built with ❤️ and Rust by Ibrahim Tarek

"Say my name."

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.github		.github
bin		bin
scripts		scripts
sql		sql
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.proposed.toml		Cargo.proposed.toml
Cargo.toml		Cargo.toml
README.md		README.md
TurboQuant.pdf		TurboQuant.pdf
heisenberg_memory.sql		heisenberg_memory.sql
the-cook-cropped.png		the-cook-cropped.png
the-cook.png		the-cook.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

"I am the one who knocks."

🌟 What Is Heisenberg?

🏗️ System Architecture

High-Level Flow

IPC Sequence: Rust ↔ Bash

TurboQuant Memory Planning Pipeline

🔬 Module Reference

📁 Project Structure

🚀 Quick Start

Prerequisites

Step-by-step

🧠 TurboQuant Engine

The Core Problem

TurboQuant's 5-Stage Response

Live Example: Qwen3.5-9B on 3 GB RAM

🔒 Security & Privacy

⚙️ Memory & Resource Strategy

🔧 Extensibility

Add a new inference adaptor

Add a new shell action

🌍 شرح TurboQuant بالعربية

☕ Support Me

📬 Contact

📄 License

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

"I am the one who knocks."

🌟 What Is Heisenberg?

🏗️ System Architecture

High-Level Flow

IPC Sequence: Rust ↔ Bash

TurboQuant Memory Planning Pipeline

🔬 Module Reference

📁 Project Structure

🚀 Quick Start

Prerequisites

Step-by-step

🧠 TurboQuant Engine

The Core Problem

TurboQuant's 5-Stage Response

Live Example: Qwen3.5-9B on 3 GB RAM

🔒 Security & Privacy

⚙️ Memory & Resource Strategy

🔧 Extensibility

Add a new inference adaptor

Add a new shell action

🌍 شرح TurboQuant بالعربية

☕ Support Me

📬 Contact

📄 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages