open-compress
diff --git a/‎.github/workflows/ci.yml‎
Lines changed: 49 additions & 2 deletions b/‎.github/workflows/ci.yml‎
Lines changed: 49 additions & 2 deletions
diff --git a/‎.github/workflows/docs.yml‎
Lines changed: 28 additions & 0 deletions b/‎.github/workflows/docs.yml‎
Lines changed: 28 additions & 0 deletions
diff --git a/‎CONTRIBUTING.md‎
Lines changed: 113 additions & 17 deletions b/‎CONTRIBUTING.md‎
Lines changed: 113 additions & 17 deletions
diff --git a/‎README.md‎
Lines changed: 83 additions & 5 deletions b/‎README.md‎
Lines changed: 83 additions & 5 deletions
@@ -25,6 +25,53 @@ jobs:
         run: |
           python -m pip install --upgrade pip
           pip install -e ".[dev,accurate]"
+          pip install pytest-cov
 
-      - name: Run tests
-        run: pytest tests/ -v --tb=short
+      - name: Run tests with coverage
+        run: pytest tests/ -v --tb=short --cov=scripts --cov=claw_compactor --cov-report=xml --cov-report=term-missing
+
+      - name: Upload coverage to Codecov
+        if: matrix.python-version == '3.12'
+        uses: codecov/codecov-action@v4
+        with:
+          file: ./coverage.xml
+          fail_ci_if_error: false
+        env:
+          CODECOV_TOKEN: ${{ secrets.CODECOV_TOKEN }}
+
+      - name: Generate test badge data
+        if: matrix.python-version == '3.12' && github.ref == 'refs/heads/main'
+        run: |
+          # Count passed tests from pytest output
+          RESULT=$(pytest tests/ --tb=no -q 2>&1 | tail -1)
+          PASSED=$(echo "$RESULT" | grep -oP '\d+ passed' | grep -oP '\d+' || echo "0")
+          FAILED=$(echo "$RESULT" | grep -oP '\d+ failed' | grep -oP '\d+' || echo "0")
+
+          if [ "$FAILED" = "0" ]; then
+            COLOR="brightgreen"
+            MSG="${PASSED} passed"
+          else
+            COLOR="red"
+            MSG="${PASSED} passed, ${FAILED} failed"
+          fi
+
+          mkdir -p .badges
+          cat > .badges/tests.json << EOF
+          {
+            "schemaVersion": 1,
+            "label": "tests",
+            "message": "$MSG",
+            "color": "$COLOR"
+          }
+          EOF
+
+      - name: Deploy badge to gist
+        if: matrix.python-version == '3.12' && github.ref == 'refs/heads/main'
+        uses: schneegans/dynamic-badges-action@v1.7.0
+        with:
+          auth: ${{ secrets.GIST_TOKEN }}
+          gistID: ${{ vars.BADGE_GIST_ID }}
+          filename: claw-compactor-tests.json
+          label: tests
+          message: ${{ env.TEST_MSG || 'passing' }}
+          color: ${{ env.TEST_COLOR || 'brightgreen' }}
@@ -0,0 +1,28 @@
+name: Deploy Docs
+
+on:
+  push:
+    branches: [main]
+    paths:
+      - 'docs/**'
+      - 'mkdocs.yml'
+
+permissions:
+  contents: write
+
+jobs:
+  deploy:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+
+      - name: Set up Python
+        uses: actions/setup-python@v5
+        with:
+          python-version: '3.12'
+
+      - name: Install mkdocs
+        run: pip install mkdocs-material
+
+      - name: Deploy to GitHub Pages
+        run: mkdocs gh-deploy --force
@@ -1,35 +1,131 @@
 # Contributing to Claw Compactor
 
-Thanks for your interest in contributing!
+Thanks for your interest in contributing! Claw Compactor is an open-source project and we welcome contributions of all kinds.
 
-## Quick Start
+## Getting Started
+
+### Prerequisites
+
+- Python 3.9+
+- Git
+
+### Setup
 
 ```bash
-git clone https://github.com/aeromomo/claw-compactor.git
+# Fork the repository on GitHub, then:
+git clone https://github.com/YOUR_USERNAME/claw-compactor.git
 cd claw-compactor
-pip install -e .
+
+# Create a virtual environment
+python -m venv .venv
+source .venv/bin/activate  # or .venv\Scripts\activate on Windows
+
+# Install in development mode with all extras
+pip install -e ".[dev,accurate]"
+
+# Verify everything works
+pytest tests/ -x -q
 ```
 
-## Development
+## Development Workflow
 
-- Python 3.9+
-- Run tests: `python -m pytest`
-- This project uses MIT license
+### 1. Find Something to Work On
+
+- Check [open issues](https://github.com/open-compress/claw-compactor/issues) — look for `good first issue` or `help wanted` labels
+- Have an idea? Open an issue first to discuss before investing time
+
+### 2. Create a Branch
 
-## Pull Request Process
+```bash
+git checkout -b feat/your-feature-name
+# or: fix/your-bug-fix, docs/your-doc-update
+```
+
+### 3. Make Your Changes
+
+- Follow existing code style and patterns
+- Keep changes focused — one feature or fix per PR
+- Add tests for new functionality
+
+### 4. Test Your Changes
+
+```bash
+# Run the full test suite
+pytest tests/ -x -q
+
+# Run a specific test file
+pytest tests/test_fusion_engine.py -v
+
+# Run with coverage
+pytest tests/ --cov=scripts --cov=claw_compactor --cov-report=term-missing
+```
 
-1. Fork the repository
-2. Create a branch from `main`
-3. Make focused changes with tests
-4. Open a descriptive PR
+All PRs must pass CI on Python 3.9–3.12. The test suite has 1600+ tests — don't be alarmed, they run fast.
+
+### 5. Submit a PR
+
+1. Push your branch to your fork
+2. Open a Pull Request against `main`
+3. Fill in the PR template with a clear description
+4. Link any related issues
+
+## Code Guidelines
+
+### Architecture
+
+Claw Compactor is built around a 14-stage Fusion Pipeline. Each stage is a self-contained compressor inheriting from `FusionStage`. See [ARCHITECTURE.md](ARCHITECTURE.md) for the full design.
+
+### Key Principles
+
+- **Immutability** — `FusionContext` is frozen. Every stage produces a new `FusionResult`. Never mutate inputs.
+- **Gate-before-compress** — Each stage has `should_apply()`. If a stage doesn't apply to the content type, it should be a no-op at zero cost.
+- **Zero required dependencies** — The core pipeline runs without any external packages. Optional dependencies (tiktoken, tree-sitter) are runtime-detected.
+
+### Adding a New Fusion Stage
+
+1. Create a new file in `scripts/lib/fusion/stages/`
+2. Inherit from `FusionStage`
+3. Implement `should_apply()` and `apply()`
+4. Register it in the stage registry
+5. Add tests covering happy path, edge cases, and the gate condition
+
+```python
+from scripts.lib.fusion.base import FusionStage, FusionContext, FusionResult
+
+class MyStage(FusionStage):
+    name = "my_stage"
+    order = 22  # controls execution order in the pipeline
+
+    def should_apply(self, ctx: FusionContext) -> bool:
+        return ctx.content_type == "log"
+
+    def apply(self, ctx: FusionContext) -> FusionResult:
+        compressed = my_logic(ctx.content)
+        return FusionResult(content=compressed, ...)
+```
+
+### Style
+
+- Type hints on all public functions
+- Docstrings for non-obvious logic
+- Functions under 50 lines, files under 800 lines
+- No deep nesting (4 levels max)
 
 ## Reporting Issues
 
 Please include:
-- Clear reproduction steps
-- Expected vs actual behavior
-- Python version and OS
+
+- **Python version** (`python --version`)
+- **OS** (macOS, Linux, Windows)
+- **Steps to reproduce** — minimal example preferred
+- **Expected vs actual behavior**
+- **Traceback** if applicable
+
+## Community
+
+- [Discord](https://discord.com/invite/clawd) — ask questions, discuss ideas
+- [GitHub Discussions](https://github.com/open-compress/claw-compactor/discussions) — longer-form conversations
 
 ## License
 
-By contributing, you agree your contributions will be licensed under MIT.
+By contributing, you agree that your contributions will be licensed under the [MIT License](LICENSE).
@@ -35,16 +35,16 @@
 ![Claw Compactor Banner](assets/banner.png)
 
 [![CI](https://github.com/open-compress/claw-compactor/actions/workflows/ci.yml/badge.svg)](https://github.com/open-compress/claw-compactor/actions)
-[![Tests](https://img.shields.io/badge/tests-1663%20passed-brightgreen)](https://github.com/open-compress/claw-compactor)
+[![codecov](https://codecov.io/gh/open-compress/claw-compactor/graph/badge.svg)](https://codecov.io/gh/open-compress/claw-compactor)
 [![Python](https://img.shields.io/badge/python-3.9%2B-blue)](https://python.org)
 [![License](https://img.shields.io/badge/license-MIT-purple)](LICENSE)
 [![PyPI](https://img.shields.io/pypi/v/claw-compactor?color=blue&label=PyPI)](https://pypi.org/project/claw-compactor/)
 [![Downloads](https://img.shields.io/pypi/dm/claw-compactor?color=green&label=downloads)](https://pypi.org/project/claw-compactor/)
 [![Stars](https://img.shields.io/github/stars/open-compress/claw-compactor?style=social)](https://github.com/open-compress/claw-compactor)
 
-**15–82% compression depending on content &middot; Zero LLM inference cost &middot; Reversible &middot; 1663 tests**
+**15–82% compression depending on content &middot; Zero LLM inference cost &middot; Reversible &middot; 1600+ tests**
 
-[Architecture](ARCHITECTURE.md) &middot; [Benchmarks](#benchmarks) &middot; [Quick Start](#quick-start) &middot; [API](#api)
+[Documentation](https://open-compress.github.io/claw-compactor) &middot; [Architecture](ARCHITECTURE.md) &middot; [Benchmarks](#benchmarks) &middot; [Quick Start](#quick-start) &middot; [API](#api)
 
 </div>
 
@@ -54,6 +54,62 @@
 
 Claw Compactor is an open-source **LLM token compression engine** built around a 14-stage **Fusion Pipeline**. Each stage is a specialized compressor — from AST-aware code analysis to JSON statistical sampling to simhash-based deduplication — chained through an immutable data flow architecture where each stage's output feeds the next.
 
+### Demo
+
+```
+$ claw-compactor benchmark ./my-workspace
+
+  Claw Compactor v7.0 — Fusion Pipeline Benchmark
+  ─────────────────────────────────────────────────
+
+  Scanning workspace... 47 files, 234,891 tokens
+
+  Stage Results:
+  ┌──────────────────┬──────────┬───────────┬──────────┐
+  │ Stage            │ Applied  │ Reduction │ Time     │
+  ├──────────────────┼──────────┼───────────┼──────────┤
+  │ Cortex           │ 47/47    │ —         │ 12ms     │
+  │ Photon           │ 3/47     │ 2.1%      │ 4ms      │
+  │ RLE              │ 41/47    │ 8.3%      │ 6ms      │
+  │ SemanticDedup    │ 47/47    │ 12.7%     │ 18ms     │
+  │ Ionizer          │ 8/47     │ 71.2%     │ 9ms      │
+  │ Neurosyntax      │ 23/47    │ 18.4%     │ 31ms     │
+  │ TokenOpt         │ 47/47    │ 4.1%      │ 3ms      │
+  │ Abbrev           │ 12/47    │ 6.8%      │ 5ms      │
+  └──────────────────┴──────────┴───────────┴──────────┘
+
+  Summary:
+    Before:  234,891 tokens ($2.35 at GPT-4 rates)
+    After:   108,250 tokens ($1.08)
+    Saved:   126,641 tokens (53.9%) — $1.27/run
+    Time:    88ms total
+
+  Estimated monthly savings at 100 runs/day: $3,810
+```
+
+---
+
+## How It Compares
+
+| Feature | Claw Compactor | LLMLingua-2 | SelectiveContext | gzip + base64 |
+|:--------|:-:|:-:|:-:|:-:|
+| Compression rate | 15–82% | 30–70% | 10–40% | 60–80% |
+| ROUGE-L @ 0.3 | **0.653** | 0.346 | ~0.4 | N/A |
+| ROUGE-L @ 0.5 | **0.723** | 0.570 | ~0.6 | N/A |
+| LLM inference cost | **$0** | ~$0.02/call | **$0** | **$0** |
+| Latency | **<50ms** | ~300ms | ~200ms | <10ms |
+| Reversible | **Yes** | No | No | Yes (manual) |
+| Content-aware routing | **14 stages** | 1 (perplexity) | 1 (self-info) | None |
+| AST-aware code handling | **Yes** (tree-sitter) | No | No | No |
+| JSON schema sampling | **Yes** | No | No | No |
+| Log/diff/search stages | **Yes** | No | No | No |
+| Required dependencies | **0** | torch, transformers | torch | zlib |
+| LLM-readable output | **Yes** | Partial | Partial | **No** |
+
+**Why Claw Compactor wins:** LLMLingua-2 drops tokens by perplexity score — effective for natural language, but destroys code identifiers, JSON keys, and log patterns. Claw Compactor uses content-type-aware stages that understand the structure of what they're compressing.
+
+---
+
 ```
 Input
   |
@@ -286,7 +342,7 @@ See [ARCHITECTURE.md](ARCHITECTURE.md) for the full technical deep-dive:
 - How to extend the pipeline
 
 ```
-12,000+ lines Python  ·  1,676 tests  ·  14 fusion stages  ·  0 external ML dependencies
+12,000+ lines Python  ·  1,600+ tests  ·  14 fusion stages  ·  0 external ML dependencies
 ```
 
 ---
@@ -312,11 +368,22 @@ pip install -e ".[dev,accurate]"
 
 ---
 
+## Who Uses This
+
+| Project | How |
+|:--------|:----|
+| [OpenClaw](https://openclaw.ai) | Built-in skill for all OpenClaw AI agents — compresses workspace context before every LLM call |
+| [OpenCompress](https://opencompress.ai) | Production compression engine powering the OpenCompress API |
+
+Using Claw Compactor? [Open a PR](https://github.com/open-compress/claw-compactor/pulls) to add yourself here.
+
+---
+
 ## Project Stats
 
 | Metric | Value |
 |:-------|:------|
-| Tests | 1,676 passed |
+| Tests | 1,600+ passed |
 | Python source | 12,000+ lines |
 | Fusion stages | 14 |
 | Languages detected | 16 |
@@ -328,12 +395,23 @@ pip install -e ".[dev,accurate]"
 
 ---
 
+## Contributing
+
+See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines on:
+- Setting up the development environment
+- Adding new Fusion stages
+- Running the test suite
+- Submitting PRs
+
+---
+
 ## Related
 
 - [OpenClaw](https://openclaw.ai) — AI agent platform
 - [ClawhubAI](https://clawhub.com) — Agent skills marketplace
 - [OpenClaw Discord](https://discord.com/invite/clawd) — Community
 - [OpenClaw Docs](https://docs.openclaw.ai) — Documentation
+- [Full Documentation](https://open-compress.github.io/claw-compactor) — GitHub Pages docs
 
 ---