Skip to content

Commit 871ebaa

Browse files
Update README to align with website (#15)
- Add tagline 'Reliable LLM outputs start with clean context' - Add four pillars table (Deduplicate, Compress, Summarize, Cache) - Add 'garbage in, garbage out' context - Expand comparison table with cost per call - Add Integrations section with supported tools Co-authored-by: Ona <no-reply@ona.com>
1 parent 1581b43 commit 871ebaa

1 file changed

Lines changed: 34 additions & 3 deletions

File tree

README.md

Lines changed: 34 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,8 +8,12 @@
88

99
[![Build with Ona](https://ona.com/build-with-ona.svg)](https://app.ona.com/#https://github.com/siddhant-k-code/distill)
1010

11+
**Reliable LLM outputs start with clean context.**
12+
1113
A reliability layer for LLM context. Deterministic deduplication that removes redundancy before it reaches your model.
1214

15+
Less redundant data. Lower costs. Faster responses. More efficient & deterministic results.
16+
1317
**[Learn more →](https://distill.siddhantkhare.com)**
1418

1519
```
@@ -19,16 +23,29 @@ Context sources → Distill → LLM
1923

2024
## The Problem
2125

22-
LLM outputs are unreliable because context is polluted.
26+
LLM outputs are unreliable because context is polluted. "Garbage in, garbage out."
2327

2428
30-40% of context assembled from multiple sources is semantically redundant. Same information from docs, code, memory, and tools competing for attention. This leads to:
2529

2630
- **Non-deterministic outputs** — Same workflow, different results
2731
- **Confused reasoning** — Signal diluted by repetition
2832
- **Production failures** — Works in demos, breaks at scale
2933

34+
You can't fix unreliable outputs with better prompts. You need to fix the context that goes in.
35+
3036
## How It Works
3137

38+
Math, not magic. No LLM calls. Fully deterministic.
39+
40+
| Step | What it does | Benefit |
41+
|------|--------------|---------|
42+
| **Deduplicate** | Remove redundant information across sources | More reliable outputs |
43+
| **Compress** | Keep what matters, remove the noise | Lower token costs |
44+
| **Summarize** | Condense older context intelligently | Longer sessions |
45+
| **Cache** | Instant retrieval for repeated patterns | Faster responses |
46+
47+
### Pipeline
48+
3249
```
3350
Query → Over-fetch (50) → Cluster → Select → MMR Re-rank (8) → LLM
3451
```
@@ -343,14 +360,28 @@ Connect your repo and set `OPENAI_API_KEY` in environment variables.
343360
- **Agent Workflows** - Clean up tool outputs + memory + docs
344361
- **Enterprise** - Deterministic outputs for compliance
345362

346-
## Why Distill?
363+
## Why not just use an LLM?
364+
365+
LLMs are non-deterministic. Reliability requires deterministic preprocessing.
347366

348367
| | LLM Compression | Distill |
349368
|---|---|---|
350369
| Latency | ~500ms | ~12ms |
370+
| Cost per call | $0.01+ | $0.0001 |
351371
| Deterministic | No | Yes |
352-
| Auditable | No | Yes |
353372
| Lossless | No | Yes |
373+
| Auditable | No | Yes |
374+
375+
Use LLMs for reasoning. Use deterministic algorithms for reliability.
376+
377+
## Integrations
378+
379+
Works with your existing AI stack:
380+
381+
- **LLM Providers:** OpenAI, Anthropic
382+
- **Frameworks:** LangChain, LlamaIndex
383+
- **Vector DBs:** Pinecone, Qdrant, Weaviate, Chroma, pgvector
384+
- **Tools:** Cursor, Lovable, and more
354385

355386
## Contributing
356387

0 commit comments

Comments
 (0)