Skip to content

Commit 0cae47f

Browse files
docs: reframe README around memory reliability, add differentiator (#86)
- Lead with 'Your agent remembers what matters' instead of token reduction - Add one-sentence differentiator near the top - Reframe the problem statement around agent forgetfulness and trust - Add 'Remember' stage to the How It Works table - Document new memory endpoints (expire, supersede) and MCP tools - Move token efficiency to a supporting detail, not the headline Closes #80 Closes #81 Co-authored-by: Ona <no-reply@ona.com>
1 parent 2ff45fd commit 0cae47f

1 file changed

Lines changed: 12 additions & 9 deletions

File tree

README.md

Lines changed: 12 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -8,11 +8,11 @@
88

99
[![Build with Ona](https://ona.com/build-with-ona.svg)](https://app.ona.com/#https://github.com/siddhant-k-code/distill)
1010

11-
**Open-source context preprocessing for LLM applications.**
11+
**Your agent remembers what matters.**
1212

13-
Distill sits between your application and any LLM. It cleans up context before it's sent: deduplicating semantically redundant chunks, compressing conversation history as it ages, and placing cache markers on stable content so Anthropic's prompt cache actually fires.
13+
Distill gives LLM agents persistent, deduplicated memory that survives across sessions. It prevents repeated re-learning, surfaces conflicting information before it causes mistakes, and compresses aging context so the signal stays high.
1414

15-
The result: fewer tokens sent, lower cost per request, and context windows that don't fill up with noise.
15+
> Other tools compress what goes into your agent. Distill controls what your agent *remembers* — across sessions, without conflicts, ranked by what matters now.
1616
1717
**[Learn more →](https://distill.siddhantkhare.com)**
1818

@@ -22,29 +22,30 @@ The result: fewer tokens sent, lower cost per request, and context windows that
2222
RAG / tools / memory / docs
2323
2424
Distill
25-
(dedupe · compress · cache)
25+
(remember · dedupe · compress · cache)
2626
2727
LLM
2828
```
2929

3030
## The Problem
3131

32-
30-40% of context assembled from multiple sources is semantically redundant. The same information arrives from docs, code, memory, and tool outputs, all competing for attention in the same prompt.
32+
Agents forget. Every new session starts from zero — the same constraints, preferences, and facts have to be re-established. When context does persist, 30-40% of it is semantically redundant, and contradictory information sits side by side with no signal about which version is current.
3333

34-
This causes non-deterministic outputs, confused reasoning, and failures that only show up at scale. Better prompts don't fix it. The context going in needs to be clean.
34+
This causes non-deterministic outputs, confused reasoning, and failures that only show up at scale. Better prompts don't fix it. The agent needs memory it can trust.
3535

3636
## How It Works
3737

3838
No LLM calls. Fully deterministic. ~12ms overhead.
3939

4040
| Stage | What it does |
4141
|-------|-------------|
42+
| **Remember** | Persistent memory across sessions with write-time dedup, expiry, and sensitivity tagging |
4243
| **Deduplicate** | Cluster semantically similar chunks, keep one representative per cluster |
4344
| **Compress** | Extractive compression to remove noise and preserve signal |
4445
| **Summarize** | Progressively condense conversation history as turns age |
4546
| **Cache** | Annotate stable prefixes with `cache_control`, track TTL per prefix |
4647

47-
All four stages chain together via `POST /v1/pipeline` or `distill pipeline` CLI.
48+
All stages chain together via `POST /v1/pipeline` or `distill pipeline` CLI. Memory is available via `--memory` flag.
4849

4950
### Dedup pipeline
5051

@@ -273,7 +274,7 @@ Memory tools are available in Claude Desktop, Cursor, and other MCP clients when
273274
distill mcp --memory
274275
```
275276

276-
Tools exposed: `store_memory`, `recall_memory`, `forget_memory`, `memory_stats`.
277+
Tools exposed: `store_memory`, `recall_memory`, `forget_memory`, `memory_expire`, `memory_supersede`, `memory_stats`.
277278

278279
### How Decay Works
279280

@@ -422,9 +423,11 @@ distill completion powershell | Out-String | Invoke-Expression
422423
| GET | `/v1/batch/{id}` | Poll batch job status and progress |
423424
| GET | `/v1/batch/{id}/results` | Retrieve completed batch results |
424425
| POST | `/v1/retrieve` | Query vector DB with dedup (requires backend) |
425-
| POST | `/v1/memory/store` | Store memories with write-time dedup (requires `--memory`) |
426+
| POST | `/v1/memory/store` | Store memories with write-time dedup and sensitivity tagging (requires `--memory`) |
426427
| POST | `/v1/memory/recall` | Recall memories by relevance + recency (requires `--memory`) |
427428
| POST | `/v1/memory/forget` | Remove memories by ID, tag, or age (requires `--memory`) |
429+
| POST | `/v1/memory/expire` | Mark memories as expired without deleting (requires `--memory`) |
430+
| POST | `/v1/memory/supersede` | Replace a memory with a newer version (requires `--memory`) |
428431
| GET | `/v1/memory/stats` | Memory store statistics (requires `--memory`) |
429432
| POST | `/v1/session/create` | Create a session with token budget (requires `--session`) |
430433
| POST | `/v1/session/push` | Push entries with dedup + budget enforcement (requires `--session`) |

0 commit comments

Comments
 (0)