return []string{"Local deterministic no-ranking baseline in cmd/goncho-bench; uses the recency order before current Goncho ranking."}
case "agentmemory":
- return []string{"External Python probe: scripts/bench_agentmemory_locomo.py --capability.", "Comparable when AGENTMEMORY_SOURCE_DIR points at PR #583 / commit 9b18a80c9d2839b025279978d3f4b5e1f9bc6e74 with npm dependencies installed.", "Adapter path uses standalone InMemoryKV fallback: memory_save external_id plus metadata.memory_id, then memory_smart_search. This validates stable IDs but is not the full running agentmemory server.", "If AGENTMEMORY_SOURCE_DIR is absent, agentmemory is marked not comparable."}
+ return []string{"External Python probe: scripts/bench_agentmemory_locomo.py --capability.", "Comparable when AGENTMEMORY_SOURCE_DIR points at https://github.com/rohitg00/agentmemory PR #583 / commit 9b18a80c9d2839b025279978d3f4b5e1f9bc6e74 with npm dependencies installed.", "Adapter path uses standalone InMemoryKV fallback: memory_save external_id plus metadata.memory_id, then memory_smart_search. This validates stable IDs but is not the full running agentmemory server.", "If AGENTMEMORY_SOURCE_DIR is absent, agentmemory is marked not comparable."}
case "mem0":
return []string{"External Python probe: scripts/bench_mem0_locomo.py --capability.", "Exact package version used in this run: none; backend marked not comparable before scoring.", "Install candidate: pip install mem0ai, with local/vector dependencies configured by upstream mem0 docs.", "Current status: not comparable in this harness until search results return caller-supplied memory_id unchanged without LLM answer scoring."}
b.WriteString("- Goncho, Goncho no-rank, BM25, and SQLite FTS5 are local Go adapters with no hosted dependency.\n")
- b.WriteString("- agentmemory probe: `python3 scripts/bench_agentmemory_locomo.py --capability`. Comparable when `AGENTMEMORY_SOURCE_DIR` points at PR #583 / commit `9b18a80c9d2839b025279978d3f4b5e1f9bc6e74` with npm dependencies installed. This adapter uses the standalone InMemoryKV fallback, not the full running agentmemory server.\n")
+ b.WriteString("- agentmemory probe: `python3 scripts/bench_agentmemory_locomo.py --capability`. Comparable when `AGENTMEMORY_SOURCE_DIR` points at `https://github.com/rohitg00/agentmemory` PR #583 / commit `9b18a80c9d2839b025279978d3f4b5e1f9bc6e74` with npm dependencies installed. This adapter uses the standalone InMemoryKV fallback, not the full running agentmemory server.\n")
b.WriteString("- mem0 probe: `python3 scripts/bench_mem0_locomo.py --capability`. Exact package version used here: none; backend is marked not comparable before scoring. Candidate install: `pip install mem0ai` plus upstream local vector-store dependencies. Comparable only after configured local retrieval can return caller-supplied `memory_id` without answer-generation scoring.\n")
b.WriteString("\n## Interpretation\n\nBackends marked not comparable are excluded from score claims until they implement the `MemoryBackend` contract and return the same stable `memory_id` values that were inserted. This keeps the arena fair and prevents answer-generation or LLM-judge effects from leaking into retrieval metrics.\n")
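The stable-ID rule stated in the Interpretation text can be sketched in Go as a round-trip check. This is only an illustration: the `Insert` and `Search` methods on `MemoryBackend`, the `mapBackend` toy type, and the `returnsStableIDs` helper are hypothetical names chosen here, and the harness's real contract may look different.

```go
package main

import (
	"fmt"
	"strings"
)

// MemoryBackend is a hypothetical, minimal version of the contract; the
// harness's real interface may have different methods and signatures.
type MemoryBackend interface {
	Insert(memoryID, content string) error
	Search(query string, limit int) ([]string, error)
}

// mapBackend is a toy in-memory backend used only to exercise the check.
// When rewriteIDs is true it mimics a backend that mints its own IDs.
type mapBackend struct {
	order      []string
	docs       map[string]string
	rewriteIDs bool
}

func (m *mapBackend) Insert(memoryID, content string) error {
	if m.docs == nil {
		m.docs = make(map[string]string)
	}
	if _, exists := m.docs[memoryID]; !exists {
		m.order = append(m.order, memoryID)
	}
	m.docs[memoryID] = content
	return nil
}

func (m *mapBackend) Search(query string, limit int) ([]string, error) {
	var ids []string
	for _, id := range m.order {
		if len(ids) >= limit {
			break
		}
		if strings.Contains(m.docs[id], query) {
			if m.rewriteIDs {
				id = "internal-" + id
			}
			ids = append(ids, id)
		}
	}
	return ids, nil
}

// returnsStableIDs inserts the memories, runs one query, and reports whether
// the backend returned at least one ID and every returned ID was caller-supplied.
func returnsStableIDs(b MemoryBackend, memories map[string]string, query string) (bool, error) {
	for id, content := range memories {
		if err := b.Insert(id, content); err != nil {
			return false, err
		}
	}
	got, err := b.Search(query, len(memories))
	if err != nil {
		return false, err
	}
	if len(got) == 0 {
		return false, nil
	}
	for _, id := range got {
		if _, ok := memories[id]; !ok {
			return false, nil
		}
	}
	return true, nil
}

func main() {
	memories := map[string]string{"m1": "Alice adopted a dog", "m2": "Bob moved to Lisbon"}
	ok, _ := returnsStableIDs(&mapBackend{}, memories, "dog")
	fmt.Println("ID-preserving backend comparable:", ok)
	ok, _ = returnsStableIDs(&mapBackend{rewriteIDs: true}, memories, "dog")
	fmt.Println("ID-rewriting backend comparable:", ok)
}
```

A backend that fails this kind of check would stay in the "not comparable" group, since its results could not be scored against the inserted `memory_id` values without some extra mapping step.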
- | agentmemory | comparable with PR source | Set `AGENTMEMORY_SOURCE_DIR` to PR #583 commit `9b18a80c9d2839b025279978d3f4b5e1f9bc6e74`. Stable IDs work; standalone fallback LOCOMO score is `0.0000`, and this is not the full running server. |
+ | agentmemory | comparable with PR source | Set `AGENTMEMORY_SOURCE_DIR` to `https://github.com/rohitg00/agentmemory` at PR #583 commit `9b18a80c9d2839b025279978d3f4b5e1f9bc6e74`. Stable IDs work; standalone fallback LOCOMO score is `0.0000`, and this is not the full running server. |
| mem0 | not comparable | Package is not installed locally; no stable-ID run exists. |
docs/benchmarks/external-backend-adapters.md (2 additions & 2 deletions)
@@ -70,15 +70,15 @@ LOCOMO contains duplicate and near-duplicate content, including repeated content
| Goncho no-rank | local Go harness | yes | Native LOCOMO `memory_id` | Local deterministic no-ranking baseline that uses recency order before current Goncho ranking. |
| BM25 | local Go harness | yes | Native LOCOMO `memory_id` | Local deterministic lexical baseline. |
| SQLite FTS5 | local Go SQLite FTS5 | yes | Native LOCOMO `memory_id` column | Local deterministic lexical baseline. |
- | agentmemory | `@agentmemory/agentmemory 0.9.20`, PR #583 commit `9b18a80c9d2839b025279978d3f4b5e1f9bc6e74` | yes, standalone fallback | `memory_save.external_id` plus `metadata.memory_id` returned by `memory_smart_search` | Stable IDs work. LOCOMO full score is `0.0` for the standalone InMemoryKV fallback because it uses strict all-term substring matching; this is not the full running agentmemory server. |
+ | agentmemory | `https://github.com/rohitg00/agentmemory`, `@agentmemory/agentmemory 0.9.20`, PR #583 commit `9b18a80c9d2839b025279978d3f4b5e1f9bc6e74` | yes, standalone fallback | `memory_save.external_id` plus `metadata.memory_id` returned by `memory_smart_search` | Stable IDs work. LOCOMO full score is `0.0` for the standalone InMemoryKV fallback because it uses strict all-term substring matching; this is not the full running agentmemory server. |
| mem0 | Python `3.12.3`; package not installed locally | no | Not executed | `mem0`/`mem0ai` is not installed in this environment; no stable-ID run can be produced. |
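The table attributes the agentmemory fallback's `0.0` score to strict all-term substring matching. The Go sketch below shows one way such a matcher could behave and why natural-language questions would rarely hit. It is an assumption made for illustration, not agentmemory's actual code; `matchesAllTerms` and the sample strings are hypothetical.

```go
package main

import (
	"fmt"
	"strings"
)

// matchesAllTerms reports whether every whitespace-separated query term
// appears as a case-insensitive substring of content. An empty query
// matches nothing. This is an illustrative guess at "strict all-term
// substring matching", not the upstream implementation.
func matchesAllTerms(content, query string) bool {
	terms := strings.Fields(strings.ToLower(query))
	if len(terms) == 0 {
		return false
	}
	haystack := strings.ToLower(content)
	for _, term := range terms {
		if !strings.Contains(haystack, term) {
			return false
		}
	}
	return true
}

func main() {
	memory := "Alice adopted a dog in May"
	// A full question carries words such as "when" and "did" that the
	// stored memory never contains, so the strict matcher rejects it.
	fmt.Println(matchesAllTerms(memory, "When did Alice adopt her dog?"))
	// A keyword-only query passes because both terms occur in the memory.
	fmt.Println(matchesAllTerms(memory, "alice dog"))
}
```

Under this assumption the question-style query returns false and the keyword query returns true, which is consistent with a benchmark of full questions scoring zero while stable IDs still round-trip correctly.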