MemPalace JS Optimization Plan

This document outlines the strategy for maximizing the performance of the MemPalace Node.js/TypeScript engine, leveraging V8-specific optimizations, asynchronous I/O, and worker threads.

Phase 1: High-Throughput Embedding Pipeline

Goal: Minimize IPC overhead and maximize ONNX runtime efficiency.

Batch Worker Logic:
- Refactor src/storage/embedding_worker.ts to accept string[] instead of single string.
- Update the internal Transformers.js pipeline call to process the batch in one go.
Vector Storage Batching:
- Add getEmbeddings(texts: string[]) to VectorStorage.ts.
- Update upsertDrawer and mine operations to chunk inputs into batches (size 10-20).
Request Coalescing:
- Implement a debouncer in getEmbedding to bundle near-simultaneous tool call requests into a single worker message.

Phase 2: Optimized Transport Layer (MCP Serialization)

Goal: Speed up JSON-RPC communication for AI agents.

Hybrid Schema Serialization:
- Integrate fast-json-stringify.
- Define a pre-compiled schema for core Python-parity fields (id, content, wing, room, filedAt).
- Use additionalProperties: true to support dynamic user-defined metadata.
Parallel Tool Execution:
- Audit all MCP search/recall tools to ensure they use Promise.all() for multi-wing or multi-room lookups.

Phase 3: Memory-Efficient Internal Streaming

Goal: Move from "Buffer-and-Send" to "Stream-and-Yield".

Generator-based Memory Stack:
- Refactor src/core/layers.ts to use AsyncGenerators.
- Yield context chunks as they are retrieved, reducing peak RSS (memory) during large recalls.
Lazy Heuristics:
- Apply regex extraction and AAAK dialect compression on-the-fly as data streams through the pipeline.

Phase 4: Production Packaging & NPM Polish

Goal: Zero-config global installation.

Worker Resolution: Finalize robust absolute path resolution for embedding_worker.js.
Binary Mapping: Map mempalace command to the CLI entry point in package.json.
Asset Inclusion: Ensure hooks/ and documentation are in the NPM files whitelist.

Phase 5: Verification & Benchmarking

Goal: Mathematically prove performance gains.

Latency Audit: Measure TTFM (Time to First Memory) and Ingestion Throughput.
Accuracy Guardrails: Verify 96.4% R@5 accuracy is maintained post-optimization using longmemeval_bench.ts.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MemPalace JS Optimization Plan

Phase 1: High-Throughput Embedding Pipeline

Phase 2: Optimized Transport Layer (MCP Serialization)

Phase 3: Memory-Efficient Internal Streaming

Phase 4: Production Packaging & NPM Polish

Phase 5: Verification & Benchmarking

FilesExpand file tree

OPTIMIZATION_PLAN.md

Latest commit

History

OPTIMIZATION_PLAN.md

File metadata and controls

MemPalace JS Optimization Plan

Phase 1: High-Throughput Embedding Pipeline

Phase 2: Optimized Transport Layer (MCP Serialization)

Phase 3: Memory-Efficient Internal Streaming

Phase 4: Production Packaging & NPM Polish

Phase 5: Verification & Benchmarking