fix(codex): preserve UTF-8 in stop summaries by valbarko · Pull Request #564 · zilliztech/memsearch

valbarko · 2026-06-01T12:33:51Z

Summary

replace byte-based truncation in the Codex stop hook with Unicode-safe character truncation
sanitize fallback summary output before appending to memory markdown
add a regression test that runs the stop worker with long Cyrillic text and reads the resulting memory file as UTF-8
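To illustrate the difference between the two truncation strategies named above, here is a minimal Python sketch. It is not the code in plugins/codex/hooks/stop.sh (which is a shell script); the function names `truncate_bytes` and `truncate_chars` and the limit of 5 are hypothetical and chosen only for the demonstration.

```python
def truncate_bytes(text: str, limit: int) -> bytes:
    """Byte-oriented truncation: may cut a multi-byte UTF-8 sequence in half."""
    return text.encode("utf-8")[:limit]


def truncate_chars(text: str, limit: int) -> str:
    """Character-oriented truncation: slices on code points, never inside one."""
    return text[:limit]


sample = "Привет, мир"  # each Cyrillic letter occupies 2 bytes in UTF-8

# 5 bytes covers two full letters plus the first byte of the third one.
cut = truncate_bytes(sample, 5)
try:
    cut.decode("utf-8")
    bytes_ok = True
except UnicodeDecodeError:
    bytes_ok = False

# 5 characters always yields a string that encodes to valid UTF-8.
safe = truncate_chars(sample, 5)
round_trip = safe.encode("utf-8").decode("utf-8")
```

Slicing a Python `str` works on code points, so it cannot split a UTF-8 sequence. It can still separate a base character from a combining mark, which this sketch does not try to handle.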

Verification

bash -n plugins/codex/hooks/stop.sh
uv run pytest tests/test_codex_stop_hook_utf8.py tests/test_codex_parse_rollout.py
uv run ruff check tests/test_codex_stop_hook_utf8.py
git diff --check

googs1025 · 2026-06-10T02:48:55Z

  _json_val "$work_input" "$key" ""
 }

+_truncate_chars() {


I hit this issue locally with the Codex hook as well. The byte-oriented truncation path can leave invalid UTF-8 in the markdown memory file, and a later
memsearch index .memsearch/memory run then fails with a UnicodeDecodeError when reading that file.
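The failure mode described in this comment can be reproduced in isolation. The sketch below is an assumption-laden stand-in: `read_memory_file` is a hypothetical helper, not a memsearch function, and the file is written to a temporary directory rather than .memsearch/memory. It only shows that a file ending in half of a multi-byte sequence cannot be read back as strict UTF-8.

```python
import tempfile
from pathlib import Path


def read_memory_file(path: Path) -> str:
    """Hypothetical reader: strict UTF-8 decoding, as an indexer might do."""
    return path.read_text(encoding="utf-8")


with tempfile.TemporaryDirectory() as tmp:
    memory = Path(tmp) / "memory.md"

    # Simulate byte-based truncation by dropping the final byte,
    # which leaves a dangling lead byte for the last Cyrillic letter.
    payload = "## Сводка".encode("utf-8")
    memory.write_bytes(payload[:-1])

    try:
        read_memory_file(memory)
        error_name = None
    except UnicodeDecodeError as exc:
        error_name = type(exc).__name__
```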

googs1025 · 2026-06-10T02:49:39Z

This PR looks like the right source-side fix for Codex:

character-based truncation avoids splitting multi-byte UTF-8 sequences
the regression test exercises the worker fallback path with non-ASCII text
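The shape of such a regression check can be sketched as follows. This is not the content of tests/test_codex_stop_hook_utf8.py, which runs the actual stop worker; `append_summary`, `MAX_CHARS`, and the sample text are hypothetical, and the sketch only demonstrates the round trip of truncating long Cyrillic text by characters, appending it to a file, and reading the file back as UTF-8.

```python
import tempfile
from pathlib import Path

MAX_CHARS = 200  # hypothetical limit, not taken from the hook


def append_summary(memory_file: Path, summary: str, max_chars: int = MAX_CHARS) -> None:
    """Truncate by characters, then append one line to the memory file."""
    truncated = summary[:max_chars]
    with memory_file.open("a", encoding="utf-8") as fh:
        fh.write(truncated + "\n")


def check_round_trip() -> str:
    """Write long Cyrillic text and read the result back as strict UTF-8."""
    long_text = "Длинный текст на кириллице. " * 50
    with tempfile.TemporaryDirectory() as tmp:
        memory_file = Path(tmp) / "memory.md"
        append_summary(memory_file, long_text)
        # Raises UnicodeDecodeError if the file holds a split sequence.
        return memory_file.read_text(encoding="utf-8")


content = check_round_trip()
```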

fix(codex): preserve utf8 in stop summaries

e96b565

googs1025 reviewed Jun 10, 2026

valbarko wants to merge 1 commit into zilliztech:main from valbarko:codex/codex-stop-utf8-safe-truncation


2 participants
