[Mirror] [ENH](frontend): block functions on topology dbs (upstream #6836) by hashbender · Pull Request #5 · hashbender/chroma

hashbender · 2026-04-09T21:14:05Z

Mirrored from chroma-core#6836 — Frontend validation for multi-region + integration tests

Add backend overrides for manifest bounds and bounded fragment scans.

Use the Spanner-backed manifest manager to:
- derive manifest bounds without loading the full manifest
- scan fragments from the compaction offset with a position_limit index
- let log-service scout and pull logs through the partial-read path
- expose a phase-0 garbage reset helper for GC recovery

Essentially, this patch makes a bounded manifest fetch possible. As Spanner manifests grow, the cost of fetching the full manifest begins to dominate. This adds a fetch path whose cost is proportional to the fragments scanned.

Co-authored-by: AI

Introduce dedicated multi-cluster multi-region (MCMR) property tests that write records across two regions and verify cross-region replication invariants. Parameterize existing test_add tests by database name so they run against both classic and MCMR databases when multi-region is enabled.

Fix dirty log detection in the repl manifest manager:
- Treat intrinsic_cursor=0 as unset so newly initialized logs are not spuriously reported as dirty
- Initialize manifest_regions rows with intrinsic_cursor=0 instead of omitting the column, preventing GC from misinterpreting the starting offset as an active reader position
- Change the SQL threshold comparison from > to >= so logs at the exact threshold boundary are correctly included
- Add decode_intrinsic_cursor helper that filters out zero values

Retry Spanner-aborted flush compaction transactions:
- Add is_spanner_aborted() to SysDbError to detect gRPC Aborted status
- Wrap the flush_collection_compaction read-write transaction with backon retry (100ms delay, 4 attempts) on Spanner abort
- Propagate gRPC status codes through SysDbError and FlushCompactionError instead of mapping all Spanner errors to Internal

Improve dirty log test diagnostics by dumping in-memory and Spanner state on assertion failure. Add regression tests for single-add and three-add dirty log scenarios.

Co-authored-by: AI
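The retry-on-abort behavior described above can be illustrated with a small standard-library sketch. This is not the code from the PR, which uses the backon crate around flush_collection_compaction. The TxnError type and the retry_on_abort helper are hypothetical stand-ins, and the sketch assumes that "4 attempts" means four calls in total with a fixed delay between them.

```rust
use std::thread::sleep;
use std::time::Duration;

/// Hypothetical stand-in for the real error type; only the
/// "aborted vs. everything else" distinction matters here.
#[derive(Debug, PartialEq)]
enum TxnError {
    /// The transaction was aborted (gRPC Aborted) and may be retried.
    Aborted,
    /// Any other failure; returned to the caller without retrying.
    Other(String),
}

impl TxnError {
    /// Mirrors the role of is_spanner_aborted() named in the commit message.
    fn is_spanner_aborted(&self) -> bool {
        matches!(self, TxnError::Aborted)
    }
}

/// Run `op` up to `max_attempts` times in total, sleeping `delay` between
/// attempts, and retry only when the error reports an abort.
fn retry_on_abort<T>(
    max_attempts: u32,
    delay: Duration,
    mut op: impl FnMut() -> Result<T, TxnError>,
) -> Result<T, TxnError> {
    let mut attempt = 1;
    loop {
        match op() {
            // Retryable failure with attempts remaining: wait, then loop.
            Err(e) if e.is_spanner_aborted() && attempt < max_attempts => {
                attempt += 1;
                sleep(delay);
            }
            // Success, a non-retryable error, or attempts exhausted.
            other => return other,
        }
    }
}
```

With the values from the commit message, a call would look like `retry_on_abort(4, Duration::from_millis(100), || run_txn())`, where `run_txn` is a placeholder for the read-write transaction. Non-abort errors are returned on the first failure, which fits the stated goal of propagating the original gRPC status rather than masking it.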

This reverts commit 0eebd27.

Reject attached function operations when the database name carries a topology prefix, reusing the same multi-region validation path as collection forking. Add targeted frontend endpoint tests covering attach, get, and detach against topology-prefixed databases.

Co-authored-by: AI
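As a rough illustration of the validation described above, the sketch below rejects an operation when the database name carries a topology prefix. The PR text does not show the prefix format, so the "<topology>+<database>" convention, the ValidationError type, and both function names are assumptions made purely for this example.

```rust
/// Hypothetical error surfaced to the client when an operation is not
/// available on multi-region (topology-prefixed) databases.
#[derive(Debug, PartialEq)]
enum ValidationError {
    UnsupportedForMultiRegion { operation: &'static str },
}

/// ASSUMPTION: a topology-prefixed name looks like "<topology>+<database>".
/// The real delimiter and parsing rules are not shown in this PR.
fn topology_prefix(database_name: &str) -> Option<&str> {
    match database_name.split_once('+') {
        Some((topology, db)) if !topology.is_empty() && !db.is_empty() => Some(topology),
        _ => None,
    }
}

/// Shared guard that attach, get, and detach handlers could call before
/// doing any work, in the same way the commit reuses one validation path.
fn ensure_not_multi_region(
    database_name: &str,
    operation: &'static str,
) -> Result<(), ValidationError> {
    if topology_prefix(database_name).is_some() {
        return Err(ValidationError::UnsupportedForMultiRegion { operation });
    }
    Ok(())
}
```

Keeping the check in one function means every blocked endpoint fails the same way, which is what makes per-endpoint tests for attach, get, and detach straightforward to write.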

tenki-reviewer · 2026-04-09T21:14:14Z

Tenki Code Review - Complete

Files Reviewed: 25
Findings: 0

Reviewed the PR changes across Rust log/sysdb/frontend paths, Python property tests, and CI/workflow updates. No reachable code quality or security issues meeting the reporting threshold were identified.

Files Reviewed (25 files)

.github/actions/tilt/action.yaml
.github/workflows/_python-tests.yml
.github/workflows/nightly-tests.yml
chromadb/test/conftest.py
chromadb/test/property/invariants.py
chromadb/test/property/test_add.py
chromadb/test/property/test_add_mcmr.py
rust/frontend/src/server.rs
rust/log-service/src/lib.rs
rust/rust-sysdb/src/spanner.rs
rust/rust-sysdb/src/types.rs
rust/spanner-migrations/log_migrations/0010-add_fragments_position_limit_index.spanner.sql
rust/spanner-migrations/log_migrations/log_migrations.sum
rust/sysdb/src/sysdb.rs
rust/wal3/src/gc.rs
rust/wal3/src/interfaces/batch_manager.rs
rust/wal3/src/interfaces/mod.rs
rust/wal3/src/interfaces/repl/manifest_manager.rs
rust/wal3/src/lib.rs
rust/wal3/src/manifest.rs
rust/wal3/src/reader.rs
rust/wal3/src/writer.rs
rust/worker/chroma_config.yaml
rust/worker/chroma_mcmr.yaml
rust/worker/chroma_mcmr2.yaml

hashbender · 2026-04-09T21:19:29Z

@tenki-reviewer

tenki-reviewer · 2026-04-09T21:19:39Z

Tenki Code Review - Complete

Files Reviewed: 25
Findings: 1

By Severity:

🟠 Medium: 1

I found one medium-confidence functional regression in log scouting for multi-region databases. The rest of the reviewed production changes look directionally sound, including retry/error-code propagation improvements.

tenki-reviewer

Overall

One actionable issue was identified in production Rust code. Most other changes (retry handling, error code propagation, and MCMR read/scan plumbing) appear consistent with intended behavior.

Key Finding

Missing-resource semantics regression in rust/log-service/src/lib.rs: in scout_logs, the topology path treats manifest_bounds_and_witness() == None as an empty log ((1,1)) instead of returning NotFound. This can cause nonexistent collections to be reported as successful empty logs.

Security Review Notes

Reviewed request-handling and storage-facing paths in rust/frontend/src/server.rs, rust/log-service/src/lib.rs, rust/rust-sysdb/src/spanner.rs, and rust/sysdb/src/sysdb.rs.
No high-confidence injection, auth bypass, or secret-handling vulnerabilities were found in changed code.

tenki-reviewer · 2026-04-09T21:22:22Z

rust/log-service/src/lib.rs

+                    bounds_and_witness.bounds.oldest_timestamp,
+                    bounds_and_witness.bounds.next_write_timestamp,
+                ),
+                None => (LogPosition::from_offset(1), LogPosition::from_offset(1)),


🟠 scout_logs hides missing multi-region collections as empty logs (bug)

When database_name includes a topology, scout_logs uses manifest_bounds_and_witness() and treats None as an empty log (offset 1,1). For the repl manifest manager, None means no manifest row exists for that collection/region, so invalid or missing collections are reported as successful empty logs instead of NotFound. This regresses behavior from the non-topology branch, which maps uninitialized/missing logs to a not-found error.

💡 Suggestion: Return Status::not_found when manifest_bounds_and_witness() is None in the topology branch, matching the non-topology semantics for missing collections.

Suggested change

- None => (LogPosition::from_offset(1), LogPosition::from_offset(1)),
+ None => return Err(Status::not_found(format!("collection {collection_id} not found"))),

📋 Prompt for AI Agents

In rust/log-service/src/lib.rs around line 2174 inside scout_logs, change the topology_name.is_some() match arm so that None from log_reader.manifest_bounds_and_witness() returns Status::not_found(format!("collection {collection_id} not found")) instead of defaulting to offsets (1,1). This preserves missing-collection semantics and avoids falsely reporting nonexistent logs as empty.
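The difference between the two behaviors can be shown in isolation. The sketch below models only the reviewer's suggested semantics and is not the log-service code: Bounds, ScoutError, and scout_bounds are hypothetical names, and plain u64 offsets stand in for LogPosition.

```rust
/// Hypothetical stand-in for the bounds read from the manifest.
#[derive(Debug, Clone, Copy, PartialEq)]
struct Bounds {
    oldest: u64,
    next_write: u64,
}

/// Hypothetical stand-in for the gRPC status returned to the caller.
#[derive(Debug, PartialEq)]
enum ScoutError {
    NotFound(String),
}

/// Map the optional manifest bounds to a scout result. A missing manifest
/// row becomes NotFound instead of being reported as the empty range (1, 1).
fn scout_bounds(
    collection_id: &str,
    bounds: Option<Bounds>,
) -> Result<(u64, u64), ScoutError> {
    match bounds {
        Some(b) => Ok((b.oldest, b.next_write)),
        None => Err(ScoutError::NotFound(format!(
            "collection {collection_id} not found"
        ))),
    }
}
```

Under this mapping a caller can distinguish "the log exists but has no records" from "no log exists for this collection and region", which the (1, 1) default would conflate.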

devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 8 additional findings.

rescrv added 23 commits April 6, 2026 14:09

propel

9178e0c

more spanner in sysdb

571fed5

clippy

4a28309

improve tracing

f3a4ee0

add slopified logs

0eebd27

batch size for test add mcmr

1c04b8a

more backoff/retry of queries for ci

fcb32d1

undo python change

ce84eb1

additional wait time as it exceeded 240 locally

63bfe01

cut miniscule tests in half and do similar for large

a5ce2c0

selectively increase MIN_RECORDS_BETWEEN_COMPACTION_WAITS

e6d4806

typo

4d3c6f0

retry on cancelled

60901c4

cut tests for experiment

ba7773e

Do not fail test_add on 2-record recordset

a684e43

remove traceback

a2a3802

Revert "add slopified logs"

f37ffc3

This reverts commit 0eebd27.

cleanup

c2ac926

clippy

3dbee45

cargo fmt

4c35d83

Repository owner deleted a comment from tenki-reviewer bot Apr 9, 2026

tenki-reviewer bot reviewed Apr 9, 2026

devin-ai-integration bot reviewed Apr 9, 2026

Repository owner deleted a comment from github-actions bot Apr 10, 2026

hashbender wants to merge 23 commits into main from mirror/upstream-pr-6836
