Skip to content
Open
Show file tree
Hide file tree
Changes from 7 commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions client/src/cbltest/api/syncgateway.py
Original file line number Diff line number Diff line change
Expand Up @@ -763,8 +763,8 @@ async def _delete_database(self, db_name: str, retry_count: int = 0) -> None:
current_span.add_event("SGW returned 500, retry")
await asyncio.sleep(2)
await self._delete_database(db_name, retry_count + 1)
elif e.code == 403:
pass
elif e.code == 403 or e.code == 404:
pass # Database doesn't exist anyway.
Comment thread
vipbhardwaj marked this conversation as resolved.
Comment thread
vipbhardwaj marked this conversation as resolved.
else:
raise

Expand Down
3 changes: 2 additions & 1 deletion pyproject.toml
Original file line number Diff line number Diff line change
Expand Up @@ -33,10 +33,11 @@ lint = [
members = [ "client" ]

[tool.ty.environment]
root = [".", "client/src"]
root = [".", "client/src", "tests"]

[tool.pytest.ini_options]
asyncio_default_fixture_loop_scope = "session"
pythonpath = ["tests"]
filterwarnings = [
"ignore:Class property max_ttl is deprecated.*:couchbase.logic.supportability.CouchbaseDeprecationWarning",
]
Expand Down
129 changes: 129 additions & 0 deletions spec/tests/QE/test_replication_upgrade_delta_sync.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,129 @@
# Test Cases

These tests cover delta-sync replication behavior across a simulated 3.x → 4.x
SGW upgrade. The pre-upgrade state is materialised by restoring the prebuilt
`upgrade` CBS backup (revtree-only docs, no HLV xattrs) and resetting the CBL
local DB from the matching `upgrade` cblite2 file; both binaries run 4.x
throughout. Delta sync is enabled on the SGW `upgrade` database.

## #1 test_delta_sync_history_pull_post_upgrade_sgw_mutation

### Description

PULL replication of a doc whose 2nd revision was created on 4.x SGW (so SGW
holds both a revtree and an HLV) by a client that still holds the revtree-only
ancestor. With delta sync enabled, SGW must populate the rev message's
`history` field with the revtree predecessors so the client can ingest the
delta. The current build sends an empty `history` here — this test is the
regression marker for the fix, and is marked `xfail(strict=True)` until the
SGW fix lands.

Uses doc `nonconflict_3` from the prebuilt `upgrade` dataset. The 2nd
revision is created in-test by mutating the doc on 4.x SGW, which adds an HLV
in parallel with the new revtree leaf (4.x SGW always writes both).
Comment thread
vipbhardwaj marked this conversation as resolved.
Outdated

```
+------------------+-------------------------------+-------------------------------+
| | CBL | SGW |
| +---------------+---------------+---------------+---------------+
| | Rev Tree | HLV | Rev Tree | HLV |
+------------------+---------------+---------------+---------------+---------------+
| After restore | 2-abc | none | 2-abc | none |
| After SGW mutate | 2-abc | none | 3-xxx, 2-abc | [N@SGW] |
| Expected post-PULL| none | [N@SGW] | 3-xxx, 2-abc | [N@SGW] |
+------------------+---------------+---------------+---------------+---------------+
```

### Steps

1. Delete Sync Gateway 'upgrade' database if exists.
2. Restore Couchbase Server Bucket using `upgrade` dataset.
3. Wait 2s to ensure SG picks up the restored database.
4. Reset local database, and load `upgrade` dataset.
5. Create SG 'upgrade' database with delta_sync enabled and import from bucket.
On 412 (already exists), force-recreate by `delete_database` + `put_database`.
6. Verify delta_sync is actually enabled on SGW 'upgrade' database by fetching
the live config; fail with the active config dumped if not enabled.
7. Create user `user1` with full access to `_default._default`.
8. Mutate `nonconflict_3` on 4.x SGW to produce a new revtree leaf + fresh HLV.
9. Start a replicator:
* endpoint: `/upgrade`
* collections: `_default._default`
* type: pull
* document_ids: `['nonconflict_3']`
* continuous: False
* credentials: user1/pass
10. Wait until the replicator is stopped.
11. Validate revid and HLV of local and remote doc:
* Pre: local has revid + no HLV; SGW has revid + canonical HLV (not
RTE-encoded).
* Post: local has no revid (4.x CBL is HLV-only); local HLV equals SGW HLV.

### Expected Outcome

✅ **With SGW fix**: CBL ingests the delta, ends up HLV-only with HLV matching SGW.
❌ **Without SGW fix** (current build): rev message's `history` field is empty,
client cannot ingest → test fails the postcondition. The `xfail(strict=True)`
marker makes this an expected failure today and an `XPASS` (CI failure) the
day the fix lands, forcing the developer landing the fix to remove the marker.

---

## #2 test_delta_sync_history_pull_pre_upgrade_sgw_two_revs

### Description

PULL replication of a doc whose 2nd revision was created on 3.x SGW (so both
sides have revtree-only state, no HLV anywhere) by a client that holds the
revtree-only ancestor. With delta sync enabled, the client must pull the
newer revtree-only rev and generate an HLV locally using the
Revision-Tree-Encoding (RTE) format. This case is expected to work on the
current build (via the pre-fix code path) and serves as a forward regression
marker once the SGW fix lands.

Uses doc `nonconflict_2` from the prebuilt `upgrade` dataset — its baked-in
state already matches the required pre-state, so no in-test SGW mutation is
needed.

```
+------------------+-------------------------------+-------------------------------+
| | CBL | SGW |
| +---------------+---------------+---------------+---------------+
| | Rev Tree | HLV | Rev Tree | HLV |
+------------------+---------------+---------------+---------------+---------------+
| After restore | 1-abc | none | 2-def, 1-abc | none |
| Expected post-PULL| none | [2def@RTE] | 2-def, 1-abc | none |
+------------------+---------------+---------------+---------------+---------------+
```

### Steps

1. Delete Sync Gateway 'upgrade' database if exists.
2. Restore Couchbase Server Bucket using `upgrade` dataset.
3. Wait 2s to ensure SG picks up the restored database.
4. Reset local database, and load `upgrade` dataset.
5. Create SG 'upgrade' database with delta_sync enabled and import from bucket.
On 412 (already exists), force-recreate by `delete_database` + `put_database`.
6. Verify delta_sync is actually enabled on SGW 'upgrade' database by fetching
the live config; fail with the active config dumped if not enabled.
7. Create user `user1` with full access to `_default._default`.
8. Start a replicator:
* endpoint: `/upgrade`
* collections: `_default._default`
* type: pull
* document_ids: `['nonconflict_2']`
* continuous: False
* credentials: user1/pass
9. Wait until the replicator is stopped.
10. Validate revid and HLV of local and remote doc:
* Pre: both sides have revid and no HLV; CBL revid < SGW revid.
* Post: local has no revid (4.x CBL is HLV-only); local HLV ends with
`@Revision+Tree+Encoding`; SGW HLV is still None (PULL doesn't touch SGW).

### Expected Outcome

✅ **Today** (pre-fix): the existing code path correctly pulls the
revtree-only rev and the client generates an RTE-encoded HLV. Test passes.
✅ **After fix lands**: same path, same outcome. Test continues to pass.
This is the forward regression marker confirming the fix doesn't break the
symmetric, revtree-only case.
Comment thread
vipbhardwaj marked this conversation as resolved.
Outdated
195 changes: 195 additions & 0 deletions tests/QE/test_replication_upgrade_delta_sync.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,195 @@
from pathlib import Path

import pytest
from cbltest import CBLPyTest
from cbltest.api.cbltestclass import CBLTestClass
from cbltest.api.error import CblSyncGatewayBadResponseError
from cbltest.api.replicator_types import ReplicatorType
from cbltest.api.syncgateway import DocumentUpdateEntry, PutDatabasePayload
from shared.upgrade_test_helpers import (
DocSnapshot,
do_upgrade_replication_test,
setup_upgrade_env,
)

_DELTA_SYNC_UPGRADE_CONFIG: dict = {
"bucket": "upgrade",
"num_index_replicas": 0,
"scopes": {
"_default": {
"collections": {
"_default": {
"sync": (
"function(doc, oldDoc, meta) {"
" if (doc._deleted) { channel(oldDoc.channels); }"
" else { channel(doc.channels || 'upgrade'); }"
"}"
)
}
}
}
},
"import_docs": True,
"enable_shared_bucket_access": True,
"delta_sync": {"enabled": True},
}


@pytest.mark.sgw
@pytest.mark.min_test_servers(1)
@pytest.mark.min_sync_gateways(1)
@pytest.mark.min_couchbase_servers(1)
class TestUpgradeDeltaSync(CBLTestClass):
async def _prepare_sg_with_delta_sync(self, cblpytest: CBLPyTest) -> None:
sg = cblpytest.sync_gateways[0]
payload = PutDatabasePayload(_DELTA_SYNC_UPGRADE_CONFIG)

self.mark_test_step(
"Create SG 'upgrade' database with delta_sync enabled and import from bucket"
)
try:
await sg.put_database("upgrade", payload)
except CblSyncGatewayBadResponseError as e:
if e.code != 412:
raise
# DB already exists. Try to force-recreate so our delta_sync
# config is applied. delete_database silently swallows 403
# internally (config-managed DBs), so the delete may be a no-op
# and the retry put may also 412. Tolerate that — the
# verify-config assertion below is the real backstop and will
# dump the active config if delta_sync is not enabled.
await sg.delete_database("upgrade")
try:
await sg.put_database("upgrade", payload)
except CblSyncGatewayBadResponseError as e2:
if e2.code != 412:
raise
await sg.wait_for_db_up("upgrade")

Comment thread
vipbhardwaj marked this conversation as resolved.
self.mark_test_step(
"Verify delta_sync is actually enabled on SGW 'upgrade' database"
)
config = await sg.get_database_config("upgrade")
delta_sync = config.get("delta_sync") or {}
assert delta_sync.get("enabled") is True, (
"Prerequisite failed: SGW 'upgrade' database does not have "
f"delta_sync.enabled=True. Active config: {config!r}"
)

self.mark_test_step("Create user1 for replication")
collection_access = sg.create_collection_access_dict(
{"_default._default": ["*"]}
)
await sg.add_user("upgrade", "user1", "pass", collection_access)

@pytest.mark.asyncio(loop_scope="session")
@pytest.mark.xfail(
strict=False,
reason=(
"SGW delta-sync history bug: when SGW sends a delta of a "
"revtree+HLV rev to a client holding the revtree-only ancestor, "
"the rev message's `history` field is empty. Fix pending. "
"NOTE: non-strict — the bug currently does not surface through "
"end-state revid/HLV/body comparison (likely a BLIP wire-level "
"issue only). When the SGW fix lands, flip strict=True so an "
"XPASS becomes a loud signal to remove this decorator."
),
)
Comment thread
vipbhardwaj marked this conversation as resolved.
Outdated
async def test_delta_sync_history_pull_post_upgrade_sgw_mutation(
self, cblpytest: CBLPyTest, dataset_path: Path
) -> None:
doc_id = "nonconflict_3"
db = await setup_upgrade_env(self, cblpytest, dataset_path)
await self._prepare_sg_with_delta_sync(cblpytest)
sg = cblpytest.sync_gateways[0]

self.mark_test_step(
f"Mutate '{doc_id}' on 4.x SGW to produce a new revtree leaf + fresh HLV"
)
current = await sg.get_document("upgrade", doc_id)
assert current is not None, f"Expected '{doc_id}' imported from bucket"
assert current.revid is not None, (
f"Expected '{doc_id}' to have a revid pre-mutation, got None"
)
new_body = {**current.body, "updated_by": "delta_sync_history_test"}
await sg.update_documents(
"upgrade",
[DocumentUpdateEntry(doc_id, current.revid, body=new_body)],
)

def validator(pre: DocSnapshot, post: DocSnapshot) -> None:
assert pre.local.revid is not None and pre.local.cv is None, (
f"Local precondition invalid: RevID={pre.local.revid}, "
f"HLV={pre.local.cv} (expected revtree-only)"
)
assert pre.remote.revid is not None and pre.remote.cv is not None, (
f"Remote precondition invalid: RevID={pre.remote.revid}, "
f"HLV={pre.remote.cv} (expected revtree + HLV after 4.x mutation)"
)
assert not pre.remote.cv.endswith("@Revision+Tree+Encoding"), (
f"Expected canonical HLV on SGW after 4.x write, got RTE-encoded: "
f"{pre.remote.cv}"
)

assert post.local.revid is None, (
f"Expected post-pull local doc to be HLV-only, "
f"got revid={post.local.revid}"
)
assert post.local.cv and post.local.cv == post.remote.cv, (
f"Expected post-pull local HLV to match SGW HLV. "
f"Local={post.local.cv}, Remote={post.remote.cv}"
)

await do_upgrade_replication_test(
self,
cblpytest,
db,
doc_id=doc_id,
replicator_type=ReplicatorType.PULL,
compare_docs=True,
validator=validator,
)

@pytest.mark.asyncio(loop_scope="session")
async def test_delta_sync_history_pull_pre_upgrade_sgw_two_revs(
self, cblpytest: CBLPyTest, dataset_path: Path
) -> None:
doc_id = "nonconflict_2"
db = await setup_upgrade_env(self, cblpytest, dataset_path)
await self._prepare_sg_with_delta_sync(cblpytest)

def validator(pre: DocSnapshot, post: DocSnapshot) -> None:
assert pre.local.revid is not None and pre.local.cv is None, (
f"Local precondition invalid: RevID={pre.local.revid}, "
f"HLV={pre.local.cv} (expected revtree-only)"
)
assert pre.remote.revid is not None and pre.remote.cv is None, (
f"Remote precondition invalid: RevID={pre.remote.revid}, "
f"HLV={pre.remote.cv} (expected revtree-only, no HLV)"
)
assert pre.local.revid < pre.remote.revid, (
f"Pre-condition: expected local revid < remote revid, "
f"got local={pre.local.revid}, remote={pre.remote.revid}"
)

assert post.local.revid and post.local.revid == post.remote.revid, (
f"Expected post-pull revtree-only doc with matching revids. "
f"Local={post.local.revid}, Remote={post.remote.revid}"
)
assert post.local.cv is None, (
f"Expected post-pull local doc to remain revtree-only "
f"(neither side wrote under 4.x), got HLV={post.local.cv}"
)
Comment thread
vipbhardwaj marked this conversation as resolved.
assert post.remote.cv is None, (
f"Expected SGW HLV unchanged (none) after PULL, got {post.remote.cv}"
)

await do_upgrade_replication_test(
self,
cblpytest,
db,
doc_id=doc_id,
replicator_type=ReplicatorType.PULL,
compare_docs=True,
validator=validator,
)
Loading
Loading