docs/release/ingestion/v0.7.0.mdx
Lines changed: 10 additions & 4 deletions
@@ -1,9 +1,9 @@
 ---
-title: "OLake Go (v0.7.0 - v0.7.2)"
+title: "OLake Go (v0.7.0 - v0.7.3)"
 ---
 
-# OLake Go (v0.7.0 - v0.7.2)
-April 21, 2026 – May 03, 2026
+# OLake Go (v0.7.0 - v0.7.3)
+April 21, 2026 – May 15, 2026
 
 ## 🎯 What's New
 
@@ -15,6 +15,8 @@ April 21, 2026 – May 03, 2026
 
 3. **Schema filtering for PostgreSQL discovery -** <br/> Added an optional schemas config field to restrict the discover operation to user-specified PostgreSQL schemas. When omitted, existing behaviour is preserved and all non-system schemas are discovered.
 
+4. **MSSQL read replica support -** <br/> Added optional `jdbc_url_params` to the MSSQL source so you can target Always On read replicas (for example with read-intent), and updated CDC to use replica-safe paths that avoid primary-only agent/msdb and capture-instance management on secondaries.
+
 ### Destinations
 
 1. **Skip equality deletes for CDC inserts post-backfill -** <br/> Equality deletes are now skipped for CDC inserts once the backfill→CDC overlap window is complete, reducing unnecessary write overhead. A new `dedup_inserts` flag on the Iceberg `olake_2pc` table property tracks this — Java sets it to `true` on backfill commit, and Go clears it to `false` after the first successful CDC commit. This applies to both the Arrow and legacy gRPC writers.
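
The `dedup_inserts` lifecycle described in the Destinations entry above can be sketched as a tiny state machine. This is only an illustration, not OLake's implementation: the `tableState` type, its method names, and the `OpType` constants are hypothetical, and the rule that updates and deletes always keep their equality delete is an assumption that the release note does not state.

```go
package main

import "fmt"

// OpType is a hypothetical enum for the kind of CDC change being written.
type OpType int

const (
	OpInsert OpType = iota
	OpUpdate
	OpDelete
)

// tableState is a hypothetical stand-in for the dedup_inserts flag kept
// in the Iceberg olake_2pc table property.
type tableState struct {
	DedupInserts bool
}

// OnBackfillCommit models the backfill commit, which sets the flag to true.
func (t *tableState) OnBackfillCommit() { t.DedupInserts = true }

// OnCDCCommit models the first successful CDC commit, which clears the flag.
func (t *tableState) OnCDCCommit() { t.DedupInserts = false }

// NeedsEqualityDelete reports whether a change should emit an equality delete.
// Inserts need one only while the backfill/CDC overlap window is still open.
// Assumption: updates and deletes always need one.
func (t *tableState) NeedsEqualityDelete(op OpType) bool {
	if op == OpInsert {
		return t.DedupInserts
	}
	return true
}

func main() {
	state := &tableState{}
	state.OnBackfillCommit()
	fmt.Println("insert during overlap window:", state.NeedsEqualityDelete(OpInsert))
	state.OnCDCCommit()
	fmt.Println("insert after first CDC commit:", state.NeedsEqualityDelete(OpInsert))
	fmt.Println("update after first CDC commit:", state.NeedsEqualityDelete(OpUpdate))
}
```

Keeping the flag on the table rather than in process memory is what lets two different writers (Java at backfill time, Go at CDC time) agree on whether the overlap window has closed.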
@@ -29,4 +31,8 @@ April 21, 2026 – May 03, 2026
 
 4. **PostgreSQL primary key discovery fix via pg_catalog -** <br/> `information_schema.key_column_usage` incorrectly included foreign key columns as primary keys, causing wrong `_olake_id` hashes, missed equality deletes, and duplicate rows in Iceberg on CDC upserts. Replaced with a `pg_catalog`-based query that returns only true primary keys and works correctly for read-only roles on managed databases like RDS, Supabase, and Render.
 
-5. **MySQL CDC charset corruption fix for non-UTF8 columns -** <br/> ENUM and string columns using non-UTF8 charsets (`utf16`, `ucs2`, `latin1`) were silently corrupted during CDC due to blind `[]byte` → `string` casts. Fixed by adding collation-aware decoding using `TableMapEvent.CollationMap()` and `EnumSetCollationMap()`.
+5. **MySQL CDC charset corruption fix for non-UTF8 columns -** <br/> ENUM and string columns using non-UTF8 charsets (`utf16`, `ucs2`, `latin1`) were silently corrupted during CDC due to blind `[]byte` → `string` casts. Fixed by adding collation-aware decoding using `TableMapEvent.CollationMap()` and `EnumSetCollationMap()`.
+
+6. **MongoDB primary key pinning for deterministic deduplication -** <br/> Previously, all indexed fields were treated as primary keys, so updates to non-unique indexed fields changed the `_olake_id` and broke Iceberg equality deletes, creating duplicate rows. The primary key is now pinned strictly to MongoDB’s guaranteed-unique `_id`, ensuring stable hashes and correct deduplicated upserts.
+
+7. **DB2 driver download fix in integration tests -** <br/> DB2 integration tests now reuse the already-installed `clidriver` by copying it into the workspace, so Docker containers find it locally instead of repeatedly hitting the flaky IBM CDN download path.