Commit c2b24d0

added go v0.7.3 (#423)
1 parent 4b303bd commit c2b24d0

1 file changed

Lines changed: 10 additions & 4 deletions

docs/release/ingestion/v0.7.0.mdx

@@ -1,9 +1,9 @@
 ---
-title: "OLake Go (v0.7.0 - v0.7.2)"
+title: "OLake Go (v0.7.0 - v0.7.3)"
 ---

-# OLake Go (v0.7.0 - v0.7.2)
-April 21, 2026 – May 03, 2026
+# OLake Go (v0.7.0 - v0.7.3)
+April 21, 2026 – May 15, 2026

 ## 🎯 What's New

@@ -15,6 +15,8 @@ April 21, 2026 – May 03, 2026

 3. **Schema filtering for PostgreSQL discovery -** <br/> Added an optional schemas config field to restrict the discover operation to user-specified PostgreSQL schemas. When omitted, existing behaviour is preserved and all non-system schemas are discovered.

+4. **MSSQL read replica support -** <br/> Added optional `jdbc_url_params` to the MSSQL source so you can target Always On read replicas (for example with read-intent), and updated CDC to use replica-safe paths that avoid primary-only agent/msdb and capture-instance management on secondaries.
+
 ### Destinations

 1. **Skip equality deletes for CDC inserts post-backfill -** <br/> Equality deletes are now skipped for CDC inserts once the backfill→CDC overlap window is complete, reducing unnecessary write overhead. A new `dedup_inserts` flag on the Iceberg `olake_2pc` table property tracks this — Java sets it to `true` on backfill commit, and Go clears it to `false` after the first successful CDC commit. This applies to both the Arrow and legacy gRPC writers.
@@ -29,4 +31,8 @@ April 21, 2026 – May 03, 2026

 4. **PostgreSQL primary key discovery fix via pg_catalog -** <br/> `information_schema.key_column_usage` incorrectly included foreign key columns as primary keys, causing wrong `_olake_id` hashes, missed equality deletes, and duplicate rows in Iceberg on CDC upserts. Replaced with a `pg_catalog`-based query that returns only true primary keys and works correctly for read-only roles on managed databases like RDS, Supabase, and Render.

-5. **MySQL CDC charset corruption fix for non-UTF8 columns -** <br/> ENUM and string columns using non-UTF8 charsets (`utf16`, `ucs2`, `latin1`) were silently corrupted during CDC due to blind `[]byte` to `string` casts. Fixed by adding collation-aware decoding using `TableMapEvent.CollationMap()` and `EnumSetCollationMap()`.
+5. **MySQL CDC charset corruption fix for non-UTF8 columns -** <br/> ENUM and string columns using non-UTF8 charsets (`utf16`, `ucs2`, `latin1`) were silently corrupted during CDC due to blind `[]byte` to `string` casts. Fixed by adding collation-aware decoding using `TableMapEvent.CollationMap()` and `EnumSetCollationMap()`.
+
+6. **MongoDB primary key pinning for deterministic deduplication -** <br/> Previously, all indexed fields were treated as primary keys, so updates to non-unique indexed fields changed the `_olake_id` and broke Iceberg equality deletes, creating duplicate rows. The primary key is now pinned strictly to MongoDB’s guaranteed-unique `_id`, ensuring stable hashes and correct deduplicated upserts.
+
+7. **DB2 driver download fix in integration tests -** <br/> DB2 integration tests now reuse the already-installed `clidriver` by copying it into the workspace, so Docker containers find it locally instead of repeatedly hitting the flaky IBM CDN download path.