From 1129fe222b1d266e4dde24d1aab0cf32304ad199 Mon Sep 17 00:00:00 2001 From: Carmen Kwan Date: Thu, 1 Aug 2024 15:05:33 +0200 Subject: [PATCH 1/3] Fix typo in protocol --- PROTOCOL.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/PROTOCOL.md b/PROTOCOL.md index 3bee82b3c..6194a660b 100644 --- a/PROTOCOL.md +++ b/PROTOCOL.md @@ -2082,7 +2082,7 @@ The API supports a start parameter and and an end parameter. The start/end param You can specify a version as a Long or a timestamp as a string in the [Timestamp Format](#timestamp-format). The change data feed represents row-level changes between versions of a Delta table. It records change data for UPDATE, DELETE, and MERGE operations. If you leverage the connectors provided by this library to read change data feed, it results in three metadata columns that identify the type of change event, in addition to the data columns: -- _change_type (type: String): There are four values: insert, update_preimage, update_postimage, delete. preimage is the value before the udpate, postimage is the value after the update. +- _change_type (type: String): There are four values: insert, update_preimage, update_postimage, delete. preimage is the value before the update, postimage is the value after the update. - _commit_version (type: Long): The table version containing the change. - _commit_timestamp (type: Long): The unix timestamp associated when the commit of the change was created, in milliseconds. From f640468a301047840e111a76c976dd495ed425ff Mon Sep 17 00:00:00 2001 From: Carmen Kwan Date: Thu, 1 Aug 2024 15:21:37 +0200 Subject: [PATCH 2/3] Fix typo Delta --- PROTOCOL.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/PROTOCOL.md b/PROTOCOL.md index 6194a660b..95daebb88 100644 --- a/PROTOCOL.md +++ b/PROTOCOL.md @@ -2172,7 +2172,7 @@ The response contains multiple lines: - Historical [Metadata](#metadata) will be returned if includeHistoricalMetadata is set to true. - The ordering of the lines doesn't matter. -When `responseformat=delta`, a sequence of JSON strings delimited by newline. Each line is a JSON object defined in [API Response Format in Parquet](#api-response-format-in-delta). +When `responseformat=delta`, a sequence of JSON strings delimited by newline. Each line is a JSON object defined in [API Response Format in Delta](#api-response-format-in-delta). - The first line is [a JSON wrapper object](#json-wrapper-object-in-each-line-in-delta) containing the delta [Protocol](#protocol-in-delta-format) object. - The second line is [a JSON wrapper object](#json-wrapper-object-in-each-line-in-delta) containing the delta [Metadata](#metadata-in-delta-format) object. - The rest of the lines are [JSON wrapper objects](#json-wrapper-object-in-each-line) for [Files](#file-in-delta-format) of the change data feed. From 347f59a8f949a92d556a90e99df05c5d9c70b44d Mon Sep 17 00:00:00 2001 From: Carmen Kwan Date: Thu, 1 Aug 2024 15:47:26 +0200 Subject: [PATCH 3/3] fix formatting + pluralize Deletion Vectors --- PROTOCOL.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/PROTOCOL.md b/PROTOCOL.md index 95daebb88..cfaa93afa 100644 --- a/PROTOCOL.md +++ b/PROTOCOL.md @@ -2460,11 +2460,11 @@ responses to be compatible with legacy servers. ### responseFormat Indicates the format to expect in the [API Response Format in Parquet](#api-response-format-in-parquet), two values are supported. -- parquet: Represents the format of the delta sharing protocol that has been used in `delta-sharing-spark` 1.0 +- **parquet**: Represents the format of the delta sharing protocol that has been used in `delta-sharing-spark` 1.0 and less, also the default format if `responseFormat` is missing from the header. All the existing delta sharing connectors are able to process data in this format. - **delta**: format can be used to read a shared delta table with minReaderVersion > 1, which contains -readerFeatures such as Deletion Vector or Column Mapping. `delta-sharing-spark` libraries +readerFeatures such as Deletion Vectors or Column Mapping. `delta-sharing-spark` libraries that are able to process `responseformat=delta` will be released soon. ### readerFeatures