You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Update Glue and Iceberg documentation to include new parameters and permissions (#1383)
* Update Glue and Iceberg documentation to include new parameters and permissions
* Remove Hybrid Deployment option from deployment architectures documentation
* Apply suggestion from @lukekim
* Add @docusaurus/theme-mermaid dependency to package.json
* Apply suggestion from @lukekim
* Remove Mermaid docs and plugin changes from lukim/iceberg
| `glue_region` | The AWS region for the Glue Data Catalog. E.g. `us-west-2`. |
50
-
| `glue_key` | Access key (e.g. AWS_ACCESS_KEY_ID for AWS). If not provided, credentials will be loaded from environment variables or IAM roles. |
51
-
| `glue_secret` | Secret key (e.g. AWS_SECRET_ACCESS_KEY for AWS). If not provided, credentials will be loaded from environment variables or IAM roles. |
52
-
| `glue_session_token` | Session token (e.g. AWS_SESSION_TOKEN for AWS) for temporary credentials |
| `glue_region` | The AWS region for the Glue Data Catalog. E.g. `us-west-2`. |
50
+
| `glue_catalog_id` | The Glue catalog ID. For Amazon S3 Tables, use the format `<account_id>:s3tablescatalog/<table_bucket_name>`. If not provided, the default catalog for the account is used. |
51
+
| `glue_key` | Access key (e.g. AWS_ACCESS_KEY_ID for AWS). If not provided, credentials will be loaded from environment variables or IAM roles. |
52
+
| `glue_secret` | Secret key (e.g. AWS_SECRET_ACCESS_KEY for AWS). If not provided, credentials will be loaded from environment variables or IAM roles. |
53
+
| `glue_session_token` | Session token (e.g. AWS_SESSION_TOKEN for AWS) for temporary credentials |
53
54
54
55
## Authentication
55
56
@@ -124,7 +125,7 @@ The IAM role or user needs the following permissions to access Iceberg tables in
Copy file name to clipboardExpand all lines: website/docs/components/data-connectors/iceberg.md
+4-2Lines changed: 4 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -105,6 +105,7 @@ SELECT COUNT(*) FROM transactions;
105
105
| `iceberg_token` | Bearer token value to use for Authorization header. |
106
106
| `iceberg_oauth2_credential` | Credential to use for OAuth2 client credential flow when connecting to the table. Format: `<client_id>:<client_secret>`|
107
107
| `iceberg_oauth2_scope` | Scope to use for OAuth2 client credential flow when connecting to the table. Default: `catalog`|
108
+
| `iceberg_oauth2_token_url` | The URL to use for OAuth2 token endpoint. |
108
109
| `iceberg_oauth2_server_url` | URL of the OAuth2 server tokens endpoint for the client credential flow. |
109
110
| `iceberg_s3_endpoint` | S3-compatible endpoint where the Iceberg table data is stored. |
110
111
| `iceberg_s3_region` | Region of the S3-compatible endpoint. |
@@ -201,7 +202,7 @@ The IAM role or user needs the following permissions to access Iceberg tables in
| [Delta Lake](https://delta.io/) | `file_format: delta` | Stable | Open table format with ACID transactions. Object stores only. |
99
+
| [Apache Iceberg](https://iceberg.apache.org/) | `file_format: iceberg` | Beta | Open table format for large analytic datasets. Object stores only. Requires a [catalog](../catalogs/index.md). |
100
+
| Microsoft Excel | `file_format: xlsx` | Roadmap | Excel spreadsheet format |
101
+
| Markdown | `file_format: md` | Stable | Plain text with formatting (document format) |
102
+
| Text | `file_format: txt` | Stable | Plain text files (document format) |
103
+
| PDF | `file_format: pdf` | Alpha | Portable Document Format (document format) |
104
+
| Microsoft Word | `file_format: docx` | Alpha | Word document format (document format) |
104
105
105
106
### Format-Specific Parameters
106
107
@@ -112,7 +113,7 @@ File formats support additional parameters for fine-grained control. Common exam
112
113
| `csv_delimiter` | CSV | Field delimiter character (default: `,`) |
113
114
| `csv_quote` | CSV | Quote character for fields containing delimiters |
114
115
115
-
For complete format options, see [File Formats Reference](../../reference/file_format).
116
+
For complete format options, see [File Formats Reference](../reference/file_format).
Spice currently supports CSV, JSON, and Parquet data file-formats for data connectors that can read files from a file system or cloud object storage (i.e. [`s3://`](../components/data-connectors/s3), [`abfs://`](../components/data-connectors/abfs), [`file://`](../components/data-connectors/file), etc.). Support for Iceberg and other file-formats are on the roadmap.
9
+
Spice supports CSV, JSON, Parquet, Delta Lake, and Iceberg data file-formats for data connectors that can read files from a file system or cloud object storage (i.e. [`s3://`](../components/data-connectors/s3), [`abfs://`](../components/data-connectors/abfs), [`file://`](../components/data-connectors/file), etc.). Delta Lake and Iceberg are supported for object store connectors. Iceberg requires a catalog to be configured.
10
10
11
11
The parameters supported for specific file-formats are detailed on this page.
0 commit comments