Add localpod docs (#565)

phillipleblanc · peasee · web-flow · commit fc9ec71f7a44 · 2024-10-31T22:44:42.000+09:00
* Add localpod docs

* docs: Add localpod to connectors table

* docs: Re-order localpod position in table

* wip

---------

Co-authored-by: peasee &lt;98815791+peasee@users.noreply.github.com&gt;
diff --git a/spiceaidocs/docs/components/data-connectors/index.md b/spiceaidocs/docs/components/data-connectors/index.md
@@ -31,6 +31,7 @@ Currently supported Data Connectors include:
 | `ftp`, `sftp`   | FTP/SFTP      | Alpha             | Parquet, CSV                        | `append`, `full`            | ❌                             | ✅                 |
 | `graphql`       | GraphQL       | Alpha             | GraphQL                             | `append`, `full`            | ❌                             | ❌                 |
 | `http`, `https` | HTTP(s)       | Alpha             | Parquet, CSV                        | `append`, `full`            | ❌                             | ❌                 |
+| `localpod`      | Local dataset replication | Alpha |                                     | `append`, `full`            | ❌                             | ✅                 |
 | `mssql`         | MS SQL Server | Alpha             | Tabular Data Stream (TDS)           | `append`, `full`            | ❌                             | ❌                 |
 | `sharepoint`    | SharePoint    | Alpha             |                                     | `append`, `full`            | ❌                             | ✅                 |
 | `snowflake`     | Snowflake     | Alpha             | Arrow                               | `append`, `full`            | Roadmap                         | ❌                 |
diff --git a/spiceaidocs/docs/components/data-connectors/localpod.md b/spiceaidocs/docs/components/data-connectors/localpod.md
@@ -0,0 +1,39 @@
+---
+title: 'Localpod Data Connector'
+sidebar_label: 'Localpod Data Connector'
+description: 'Localpod Data Connector Documentation'
+pagination_prev: null
+---
+
+The Localpod Data Connector enables setting up a parent/child relationship between datasets in the current Spicepod. This can be used for configuring multiple/tiered accelerations for a single dataset, and ensuring that the data is only downloaded once from the remote source. For example, you can use the `localpod` connector to create a child dataset that is accelerated in-memory, while the parent dataset is accelerated to a file.
+
+The dataset created by the `localpod` connector will logically have the same data as the parent dataset.
+
+## Synchronized Refreshes
+
+The `localpod` connector supports synchronized refreshes, which ensures that the child dataset is refreshed from the same data as the parent dataset. Synchronized refreshes require that both the parent and child datasets are accelerated with `refresh_mode: full` (which is the default).
+
+When synchronization is enabled, the following logs will be emitted:
+
+```bash
+2024-10-28T15:45:24.220665Z  INFO runtime::datafusion: Localpod dataset test_local synchronizing refreshes with parent table test
+```
+
+### Examples
+
+```yaml
+datasets:
+- from: postgres:cleaned_sales_data
+  name: test
+  params:
+    ...
+  acceleration:
+    enabled: true # This dataset will be accelerated into a DuckDB file
+    engine: duckdb
+    mode: file
+    refresh_check_interval: 10s
+- from: localpod:test
+  name: test_local
+  acceleration:
+    enabled: true # This dataset accelerates the parent `test` dataset into in-memory Arrow records and is synchronized with the parent
+```