You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Add new FAQ entry pointing to data-refresh docs for incremental ingestion
- Extend CDC FAQ entry with native PostgreSQL logical replication (recommended for PG)
- Add new Features > CDC > PostgreSQL Logical Replication page documenting
wal_level=logical + pgoutput direct replication (no Debezium/Kafka)
- Update CDC overview to list native PG replication alongside Debezium
Copy file name to clipboardExpand all lines: website/docs/faq/index.md
+13-6Lines changed: 13 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -98,25 +98,32 @@ Spice uses [Apache DataFusion](https://datafusion.apache.org/) as its primary qu
98
98
99
99
## 17. Does Spice support Change Data Capture (CDC)?
100
100
101
-
Yes. Spice supports CDC via Debezium, enabling real-time data ingestion and materialization from databases such as PostgreSQL and MySQL. [Learn More](../features/cdc).
101
+
Yes. Spice supports CDC in two ways:
102
102
103
-
## 18. Can Spice integrate with existing BI tools?
103
+
-**Native PostgreSQL logical replication** (recommended for Postgres sources). Spice connects directly to the source using Postgres' `wal_level=logical` + pgoutput and streams `INSERT`/`UPDATE`/`DELETE` events into the accelerator — no Debezium, no Kafka, no external services. [Learn more](../features/cdc/postgres-replication).
104
+
-**[Debezium](../components/data-connectors/debezium)**, for sources where Debezium + Kafka is already deployed, or for non-PostgreSQL databases (MySQL, SQL Server, etc.). [Learn more](../features/cdc).
105
+
106
+
## 18. How do I keep an accelerated dataset incrementally up-to-date?
107
+
108
+
For sources with a monotonically-increasing version column (e.g. `updated_at`), Spice incrementally ingests new and modified records using [`time_column`](../reference/spicepod/datasets#time_column) + [`refresh_mode: append`](../features/data-acceleration/data-refresh#append), with [`refresh_append_overlap`](../reference/spicepod/datasets#accelerationrefresh_append_overlap) to tolerate clock skew and [`retention_period`](../reference/spicepod/datasets#accelerationretention_period) to evict old or soft-deleted records. Pair with `primary_key` + `on_conflict: upsert` to deduplicate re-read rows within the overlap window. See [Data Refresh](../features/data-acceleration/data-refresh) for configuration details and examples.
109
+
110
+
## 19. Can Spice integrate with existing BI tools?
104
111
105
112
Yes. Spice integrates with BI tools through standard SQL interfaces (ODBC, JDBC, Arrow Flight SQL), enabling accelerated, real-time analytics for dashboards and reporting. An official [Tableau Connector](../clients/tableau) is available and a [BI Acceleration](https://www.youtube.com/watch?v=blEtLgRKu0c) demo using Apache Superset.
106
113
107
-
## 19. How does Spice handle data privacy and compliance?
114
+
## 20. How does Spice handle data privacy and compliance?
108
115
109
116
Spice provides secure, auditable data access through sandboxed runtimes, secure endpoint checks, and detailed telemetry and tracing. The Spice Cloud Platform (SCP) is SOC 2 Type II compliant, meeting enterprise security and compliance requirements.
110
117
111
-
## 20. Can Spice be used for real-time analytics?
118
+
## 21. Can Spice be used for real-time analytics?
112
119
113
120
Yes. Spice accelerates data locally using Apache Arrow, Spice Cayenne (Vortex), DuckDB, SQLite, or PostgreSQL, enabling real-time analytics and sub-second query performance for data-intensive applications and dashboards.
114
121
115
-
## 21. How can developers contribute to Spice?
122
+
## 22. How can developers contribute to Spice?
116
123
117
124
Developers can contribute by submitting code, documentation, or raising issues on [GitHub](https://github.com/spiceai/spiceai). See [CONTRIBUTING.md](https://github.com/spiceai/spiceai/blob/trunk/CONTRIBUTING) for guidelines.
118
125
119
-
## 22. Does Spice support schema evolution?
126
+
## 23. Does Spice support schema evolution?
120
127
121
128
Spice infers the schema for datasets and views at startup and does not apply runtime schema changes by default. If the source schema changes while the runtime is running (for example, columns are added, removed, or their types change), data refreshes will fail with a schema mismatch error rather than silently applying the new schema. This behavior is intentional — it protects against unintentional or breaking schema changes propagating into accelerated tables.
Copy file name to clipboardExpand all lines: website/docs/features/cdc/index.md
+4-1Lines changed: 4 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -33,7 +33,10 @@ It is recommended to use CDC-accelerated datasets with persistent data accelerat
33
33
34
34
Enabling CDC by setting `refresh_mode: changes` in the acceleration settings requires support from the data connector to provide a stream of row-level changes.
35
35
36
-
Currently, the only supported data connector is [Debezium](../components/data-connectors/debezium).
36
+
Spice currently supports CDC via:
37
+
38
+
-**[PostgreSQL Logical Replication](./postgres-replication)** — **recommended** for PostgreSQL sources. Spice connects directly to the source using Postgres' native logical replication protocol (`wal_level=logical` + pgoutput) and streams `INSERT`/`UPDATE`/`DELETE` events into the accelerator. No Kafka, no Debezium, no external services.
39
+
-**[Debezium](../components/data-connectors/debezium)** — for sources where Debezium + Kafka is already deployed, or for non-PostgreSQL databases (MySQL, SQL Server, etc.).
0 commit comments