Skip to content

Commit c41e583

Browse files
Mehul BatraMehul Batra
authored andcommitted
compress image size
1 parent 5d45de8 commit c41e583

File tree

7 files changed

+3
-3
lines changed

7 files changed

+3
-3
lines changed

website/blog/2025-12-02-fluss-x-iceberg-why-your-lakehouse-is-not-streamhouse-yet.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ Apache Fluss represents a new architectural approach: the **Streamhouse** for re
1212

1313
After working on Fluss–Iceberg lakehouse integration and deploying this architecture at a massive scale, including Alibaba's 3 PB production deployment processing 40 GB/s, we're ready to share the architectural lessons learned. Specifically, why existing systems fall short, how Fluss and Iceberg naturally complement each other, and what this means for finally building true real-time lakehouses.
1414

15-
![Banner](assets/fluss-x-iceberg/fluss-lakehouse-streaming.png)
15+
![Banner](assets/fluss-x-iceberg/fluss-lakehouse-streaming_comp.png)
1616

1717
<!-- truncate -->
1818

@@ -44,7 +44,7 @@ Four converging forces are driving the need for sub-second data infrastructure:
4444

4545
Yet critical use cases demand sub-second to second-level latency: search and recommendation systems with real-time personalization, advertisement attribution tracking, anomaly detection for fraud and security monitoring, operational intelligence for manufacturing/logistics/ride-sharing, and Gen AI model inference requiring up-to-the-second features. The industry needs a **hot real-time layer** sitting in front of the lakehouse.
4646

47-
![Evolution Timeline](assets/fluss-x-iceberg/evolution.png)
47+
![Evolution Timeline](assets/fluss-x-iceberg/evolution_comp.png)
4848
## What is Fluss × Iceberg?
4949

5050
### The Core Concept: Hot/Cold Unified Storage
@@ -60,7 +60,7 @@ Think of your data as having two thermal zones:
6060

6161
Traditional architectures force you to maintain **separate systems** for these zones: Kafka/Kinesis for streaming (hot), Iceberg for analytics (cold), complex ETL pipelines to move data between them, and applications writing to both systems (dual-write problem).
6262

63-
![Kappa vs Lambda Architecture](assets/fluss-x-iceberg/kappa-vs-lambda.png)
63+
![Kappa vs Lambda Architecture](assets/fluss-x-iceberg/kappa-vs-lambda_comp.png)
6464

6565
**Fluss × Iceberg unifies these as tiered storage with Kappa architecture:** Applications write once to Fluss. A stateless Tiering Service (Flink job) automatically moves data from hot to cold storage based on configured freshness (e.g., 30 seconds, 5 minutes). Query engines see a single table that seamlessly spans both tiers—eliminating the dual-write complexity of Lambda architecture.
6666

-5.89 MB
Binary file not shown.
928 KB
Loading
-1000 KB
Binary file not shown.
231 KB
Loading
-4.02 MB
Binary file not shown.
640 KB
Loading

0 commit comments

Comments
 (0)