You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: website/blog/2025-07-23-hands-on-fluss-lakehouse.md
+6-6Lines changed: 6 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
---
2
2
slug: hands-on-fluss-lakehouse
3
-
title: "Hands-on Fluss Lakehouse with Paimon S3"
3
+
title: "From Stream to Lake: Hands-On with Fluss Tiering into Paimon on Minio"
4
4
authors: [gyang94]
5
5
toc_max_heading_level: 5
6
6
---
@@ -23,17 +23,17 @@ toc_max_heading_level: 5
23
23
limitations under the License.
24
24
-->
25
25
26
-
# Hands-on Fluss Lakehouse with Paimon S3
26
+
# From Stream to Lake: Hands-On with Fluss Tiering into Paimon on Minio
27
27
28
28
Fluss stores historical data in a lakehouse storage layer while keeping real-time data in the Fluss server. Its built-in tiering service continuously moves fresh events into the lakehouse, allowing various query engines to analyze both hot and cold data. The real magic happens with Fluss's union-read capability, which lets Flink jobs seamlessly query both the Fluss cluster and the lakehouse for truly integrated real-time processing.
In this hands-on tutorial, we'll walk you through setting up a local Fluss lakehouse environment, running some practical data operations, and getting first-hand experience with the complete Fluss lakehouse architecture. By the end, you'll have a working environment for experimenting with Fluss's powerful data processing capabilities.
33
33
34
-
## Integrate Paimon S3 Lakehouse
34
+
## Integrate with Paimon Minio Lakehouse
35
35
36
-
For this tutorial, we'll use **Fluss 0.7** and **Flink 1.20** to run the tiering service on a local cluster. We'll configure **Paimon** as our lake format and**S3** as the storage backend. Let's get started:
36
+
For this tutorial, we'll use **Fluss 0.7** and **Flink 1.20** to run the tiering service on a local cluster. We'll configure **Paimon** as our lake format on**Minio** as the storage backend. Let's get started:
37
37
38
38
### Minio Setup
39
39
@@ -97,7 +97,7 @@ For this tutorial, we'll use **Fluss 0.7** and **Flink 1.20** to run the tiering
97
97
datalake.paimon.s3.path.style.access: true
98
98
```
99
99
100
-
This configures Paimon as the datalake format with S3 as the warehouse.
100
+
This configures Paimon as the datalake format on Minio as the warehouse.
101
101
102
102
4. Start Fluss
103
103
@@ -366,4 +366,4 @@ Now let's dive into some actual data processing. We'll use the Flink SQL Client
366
366
367
367
## Summary
368
368
369
-
In this guide, we've explored the Fluss lakehouse architecture and set up a complete local environment with Fluss, Flink, Paimon, and S3. We've walked through practical examples of data processing that showcase how Fluss seamlessly integrates real-time and historical data. With this setup, you now have a solid foundation for experimenting with Fluss's powerful lakehouse capabilities on your own machine.
369
+
In this guide, we've explored the Fluss lakehouse architecture and set up a complete local environment with Fluss, Flink, Paimon, and Minio. We've walked through practical examples of data processing that showcase how Fluss seamlessly integrates real-time and historical data. With this setup, you now have a solid foundation for experimenting with Fluss's powerful lakehouse capabilities on your own machine.
0 commit comments