Skip to content

Commit 7301542

Browse files
committed
address comments
1 parent a772ea9 commit 7301542

File tree

1 file changed

+5
-5
lines changed

1 file changed

+5
-5
lines changed

website/blog/releases/0.7.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -48,7 +48,7 @@ These enhancements make Fluss 0.7 ready for most production use cases.
4848
In Fluss 0.5, we first introduced the Streaming Lakehouse feature, with the powerful Union Read ability, we can significantly reduce the cost of streaming storage and improve the data freshness of Lakehouse.
4949
> Note: Union Reads allows querying and combining the results from both Lakehouse (historical) and Fluss (real-time) data.
5050
51-
However, the initial implementation had architectural limitations affecting scalability and operability in production. In 0.7, we’ve completely re-architected the Streaming Lakehouse feature to address these challenges.
51+
However, the initial implementation had architectural limitations affecting scalability and operability in production. In Fluss 0.7, we’ve completely re-architected the Streaming Lakehouse feature to address these challenges.
5252

5353
### Elastic Stateless Service
5454
Previously, the lake tiering service was implemented as a Flink job encapsulating Fluss as a Source and Paimon as a Sink, storing sync offsets state in Flink’s State, this making it a stateful service.
@@ -70,7 +70,7 @@ This optimization significantly reduces the load on Fluss and boosts batch proce
7070
* **Actionable Offset Visibility:** The offset state is now queryable, providing greater insight and control over data processing.
7171

7272
This design ensures end-to-end consistency across all operations.
73-
Furthermore, this stateless design also decouples us from tight Flink dependency, paving the way for future lightweight execution models, such as running on FaaS (Function as a Service).
73+
Furthermore, this stateless design also decouples us from the tight Flink dependency, paving the way for future lightweight execution models, such as running on FaaS (Function as a Service).
7474

7575
### Pluggable Lake Format
7676
The previous implementation had a tight coupling with Apache Paimon, which restricted our ability to integrate with other lake formats, such as Iceberg.
@@ -95,13 +95,13 @@ Apart from that, we also support the following advanced partition features to ma
9595
* **Dynamic Partition Creation:** Automatically creates required partitions based on incoming data, no manual pre-creation is required.
9696
* **Automatic Partition Discovery:** Fluss source adds matched new partitions to the subscription in real-time.
9797

98-
Using real business data from **Taobao - the largest online shopping in China,** we tested the read and write performance between non-partitioned and partitioned tables (with 20 auto-created partitions). The write results show that the multi-level partition and dynamic partition creation mechanism do not have a significant impact on the write performance.
98+
Using real business data from **Taobao - the largest online shopping platform in China,** we tested the read and write performance between non-partitioned and partitioned tables (with 20 auto-created partitions). The write results show that the multi-level partition and dynamic partition creation mechanism do not have a significant impact on the write performance.
9999
![Write Perf](../assets/0.7/write_perf.jpg)
100100

101101
At the same time, under the same data scale, we tested the streaming read performance of non-partitioned tables and partitioned tables under three partition conditions:
102102
**unconditional**, **medium matching** (hitting five partitions), and **exact matching** (hitting one partition).
103-
From the results we can observe that when the partition condition only matches 1/20 of the partitions,
104-
**the network traffic is reduced by about 20 times** and the **processing time is reduced by nearly 9 times**,
103+
From the results, we can observe that when the partition condition only matches 1/20 of the partitions,
104+
**the network traffic is reduced by about 20x** and the **processing time is reduced by nearly 9x**,
105105
demonstrating the huge performance benefit of partition pruning in streaming reads.
106106

107107

0 commit comments

Comments
 (0)