OLake-Fusion

OLake-Fusion is a lakehouse table management system for Apache Iceberg.
It helps teams run faster queries, lower storage cost, and operate Iceberg at scale with less effort.

Why OLake-Fusion

Apache Iceberg is powerful in production, but its day-2 operations can be expensive and complex. OLake-Fusion adds an operational layer on top of Iceberg so your team can focus on data products instead of maintenance jobs.

With OLake-Fusion, you can:

Keep query performance stable with continuous self-optimization.
Reduce storage and compute waste from small-file and metadata overhead.
Manage tables consistently across different catalogs and environments.
Build lake-native data platforms that are decoupled from the underlying infrastructure and unify stream and batch processing.

Architecture

Fusion (Management Service): Handles table lifecycle operations such as self-optimization and data expiration, and provides a unified catalog interface across engines.
Spark Optimizer: Runs optimization tasks that improve file layout and maintain read efficiency.
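
To make "improving file layout" concrete, here is a minimal sketch of how a compaction planner might group small data files into rewrite tasks. It is an illustration only, not OLake-Fusion's actual planning logic: the names `DataFile` and `plan_compaction`, the `small_ratio` threshold, and the first-fit-decreasing packing strategy are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class DataFile:
    """Hypothetical stand-in for a data file tracked by a table."""
    path: str
    size_bytes: int


def plan_compaction(
    files: List[DataFile],
    target_bytes: int,
    small_ratio: float = 0.75,  # assumed cutoff: files below 75% of target count as "small"
) -> List[List[DataFile]]:
    """Group small files into rewrite tasks of at most target_bytes each."""
    threshold = target_bytes * small_ratio
    # Largest first, so big candidates claim a group before the small ones fill gaps.
    small = sorted(
        (f for f in files if f.size_bytes < threshold),
        key=lambda f: f.size_bytes,
        reverse=True,
    )

    groups: List[List[DataFile]] = []
    for f in small:
        # First-fit: put the file into the first group that still has room.
        for group in groups:
            if sum(x.size_bytes for x in group) + f.size_bytes <= target_bytes:
                group.append(f)
                break
        else:
            groups.append([f])

    # Rewriting a single file on its own does not reduce the file count.
    return [g for g in groups if len(g) > 1]


if __name__ == "__main__":
    mb = 1024 * 1024
    files = [
        DataFile("a.parquet", 10 * mb),
        DataFile("b.parquet", 20 * mb),
        DataFile("c.parquet", 30 * mb),
        DataFile("d.parquet", 120 * mb),  # already near target size, left alone
    ]
    for task in plan_compaction(files, target_bytes=128 * mb):
        print([f.path for f in task])
```

Each returned group would become one rewrite task for an optimizer worker. Files that are already close to the target size are skipped, which limits the work to files that actually contribute to small-file overhead.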

Key Features

Self-Optimizing Tables: Automatically compacts files and organizes data to keep read latency low.
Multi-Catalog Support: Works with catalogs such as Glue, JDBC, and REST-based catalogs.
Infrastructure Independent: Deploy on private cloud, public cloud, hybrid cloud, or multi-cloud.
Lakehouse Ready: Designed for modern analytics workloads on open table formats.
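
For context on what the Glue, JDBC, and REST catalogs look like from the engine side, the sketch below builds the Spark properties that Apache Iceberg itself uses to register each catalog type. These are Iceberg's Spark catalog settings, not OLake-Fusion's own configuration format, which this README does not describe. The helper name `spark_catalog_conf` and every catalog name, URI, and warehouse path in the usage section are placeholders.

```python
from typing import Dict, Optional

# Iceberg's Spark catalog plugin and the catalog implementation classes it can delegate to.
ICEBERG_SPARK_CATALOG = "org.apache.iceberg.spark.SparkCatalog"
CATALOG_IMPLS = {
    "glue": "org.apache.iceberg.aws.glue.GlueCatalog",
    "jdbc": "org.apache.iceberg.jdbc.JdbcCatalog",
    "rest": "org.apache.iceberg.rest.RESTCatalog",
}


def spark_catalog_conf(
    name: str,
    kind: str,
    warehouse: str,
    uri: Optional[str] = None,
) -> Dict[str, str]:
    """Return Spark properties that register one Iceberg catalog (hypothetical helper)."""
    if kind not in CATALOG_IMPLS:
        raise ValueError(f"unsupported catalog kind: {kind!r}")
    if kind in ("jdbc", "rest") and not uri:
        raise ValueError(f"{kind} catalogs need a uri")

    prefix = f"spark.sql.catalog.{name}"
    conf = {
        prefix: ICEBERG_SPARK_CATALOG,
        f"{prefix}.catalog-impl": CATALOG_IMPLS[kind],
        f"{prefix}.warehouse": warehouse,
    }
    if uri:
        conf[f"{prefix}.uri"] = uri
    return conf


if __name__ == "__main__":
    # Placeholder names, buckets, and endpoints; replace with your own.
    examples = [
        spark_catalog_conf("glue_cat", "glue", "s3://example-bucket/warehouse"),
        spark_catalog_conf("jdbc_cat", "jdbc", "s3://example-bucket/warehouse",
                           uri="jdbc:postgresql://db.example.internal:5432/iceberg"),
        spark_catalog_conf("rest_cat", "rest", "s3://example-bucket/warehouse",
                           uri="https://catalog.example.internal/api"),
    ]
    for conf in examples:
        for key, value in conf.items():
            print(f"{key}={value}")
```

The three catalogs differ only in the implementation class and in whether a `uri` is required, which is what makes it practical to manage tables the same way across catalog types.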

Benchmark Highlights

Up to 2x faster than vanilla Spark compaction in benchmark scenarios.
Around 5% better query performance in tested workloads.

Read the full benchmark details: Compaction Benchmark

Quick Start

Start with the first end-to-end setup guide:

Configure Your First Compaction

Community

Join us on Slack
Ask questions and report issues via GitHub Issues
Follow docs and updates at olake.io/docs

Contributing

Contributions of all sizes are welcome.

Core project: CONTRIBUTING.md
UI project: OLake UI Repository
Docs and website: OLake Docs Repository
Contributor rewards: Bounty Program
