RFC: Introduce io-uring for localfile storage #554

# Motivation

According to the paper [io_uring: Rethinking Asynchronous I/O for Storage Systems](https://arxiv.org/pdf/2512.04859v1), io_uring can significantly improve CPU utilization and I/O latency for storage systems, especially those with a buffer manager.

Based on this observation, Riffle introduces an io_uring-based I/O handler (UringIO) to optimize local shuffle storage performance.

# Prerequisites
1. Linux kernel version >= 5.10
2. Currently verified on Anolis OS 8
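
The kernel requirement above could be checked at startup before UringIO is selected. The following is a minimal sketch, not Riffle's actual code: the function names `parse_kernel_version` and `meets_uring_prerequisite` are hypothetical, and it assumes the release string has the usual `major.minor...` shape reported by `uname -r`.

```rust
use std::fs;

/// Parse the leading "major.minor" from a kernel release string such as
/// "5.10.134-16.an8.x86_64". Returns None if the string has another shape.
/// (Hypothetical helper, not part of Riffle.)
fn parse_kernel_version(release: &str) -> Option<(u32, u32)> {
    // Split on every non-digit character, so "5.10.134-16" yields "5", "10", ...
    let mut parts = release.split(|c: char| !c.is_ascii_digit());
    let major: u32 = parts.next()?.parse().ok()?;
    let minor: u32 = parts.next()?.parse().ok()?;
    Some((major, minor))
}

/// True when the release string denotes Linux 5.10 or newer.
/// (Hypothetical helper, not part of Riffle.)
fn meets_uring_prerequisite(release: &str) -> bool {
    match parse_kernel_version(release) {
        // Tuples compare lexicographically: major first, then minor.
        Some(version) => version >= (5, 10),
        None => false,
    }
}

fn main() {
    // On Linux this file holds the same string that `uname -r` prints.
    let release = fs::read_to_string("/proc/sys/kernel/osrelease").unwrap_or_default();
    let release = release.trim();

    if meets_uring_prerequisite(release) {
        println!("kernel {release}: io_uring prerequisite met");
    } else {
        println!("kernel {release:?}: prerequisite not met, keep the non-io_uring path");
    }
}
```

A version check like this only covers the first prerequisite. Whether io_uring is actually usable can still depend on the distribution and on runtime restrictions, so it does not replace testing on the target OS.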

# Conclusions
1. With io_uring enabled and 16 threads per disk (still under tuning), CPU load is approximately 3× lower than with the non-io_uring implementation.
   **Update: Further tuning shows that 2 threads per disk are sufficient and achieve comparable overall throughput.**
2. After enabling io_uring:

> 1. Write throughput reaches 5 GB/s, compared to 3.75 GB/s without io_uring (~33% improvement).
> 2. Read throughput shows no significant improvement. This is expected, since io_uring provides limited benefit for this access pattern:
>    - Read requests are not intensive.
>    - Each read operation transfers relatively large data blocks.
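
The size of the write-throughput gain depends on which figure is used as the reference. As a small worked check using only the two numbers quoted above (the function names are illustrative), the gain relative to the 3.75 GB/s baseline and the shortfall of the baseline relative to the 5 GB/s io_uring figure can be computed as:

```rust
/// Percentage gain of `improved` over `baseline`, relative to `baseline`.
fn relative_gain_percent(baseline: f64, improved: f64) -> f64 {
    (improved - baseline) / baseline * 100.0
}

/// Percentage by which `baseline` falls short of `improved`, relative to `improved`.
fn relative_shortfall_percent(baseline: f64, improved: f64) -> f64 {
    (improved - baseline) / improved * 100.0
}

fn main() {
    // Write throughput figures from the conclusions, in GB/s.
    let baseline_gbps = 3.75; // without io_uring
    let uring_gbps = 5.0; // with io_uring

    println!(
        "gain over baseline: {:.1}%",
        relative_gain_percent(baseline_gbps, uring_gbps)
    );
    println!(
        "baseline shortfall vs io_uring: {:.1}%",
        relative_shortfall_percent(baseline_gbps, uring_gbps)
    );
}
```

Measured against the non-io_uring baseline, the gain is about one third. The 25% figure corresponds to the opposite reference point, the io_uring result.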


# Benchmark

The benchmark runs a 2.2 TB TeraSort application, with a single Riffle server handling all shuffle data.

## Performance Results
| Type             | Write Time | Read Time |
|------------------|------------|-----------|
| With io_uring    | 3.4 min    | 6.3 min   |
| Without io_uring | 4.1 min    | 8.3 min   |

## CPU load comparison

<img width="1280" height="541" alt="Image" src="https://github.com/user-attachments/assets/79c414b5-6b89-444e-8eb3-4178767828fe" />


## Subtasks
- [x] #549 
- [x] #551 
- [x] #553 
- [x] #560
