Hello everyone,
As promised, last month we delivered multiple-LoRA and ControlNet-Union-Pro support along with faster generation, and we expanded support to 20-series GPUs. We understand some of you may still have run into issues; rest assured, we're actively working on refining the codebase for better stability, compatibility, and user experience.
This roadmap outlines our key development goals for April 2025. The next release is scheduled for mid-May. As always, we welcome your contributions and feedback!
April Focus Areas
- Simplify the deepcompressor backend to reduce quantization costs.
- More comprehensive control support.
- Address memory-related issues to improve stability.
Quantization
- Simplify the deepcompressor backend to ease its use and reduce quantization cost (@synxlin, Can't other models be selected for the main model? #31)
- Add customized model quantization support in ComfyUI-nunchaku (@lmxyy)
- Improve fidelity of the 4-bit T5 text encoder (@Aprilhuu)
LoRA
- Add FLUX-turbo LoRA support with the FLUX-fill base model (@lmxyy, Can't use lora with Nunchaku Flux Fill workflow #46)
- Support additional LoRA formats (@lmxyy, Cannot use LoRAs trained with OneTrainer #64, special_lora_error nunchaku#265)
- Fix LoRA combination bugs (@lmxyy, [Bug] Flux FP4 doesn't handle multiple LoRA #71)
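For context, the multi-LoRA composition targeted above looks roughly like the standard diffusers PEFT flow sketched below. The LoRA repo IDs and adapter names are placeholders, and nunchaku's quantized transformer has its own LoRA loading path, so read this as a sketch of the intended behavior rather than the final API.

```python
# Sketch of the composition we want on the quantized transformer, expressed
# with the standard diffusers PEFT API. Repo IDs and adapter names below are
# placeholders, not real checkpoints.
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
).to("cuda")

# Load two LoRAs and blend them with per-adapter weights.
pipe.load_lora_weights("your-org/flux-style-lora", adapter_name="style")    # placeholder
pipe.load_lora_weights("your-org/flux-detail-lora", adapter_name="detail")  # placeholder
pipe.set_adapters(["style", "detail"], adapter_weights=[0.8, 0.5])

image = pipe("a watercolor fox in a forest", num_inference_steps=28).images[0]
image.save("multi_lora.png")
```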
Controls
- FP8 ControlNet-Union-Pro support (@ita9naiwa, Please Support -> Shakker-Labs_FLUX.1-dev-ControlNet-Union-Pro-fp8.safetensors nunchaku#241, Does not work with the ControlNet Upscale model #37); see the usage sketch after this list
- Expand support for other ControlNet models (@ita9naiwa, Does not work with the ControlNet Upscale model #37)
- Add EasyControl support
- Add PuLID support (@bowen, Is there any way to accelerate PuLID as well? This acceleration currently doesn't seem applicable to PuLID #50, PuLID with Nunckaku nunchaku#258)
- INT4/FP4 ControlNets (Does not work with the ControlNet Upscale model #37, Any possibility to get int4 for Shakker-Labs/FLUX.1-dev-ControlNet-Union-Pro nunchaku#256)
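For reference, the sketch below shows roughly how ControlNet-Union-Pro is driven through the stock diffusers pipeline; the control-image URL is a placeholder and the `control_mode` index should be checked against the Union-Pro model card. The nunchaku-specific work tracked above is making the quantized transformer and low-precision ControlNet weights drop into this flow.

```python
# Reference plumbing for ControlNet-Union-Pro via the stock diffusers pipeline.
# The control image URL is a placeholder.
import torch
from diffusers import FluxControlNetModel, FluxControlNetPipeline
from diffusers.utils import load_image

controlnet = FluxControlNetModel.from_pretrained(
    "Shakker-Labs/FLUX.1-dev-ControlNet-Union-Pro", torch_dtype=torch.bfloat16
)
pipe = FluxControlNetPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", controlnet=controlnet, torch_dtype=torch.bfloat16
).to("cuda")

control_image = load_image("https://example.com/canny_map.png")  # placeholder
image = pipe(
    "a futuristic city at dusk",
    control_image=control_image,
    control_mode=0,                      # union-mode index, e.g. canny
    controlnet_conditioning_scale=0.7,
    num_inference_steps=28,
).images[0]
image.save("controlnet_union.png")
```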
Speed
- Implement fine-grained First-Block Cache (@Bluear7878)
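For readers unfamiliar with the technique: First-Block Cache runs only the first transformer block each denoising step and, if its residual has barely changed since the previous step, reuses the cached contribution of the remaining blocks; the fine-grained variant applies the same idea at a finer granularity. Below is a minimal, illustrative sketch of the coarse decision with hypothetical names and an illustrative threshold, not the nunchaku implementation.

```python
def step_with_first_block_cache(blocks, hidden, cache, residual_diff_threshold=0.1):
    """Illustrative (hypothetical) first-block cache step, not the nunchaku API.

    `blocks` is a list of transformer blocks mapping a tensor to a tensor;
    `cache` is a dict carried across denoising steps.
    """
    first_out = blocks[0](hidden)
    residual = first_out - hidden

    prev = cache.get("first_block_residual")
    if prev is not None:
        # Relative change of the first block's residual since the previous step.
        diff = (residual - prev).abs().mean() / (prev.abs().mean() + 1e-6)
        if diff.item() < residual_diff_threshold:
            # Barely changed: reuse the cached contribution of the remaining blocks.
            cache["first_block_residual"] = residual
            return first_out + cache["remaining_delta"]

    # Otherwise run the remaining blocks and refresh the cache.
    out = first_out
    for block in blocks[1:]:
        out = block(out)
    cache["first_block_residual"] = residual
    cache["remaining_delta"] = out - first_out
    return out
```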
Memory & Stability
- Optimize memory usage when loading T5 (@Aprilhuu)
- Clean the memory cache when deleting models (@lmxyy @sxtyzhangzk, mysterious OOM issue #65, Memory is not released and causes 100% CPU usage #57); see the cleanup sketch after this list
- Fix serialization errors (@sxtyzhangzk, C:\Users\muyang\Desktop\nunchaku-dev\src\Serialization.cpp:130? #60)
- Improve CPU offloading speed in ComfyUI (@lmxyy)
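As a stopgap for the unreleased-memory reports above, the usual manual PyTorch cleanup sequence is sketched below; the roadmap items aim to have the ComfyUI nodes do the equivalent automatically when a model is deleted. The helper name is ours, not part of nunchaku.

```python
import gc
import torch

def release_model(model):
    """Generic PyTorch cleanup: drop a model and return its memory to the allocator."""
    model.to("cpu")            # move weights off the GPU first
    del model                  # drop this reference (callers must drop theirs too)
    gc.collect()               # collect reference cycles that still hold tensors
    torch.cuda.empty_cache()   # hand cached CUDA blocks back to the driver
    torch.cuda.ipc_collect()   # release memory tied up in CUDA IPC handles
```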
Quality
- Investigate FLUX.1-fill output quality (@lmxyy)
- Resolve quality issues when combining ACE-plus with FLUX.1-fill (@lmxyy)
Installation
Other Fixes & Improvements
- Enable multiple-batch inference (@sxtyzhangzk @Bluear7878, AssertionError: assert image_rotary_emb.shape[2] == batch_size * (txt_tokens + img_tokens) nunchaku#148)
- Improve Hugging Face and ModelScope model documentation (@lmxyy)
- Fix device ID setting (@sxtyzhangzk, Can multi-GPU node chaining be supported so that multi-GPU users can select the corresponding CUDA device? #45)
- Fix `cache_dir` handling in model downloading (@lmxyy, Error during inference: diffusers.configuration_utils.ConfigMixin.load_config() got multiple values for keyword argument 'cache_dir' nunchaku#255); the failure mode is illustrated after this list
- Reset `residual_diff_threshold` in First-Block Cache (@ita9naiwa, Anyway to disable set_attention_impl and apply_cache_on_pipe nunchaku#242)
- Add autotests and deployment CI.
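The `cache_dir` error in nunchaku#255 is the classic duplicate-keyword failure: the loader passes `cache_dir` explicitly while also forwarding a kwargs dict that still contains it. A generic illustration follows; the function name is a stand-in, not a real diffusers API.

```python
# Generic illustration of the duplicate-keyword failure; load_config_like is a
# stand-in name, not a real diffusers function.
def load_config_like(name, cache_dir=None, **kwargs):
    print(name, cache_dir, kwargs)

user_kwargs = {"cache_dir": "/data/hf-cache", "revision": "main"}

# Buggy: cache_dir arrives both explicitly and inside **user_kwargs -> TypeError:
# load_config_like("model", cache_dir="/data/hf-cache", **user_kwargs)

# Fixed: pop it from the forwarded kwargs so it is passed exactly once.
cache_dir = user_kwargs.pop("cache_dir", None)
load_config_like("model", cache_dir=cache_dir, **user_kwargs)
```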
Planned future features
- Wan2.1 support.
- 8-bit model support.
- Operator modularization.