[Bug] Inconsistencies in Kontext Dev Support and Performance Between INT4 and FP8 Versions

### Checklist

- [x] 1. I have searched for related issues and FAQs (https://github.com/mit-han-lab/nunchaku/blob/main/docs/faq.md) but was unable to find a solution.
- [x] 2. The issue persists in the latest version.
- [x] 3. Please note that without environment information and a minimal reproducible example, it will be difficult for us to reproduce and address the issue, which may delay our response.
- [x] 4. If your report is a question rather than a bug, please submit it as a discussion at https://github.com/mit-han-lab/ComfyUI-nunchaku/discussions/new/choose. Otherwise, this issue will be closed.
- [x] 5. I will do my best to describe the issue in English.

### Describe the Bug

We have conducted tests on the latest release with respect to Kontext Dev support and identified several issues in both single-reference and multi-reference configurations. Compared to the FP8 version, the output results differ noticeably.

Specifically, we observed that the INT4 version maintains a constant iteration speed of approximately 6 it/s, which matches the baseline text-to-image performance. However, in the case of Kontext, each additional reference image typically introduces a performance drop by a factor of around 1.7×, a behavior that is clearly present in the FP8 version but absent in the current INT4 implementation.

Furthermore, we noted that the main Nunchaku package remains at version 0.3.1, whereas comfyui-nunchaku has already been updated to version 0.3.3. Given that Kontext models support referenceLatent injection—a feature introduced in recent diffusers updates—we suspect that the root cause of the observed discrepancies may lie in the fact that the mainline Nunchaku package has not yet been updated accordingly.

We would appreciate any clarification on this, and whether an update to the main Nunchaku version is planned to align with the newer features.

### Environment

Torch 2.6, nunchaku 0.3.1, comfyui-nunchaku 0.3.3

### Reproduction Steps

### Single reference
### Workflow: 
[kontext-dev ComfyUI single reference.json](https://github.com/user-attachments/files/20972485/kontext-dev.ComfyUI.single.reference.json)
### Input Image:
![Image](https://github.com/user-attachments/assets/c8897d21-0382-4815-bf29-f688ff7ba825)
### Output Image:
![Image](https://github.com/user-attachments/assets/813267d6-13fe-431a-9d27-2060728cb627)
### Multiple reference
### Workflow:
[kontext-dev ComfyUI multi reference.json](https://github.com/user-attachments/files/20972486/kontext-dev.ComfyUI.multi.reference.json)
### Input Image:
![Image](https://github.com/user-attachments/assets/956f9f20-ec57-4890-85cb-bccb6ed37352)
![Image](https://github.com/user-attachments/assets/fb4bf180-9c47-4346-afa3-160e7642350f)
### Output Image:
![Image](https://github.com/user-attachments/assets/f9c4fa7d-32c0-4688-ad34-e76aa028e26e)
### FP8 Output:
![Image](https://github.com/user-attachments/assets/e75ef022-eb18-4fc3-ab81-fe3ae9737cea)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug] Inconsistencies in Kontext Dev Support and Performance Between INT4 and FP8 Versions #322

Checklist

Describe the Bug

Environment

Reproduction Steps

Single reference

Workflow:

Input Image:

Output Image:

Multiple reference

Workflow:

Input Image:

Output Image:

FP8 Output:

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Bug] Inconsistencies in Kontext Dev Support and Performance Between INT4 and FP8 Versions #322

Description

Checklist

Describe the Bug

Environment

Reproduction Steps

Single reference

Workflow:

Input Image:

Output Image:

Multiple reference

Workflow:

Input Image:

Output Image:

FP8 Output:

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions