
[Bug] Inconsistencies in Kontext Dev Support and Performance Between INT4 and FP8 Versions #322

@philipy1219


Checklist

Describe the Bug

We tested the latest release for Kontext Dev support and found issues in both single-reference and multi-reference configurations. In both cases, the INT4 output differs noticeably from the FP8 output.

Specifically, the INT4 version runs at a constant iteration speed of approximately 6 it/s regardless of how many reference images are supplied, which matches baseline text-to-image performance. With Kontext, however, each additional reference image typically slows iteration by a factor of around 1.7×. This slowdown is clearly present in the FP8 version but absent from the current INT4 implementation.
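To make the expectation concrete, here is a rough sketch of the iteration speed INT4 would show if it followed the same scaling as FP8. It assumes the ~1.7× factor compounds for each reference image, which is our reading of the FP8 behavior rather than a measured model; `expected_speed` is a hypothetical helper, not part of Nunchaku.

```python
def expected_speed(baseline_it_s: float, slowdown_per_ref: float, num_refs: int) -> float:
    """Expected it/s if each reference image divides throughput by the same factor.

    Assumption: the per-reference slowdown compounds multiplicatively.
    """
    if num_refs < 0:
        raise ValueError("num_refs must be non-negative")
    return baseline_it_s / (slowdown_per_ref ** num_refs)


BASELINE_IT_S = 6.0     # INT4 text-to-image speed reported in this issue
SLOWDOWN_PER_REF = 1.7  # approximate per-reference factor observed with FP8

for n in range(3):
    speed = expected_speed(BASELINE_IT_S, SLOWDOWN_PER_REF, n)
    print(f"{n} reference image(s): ~{speed:.2f} it/s expected")
```

Under that assumption, one reference image should bring INT4 down to roughly 3.5 it/s and two to roughly 2.1 it/s. A flat 6 it/s suggests the reference latents are not being processed at all.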

Furthermore, the main Nunchaku package remains at version 0.3.1, whereas comfyui-nunchaku has already been updated to version 0.3.3. Kontext models support referenceLatent injection, a feature introduced in recent diffusers updates, so we suspect the discrepancies arise because the mainline Nunchaku package has not yet been updated accordingly.

We would appreciate any clarification on this, and on whether an update to the main Nunchaku package is planned to support the newer features.

Environment

Torch 2.6, nunchaku 0.3.1, comfyui-nunchaku 0.3.3
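For anyone trying to confirm they are on the same versions, the following sketch prints the installed package versions using only the standard library. The distribution names `torch` and `nunchaku` are assumed to match what pip installed. comfyui-nunchaku is usually installed as a ComfyUI custom node rather than a pip package, so it may not be discoverable this way and its version may need to be read from the node's own folder.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(names):
    """Return {distribution name: version string}, or 'not installed' if missing."""
    report = {}
    for name in names:
        try:
            report[name] = version(name)
        except PackageNotFoundError:
            report[name] = "not installed"
    return report


if __name__ == "__main__":
    # Distribution names are assumptions; adjust them to match your environment.
    for name, ver in installed_versions(["torch", "nunchaku"]).items():
        print(f"{name}: {ver}")
```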

Reproduction Steps

Single reference

Workflow:

kontext-dev ComfyUI single reference.json

Input Image:

Image

Output Image:

Image

Multiple reference

Workflow:

kontext-dev ComfyUI multi reference.json

Input Image:

Image
Image

Output Image:

Image

FP8 Output:

Image
