Skip to content

Support fine-tuning Qwen3.5 models in qwen-vl-finetune #2092

@yossibiton

Description

@yossibiton

I'm trying to fine-tune Qwen3.5 models (e.g. Qwen/Qwen3.5-9B) using qwen-vl-finetune.

Model loading works via AutoModelForImageTextToText.from_pretrained() (resolves to Qwen3_5ForConditionalGeneration). However, the data processor computes position IDs using get_rope_index_3 when model_type="qwen3vl".

However, Qwen3.5 has a different architecture (Gated DeltaNet + partial RoPE with partial_rotary_factor=0.25) compared to Qwen3-VL's 3D mRoPE with temporal/spatial sections. Is get_rope_index_3 correct for Qwen3.5, or does it need a dedicated position ID function?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions