add support for draft model of eagle3 #1468
Conversation
)
return self.random_float_tensor(shape, framework=framework, dtype=float_dtype)

@register_in_tasks_manager("llamaeagle3", *["text-generation", "text-generation-with-past"], library_name="transformers")
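For context, a rough sketch of what such a registration usually looks like in optimum-intel is shown below. The class name, base class, and the extra hidden_states input are illustrative assumptions for discussion, not necessarily what this PR implements.

from optimum.exporters.onnx.model_configs import LlamaOnnxConfig
from optimum.exporters.openvino.model_configs import register_in_tasks_manager

@register_in_tasks_manager("llamaeagle3", *["text-generation", "text-generation-with-past"], library_name="transformers")
class LlamaEagle3OpenVINOConfig(LlamaOnnxConfig):
    @property
    def inputs(self):
        # Hypothetical extra input: an eagle3 draft model also consumes hidden states
        # from the target model, so the export config would have to declare it
        # (and a matching dummy input generator would be needed as well).
        common_inputs = super().inputs
        common_inputs["hidden_states"] = {0: "batch_size", 1: "sequence_length"}
        return common_inputs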
What kind of model can we convert with such an addition? I am asking because the original model has a different model type, llama3.
Can you convert only a local copy with a modified model type? I am not sure this is capable of converting the original eagle3 llama model.
Also, the implemented solution does not look scalable to other eagle3 models such as https://huggingface.co/nvidia/gpt-oss-120b-Eagle3
We have verified conversion and the GenAI pipeline locally with yuhuili/EAGLE3-LLaMA3.1-Instruct-8B and Tengyunw/qwen3_8b_eagle3, and AngelSlim/Qwen3-1.7B_eagle3 will be added to the GenAI repo tests in openvinotoolkit/openvino.genai#2740. We also checked the list of checkpoints on the EAGLE3 GitHub repo; most of them are llama type, so in theory they can be converted as-is or with a limited update. Can we merge this PR first and leave the remaining verification to follow OpenVINO base-model support progress and customer requirements?
AngelSlim/Qwen3-14B_eagle3/config.json: "model_type": "qwen3",
AngelSlim/Qwen3-a3B_eagle3/config.json: "model_type": "llama",
AngelSlim/Qwen3-32B_eagle3/config.json: "model_type": "llama",
AngelSlim/Qwen3-4B_eagle3/config.json: "model_type": "llama",
AngelSlim/Qwen3-8B_eagle3/config.json: "model_type": "llama",
AngelSlim/Qwen3-1.7B_eagle3/config.json: "model_type": "llama",
linglingdan/Eagle3_for_MiniCPM4/config.json: "model_type": "llama",
lmsys/EAGLE3-gpt-oss-120b-bf16/config.json: "model_type": "llama",
lmsys/sglang-EAGLE3-Llama-4-Scout-17B-16E-Instruct-v1/config.json: "model_type": "llama",
lmsys/Qwen3-235B-A22B-EAGLE3/config.json: "model_type": "llama",
lmsys/sglang-EAGLE3-Llama-4-Maverick-17B-128E-Instruct-v1/config.json: "model_type": "llama",
nvidia/gpt-oss-120b-Eagle3/config.json: "model_type": "llama",
nvidia/Qwen3-235B-A22B-Eagle3/config.json: "model_type": "llama",
nvidia/Llama-4-Maverick-17B-128E-Eagle3 ??,
Tengyunw/qwen3_30b_moe_eagle3/config.json: "model_type": "llama",
Tengyunw/qwen3_8b_eagle3/config.json: "model_type": "llama",
wantsleep/OLMoE_1B_7B_Eagle3/config.json: "model_type": "olmoe",
yuhuili/EAGLE3-LLaMA3.3-Instruct-70B/config.json: "model_type": "llama",
yuhuili/EAGLE3-DeepSeek-R1-Distill-LLaMA-8B/config.json: "model_type": "llama",
yuhuili/EAGLE3-LLaMA3.1-Instruct-8B/config.json: "model_type": "llama",
yuhuili/EAGLE3-Vicuna1.3-13B/config.json: "model_type": "llama",
Zjcxy-SmartAI/Eagle3-Qwen3-4B-Instruct-2507-zh/config.json: "model_type": "llama",
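For reference, a hedged example of how one of the checkpoints listed above would typically be exported through optimum-intel once this PR is merged; whether the eagle3 draft model actually goes through this exact path is what this thread is verifying.

from optimum.intel import OVModelForCausalLM

draft_id = "yuhuili/EAGLE3-LLaMA3.1-Instruct-8B"  # one of the locally verified checkpoints
ov_draft = OVModelForCausalLM.from_pretrained(draft_id, export=True)  # convert to OpenVINO IR on the fly
ov_draft.save_pretrained("eagle3-draft-ov")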
Why don't we use the original model type? Right now this relies on a different model type that appears to have been modified manually by you, and that is not how it should work. These changes should allow converting the original model. Where does the llamaeagle3 model type come from?
Does it mean that users should re-create all eagle3 models and modify their model type, etc.?
@rkazants Discussed with Fang; work is in progress to avoid modifying config.json by passing model_type="llamaeagle3" to AutoConfig.from_pretrained.
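A minimal sketch of one way that idea could look (the checkpoint path and the in-memory attribute override are illustrative assumptions; the actual mechanism in the PR may differ):

from transformers import AutoConfig

draft_path = "yuhuili/EAGLE3-LLaMA3.1-Instruct-8B"  # example draft checkpoint
config = AutoConfig.from_pretrained(draft_path)     # config.json on disk still says model_type: "llama"
config.model_type = "llamaeagle3"                   # in-memory override only, no file edit
# Export code can then dispatch on config.model_type without the user
# re-saving a modified config.json.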
Why don't we use the original model type? Because the llama modeling code in transformers can't support the eagle3 draft model; the modeling for the eagle3 draft model comes from https://github.com/SafeAILab/EAGLE/blob/main/eagle/model/cnets.py. This PR should support conversion of an eagle3 draft model that keeps model_type: "llama" in config.json.
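To make the difference concrete, here is a very rough sketch based on a reading of the EAGLE reference modeling (cnets.py), not on this PR: the draft model consumes hidden states taken from the target model and fuses them through an extra linear layer before its decoder layer, which the stock LlamaModel has no input or weights for. The dimensions and structure below are illustrative assumptions only.

import torch
import torch.nn as nn

class Eagle3DraftSketch(nn.Module):
    def __init__(self, hidden_size: int, num_target_layers_tapped: int = 3):
        super().__init__()
        # Projects concatenated target hidden states back to the model width
        # (layer count and bias setting are assumptions for illustration).
        self.fc = nn.Linear(hidden_size * num_target_layers_tapped, hidden_size, bias=False)

    def forward(self, inputs_embeds: torch.Tensor, target_hidden_states: torch.Tensor) -> torch.Tensor:
        # target_hidden_states: [batch, seq, hidden * num_target_layers_tapped]
        fused = self.fc(target_hidden_states)
        # A real draft model feeds (inputs_embeds, fused) into a modified decoder layer;
        # here their sum stands in as a placeholder.
        return inputs_embeds + fused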
needs tests
What does this PR do?
This PR adds conversion of the draft model in the eagle3 pipeline.
Before submitting