Skip to content

Add Granite Speech speculative decoding evaluation#129

Open
gsaon wants to merge 1 commit intohuggingface:mainfrom
gsaon:main
Open

Add Granite Speech speculative decoding evaluation#129
gsaon wants to merge 1 commit intohuggingface:mainfrom
gsaon:main

Conversation

@gsaon
Copy link
Contributor

@gsaon gsaon commented Mar 7, 2026

Adds evaluation scripts for cascade speculative decoding with Granite Speech models.

Summary

  • run_eval_speculative.py: Implements encoder draft → entropy gating → LLM verification → AR fallback pipeline
  • run_speculative.sh: Runs evaluations across ESB datasets with granite-4.0-1b-speech

Tested with Pytorch 2.7 and transformers 4.57.6 on 1 A100 80GB

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant