Pull requests: quic/efficient-transformers
Pull requests list
revert(export): Revert proxy-only ONNX transform gating and restore default export behavior
1.21.0
#912
opened Apr 10, 2026 by
vbaddi
Contributor
feat: Enable benchmark-mode module inventory/export across all CausalLM architectures
enhancement
New feature or request
#906
opened Apr 3, 2026 by
vbaddi
Contributor
feat: Named graph specializations in specializations.json (Prefill/Decode/Vision/Encoder/Embedding)
enhancement
New feature or request
#904
opened Apr 3, 2026 by
vbaddi
Contributor
[CI: NIGHTLY] Three-way Execution full layer, few layer, and dummy
#903
opened Apr 2, 2026 by
abukhoy
Contributor
Merge ft_experimental_v1 branch to main
fine-tuning
ready for review
#887
opened Mar 25, 2026 by
quic-akuruvil
Contributor
Undo deepstack_features based changes for Qwen3VL and Qwen3VL_MOE models
#869
opened Mar 18, 2026 by
quic-dhirajku
Contributor
Draft
MLA : update attention in fused_forward, head blocking and add prefillonly transform
#857
opened Mar 16, 2026 by
quic-mamta
Contributor
Added fp16/bf16 based export and compile support for VLMs
#819
opened Mar 2, 2026 by
asmigosw
Contributor
FT-CI enabled on torch-qaic-env
fine-tuning
#799
opened Feb 18, 2026 by
quic-akuruvil
Contributor
Add support for num_crops and valid_size from vLLM
#796
opened Feb 17, 2026 by
quic-vargupt
Contributor