-
Notifications
You must be signed in to change notification settings - Fork 48
Pull requests: nod-ai/shark-ai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Bump IREE requirement pins to 3.5.0rc20250523
#1510
opened May 23, 2025 by
shark-pr-automator
bot
Loading…
Handle more datatypes gracefully in the dump_gguf tool
#1508
opened May 22, 2025 by
KyleHerndon
Loading…
[sharktank] Fix module patching and safetensors comparison tool
#1506
opened May 22, 2025 by
sogartar
Loading…
[sharktank] Add mixture of experts(moe) support for Llama 4 model
#1491
opened May 21, 2025 by
vivekkhandelwal1
Loading…
[Shortfin][LLM] Add initial support for disaggregated invocations
#1463
opened May 16, 2025 by
vinayakdsci
•
Draft
Fuse transposition into first indexing for kv cache read
#1456
opened May 16, 2025 by
rsuderman
Loading…
[WIP] Enable execution on multiple HIP streams on one physical device
#1412
opened May 8, 2025 by
vinayakdsci
•
Draft
Update llama_serving.md doc to reflect the latest changes
#1386
opened May 5, 2025 by
pravg-amd
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.