Pull requests: waybarrios/vllm-mlx

Pull requests list

feat: MTP per-request routing in BatchedEngine
#223 opened Mar 24, 2026 by Thump604
2 of 3 tasks
cli: expose harmony and gpt-oss tool parsers
#216 opened Mar 24, 2026 by krystophny
tokenizer: return successful mlx-lm load result
#215 opened Mar 24, 2026 by krystophny
server: add OpenAI-compatible /v1/responses endpoint
#214 opened Mar 24, 2026 by krystophny
Fix MLLM cache stats in /v1/status
#193 opened Mar 21, 2026 by janhilgard
4 tasks
fix: parse tool calls in streaming reasoning branch
#177 opened Mar 18, 2026 by Thump604
fix: MLLM continuous batching for hybrid models
#165 opened Mar 16, 2026 by Thump604