Skip to content

Pull requests: eloe/mlx-vlm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: add request cancellation and timeout support
#20 opened Apr 6, 2026 by eloe Owner Loading…
feat: add --max-context-tokens for OOM prevention
#19 opened Apr 6, 2026 by eloe Owner Loading…
feat: add JSON mode via response_format parameter
#18 opened Apr 6, 2026 by eloe Owner Loading…
feat: add logprobs support to /chat/completions
#17 opened Apr 6, 2026 by eloe Owner Loading…
feat: enforce tool_choice parameter in chat/completions
#16 opened Apr 6, 2026 by eloe Owner Loading…
feat: add stop sequences support for both endpoints
#15 opened Apr 6, 2026 by eloe Owner Loading…
feat: concurrency guard for Metal GPU serialization
#13 opened Apr 6, 2026 by eloe Owner Loading…
feat: prompt prefix caching for server endpoints
#12 opened Apr 6, 2026 by eloe Owner Loading…
ProTip! no:milestone will show everything without a milestone.