Commit 1f7bb8f
fix: streaming crash with --no-thinking (enable_thinking kwarg leak)
stream_chat() passed enable_thinking through **kwargs to
stream_generate() → MLXLanguageModel.stream_generate() which doesn't
accept it, causing TypeError on every streaming request. Now popped
from kwargs before passing downstream, matching the non-streaming path
which already did this correctly.
Also fixed MLLM stream_chat path for the same issue.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent d6d0c50 commit 1f7bb8f
1 file changed
+3
-1
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
505 | 505 | | |
506 | 506 | | |
507 | 507 | | |
| 508 | + | |
| 509 | + | |
508 | 510 | | |
509 | 511 | | |
510 | 512 | | |
| |||
540 | 542 | | |
541 | 543 | | |
542 | 544 | | |
543 | | - | |
| 545 | + | |
544 | 546 | | |
545 | 547 | | |
546 | 548 | | |
| |||
0 commit comments