Fix Qwen3Renderer stripping out last <think>, and Qwen3DisableThinkingRenderer not adding <think>\n\n</think> #178
base: main
Conversation
… format

The previous implementation only added `<think>\n` to assistant messages via the parent class, but the official Qwen3-8B tokenizer format requires the complete empty thinking block: `<think>\n\n</think>\n\n`.

This commit fixes the issue by:
1. Overriding render_message() to prepend the complete empty thinking block to assistant messages that don't already have one
2. Delegating to the parent class for all rendering logic
3. Adding test cases to verify the fix matches official tokenizer behavior

Fixes: thinking-machines-lab#176
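For reference, the official format can be inspected directly (a minimal sketch, assuming the transformers library and access to the Qwen/Qwen3-8B tokenizer; `enable_thinking` is forwarded to the Qwen3 chat template):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-8B")
messages = [{"role": "user", "content": "What is 2+2?"}]
# With thinking disabled, the prompt ends with the complete empty
# thinking block "<think>\n\n</think>\n\n" after the assistant header.
prompt = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=False,
)
print(prompt)
```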
```python
self.strip_thinking_from_history
and message["role"] == "assistant"
and "</think>" in ac_content
and not is_last
```
Fix for Qwen3Renderer: don't strip the content of the last think block.

Issue:

```
2-TURN CONVERSATION - THINKING SHOULD BE PRESERVED:
================================================================================
TINKER:
<|im_start|>user
What is 2+2?<|im_end|>
<|im_start|>assistant
The answer is 4.<|im_end|>

HUGGINGFACE:
<|im_start|>user
What is 2+2?<|im_end|>
<|im_start|>assistant
<think>    <---- HuggingFace tokenizer template preserves <think>
Let me calculate this.
</think>
The answer is 4.<|im_end|>
```
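The fix in the diff above adds `and not is_last` to the stripping condition. A minimal sketch of the resulting logic (hypothetical helper; `ac_content` and `is_last` follow the diff's names):

```python
def maybe_strip_thinking(ac_content: str, role: str, is_last: bool,
                         strip_thinking_from_history: bool) -> str:
    if (
        strip_thinking_from_history
        and role == "assistant"
        and "</think>" in ac_content
        and not is_last  # the fix: never strip the final assistant turn
    ):
        # Drop everything up to and including the closing tag, matching
        # the HuggingFace template, which only strips earlier turns.
        ac_content = ac_content.split("</think>", 1)[1].lstrip("\n")
    return ac_content
```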
| if "<think>" not in content: | ||
| message = message.copy() | ||
| message["content"] = "<think>\n\n</think>\n\n" + content | ||
| return super().render_message(idx, message, is_last=is_last) |
For Qwen3DisableThinkingRenderer: it did not add `<think>\n\n</think>\n\n` during SFT.

```
================================================================================
BUG: Official tinker-cookbook Qwen3DisableThinkingRenderer
================================================================================
Actual output from renderer:
<|im_start|>user
What is 2+2?<|im_end|>
<|im_start|>assistant
<think>    <---- Missing the \n\n</think> tokens
The answer is 4.<|im_end|>
================================================================================
Expected output from Qwen3-8B tokenizer:
================================================================================
<|im_start|>user
What is 2+2?<|im_end|>
<|im_start|>assistant
<think>

</think>

The answer is 4.<|im_end|>
```
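A self-contained sketch of the override described in the commit message (a sketch only; it assumes the parent Qwen3Renderer and the render_message(idx, message, is_last) signature shown in the diff):

```python
class Qwen3DisableThinkingRenderer(Qwen3Renderer):
    def render_message(self, idx: int, message: dict, is_last: bool = False):
        content = message["content"]
        # Prepend the complete empty thinking block so SFT targets match
        # the official Qwen3-8B tokenizer output.
        if message["role"] == "assistant" and "<think>" not in content:
            message = message.copy()  # avoid mutating the caller's message
            message["content"] = "<think>\n\n</think>\n\n" + content
        # Delegate all actual rendering to the parent class.
        return super().render_message(idx, message, is_last=is_last)
```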
Thanks for looking into this -- I agree that there's a bug. Could you check (and add a test) that …
```python
elif message["role"] == "assistant" and "<think>" not in ac_content and is_last:
    # Matching the paper, we force the assistant to start with <think>. Some SFT datasets
    # include <think> in the assistant messages, so we don't need to re-add it in those cases.
    ob_str += "<think>\n"
```
The renderer test caught another bug: in multiturn conversations, `<think>` was added to the intermediate assistant turns as well.

```
Cookbook string: <|im_start|>user
Hello, how are you?<|im_end|>
<|im_start|>assistant
<think>    <--- THIS SHOULDN'T GET ADDED
I'm fine, thank you!<|im_end|>
<|im_start|>user
What is the capital of France?<|im_end|>
<|im_start|>assistant
<think>
```
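A toy loop showing the effect of the `is_last` guard (hypothetical standalone script; `ob_str`/`ac_content` follow the diff's names): without `and is_last`, the intermediate assistant turn would also get the `<think>` prefix.

```python
messages = [
    {"role": "user", "content": "Hello, how are you?"},
    {"role": "assistant", "content": "I'm fine, thank you!"},
    {"role": "user", "content": "What is the capital of France?"},
]

rendered = []
for idx, message in enumerate(messages):
    is_last = idx == len(messages) - 1
    ac_content = message["content"]
    ob_str = f"<|im_start|>{message['role']}\n"
    # Only the final message may be forced to start with <think>.
    if message["role"] == "assistant" and "<think>" not in ac_content and is_last:
        ob_str += "<think>\n"
    rendered.append(ob_str + ac_content + "<|im_end|>\n")
print("".join(rendered))
```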
```python
# XXX this causes inefficiency in RL, because the observations don't grow by appending to the end.
# Maybe we should just insert this empty thinking block in every message?
prefill = "<think>\n\n</think>\n\n" + (prefill or "")
return super().build_generation_prompt(messages, role, prefill)
```
Oops, the new generation test caught a bug: we still need to prefill like this.
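A hedged sanity check for the restored prefill (renderer and tokenizer are assumed to be constructed elsewhere, e.g. a Qwen3DisableThinkingRenderer and the Qwen/Qwen3-8B tokenizer; `to_ints()` is assumed to flatten the prompt to token ids):

```python
def check_generation_prefill(renderer, tokenizer) -> None:
    prompt = renderer.build_generation_prompt(
        [{"role": "user", "content": "What is 2+2?"}]
    )
    decoded = tokenizer.decode(prompt.to_ints())
    # With thinking disabled, the generation prompt must end with the
    # complete empty thinking block.
    assert decoded.endswith("<|im_start|>assistant\n<think>\n\n</think>\n\n")
```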
| f"HF tokens: {hf_tokens}\n" | ||
| f"HF string: {tokenizer.decode(hf_tokens)}" | ||
| ) | ||
|
|
Added a test for generation; a sketch of what it could look like is below.
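A hedged sketch of such a generation parity test (helper names are hypothetical; it assumes the Qwen/Qwen3-8B tokenizer and that `build_generation_prompt` returns an object with `to_ints()`):

```python
from transformers import AutoTokenizer

def test_generation_prompt_matches_hf(renderer) -> None:
    tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-8B")
    messages = [{"role": "user", "content": "What is 2+2?"}]
    # The HF template with thinking disabled is the reference behavior.
    hf_tokens = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, enable_thinking=False
    )
    our_tokens = renderer.build_generation_prompt(messages).to_ints()
    assert our_tokens == hf_tokens, (
        f"HF tokens: {hf_tokens}\n"
        f"HF string: {tokenizer.decode(hf_tokens)}"
    )
```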
Suggested fix for #177 and #176