
feat: support for gpt-oss in interactive chat#79

Merged
madclaws merged 1 commit into main from harmony-support on Jan 29, 2026

Conversation

@madclaws
Member

fix: refactor

@coderabbitai

coderabbitai bot commented Jan 29, 2026

📝 Walkthrough

Walkthrough

The pull request extends the chat workflow in the Rust runtime to support memory-mode-aware behavior throughout the response handling pipeline. Changes update function signatures to accept RunArgs, implement conditional response parsing and output logic based on memory mode settings, add streaming delta tracking for selective rendering, and modify default modelfile resolution. The Python change is purely formatting with no behavioral impact.
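As a hedged sketch of the signature change described above (field names, the output strings, and the `chat` shape here are assumptions for illustration, not the repo's actual code), threading `RunArgs` through the pipeline might look like:

```rust
// Hypothetical sketch: chat() accepts RunArgs so downstream rendering and
// parsing can branch on memory mode. Names are assumptions, not mlx.rs code.
#[derive(Debug, Clone)]
struct RunArgs {
    memory: bool,      // memory mode toggles reply parsing and rendering
    modelfile: String, // default resolution now points at a gpt-oss modelfile
}

fn chat(input: &str, run_args: &RunArgs) -> String {
    // Conditional output logic: in memory mode the full response is emitted;
    // otherwise only the final answer portion is surfaced.
    if run_args.memory {
        format!("[full] {input}")
    } else {
        format!("[answer-only] {input}")
    }
}

fn main() {
    let args = RunArgs { memory: false, modelfile: "gpt-oss".into() };
    println!("{}", chat("hello", &args)); // prints "[answer-only] hello"
}
```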

Sequence Diagram

```mermaid
sequenceDiagram
    participant User
    participant ChatFn as chat()
    participant Streaming as Streaming Handler
    participant Conversion as convert_to_chat_response()
    participant Extraction as extract_reply()

    User->>ChatFn: Input + RunArgs
    activate ChatFn
    ChatFn->>Streaming: Stream response with memory_mode
    activate Streaming
    Streaming->>Streaming: Track is_answer_start flag
    alt memory_mode enabled
        Streaming->>User: Print full response
    else memory_mode disabled
        Streaming->>User: Print dimmed deltas until marker
        Streaming->>User: Print normal deltas after marker
    end
    Streaming-->>ChatFn: Final content
    deactivate Streaming
    ChatFn->>Conversion: Content + memory_mode
    activate Conversion
    Conversion->>Extraction: Content + memory_mode
    activate Extraction
    alt memory_mode enabled
        Extraction->>Extraction: Extract reply tag content
    else memory_mode disabled
        Extraction->>Extraction: Extract final answer (marker-based)
    end
    Extraction-->>Conversion: Parsed reply
    deactivate Extraction
    Conversion-->>ChatFn: ChatResponse
    deactivate Conversion
    ChatFn-->>User: ChatResponse
    deactivate ChatFn
```
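The two extraction branches in the diagram could be sketched as follows. This is a hedged illustration only: the function name mirrors the diagram's `extract_reply()`, but its signature, the `<reply>` tag, and the `**[Answer]**` marker string are assumptions, not the repo's actual implementation.

```rust
// Sketch of memory-mode-aware reply extraction (hypothetical strings/signature).
fn extract_reply(content: &str, memory_mode: bool) -> String {
    if memory_mode {
        // Memory mode: pull the text between assumed <reply>...</reply> tags.
        let (start_tag, end_tag) = ("<reply>", "</reply>");
        if let Some(start) = content.find(start_tag) {
            let rest = &content[start + start_tag.len()..];
            if let Some(end) = rest.find(end_tag) {
                return rest[..end].trim().to_string();
            }
        }
        content.trim().to_string()
    } else {
        // Non-memory mode: take everything after the answer marker,
        // falling back to the whole content if the marker never appears.
        match content.find("**[Answer]**") {
            Some(pos) => content[pos + "**[Answer]**".len()..].trim().to_string(),
            None => content.trim().to_string(),
        }
    }
}

fn main() {
    println!("{}", extract_reply("<reply>hello</reply>", true));      // prints "hello"
    println!("{}", extract_reply("thinking **[Answer]** 42", false)); // prints "42"
}
```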

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes


🚥 Pre-merge checks | ✅ 1 passed | ❌ 2 failed

❌ Failed checks (1 warning, 1 inconclusive)

| Check name | Status | Explanation | Resolution |
| --- | --- | --- | --- |
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 0.00%, below the required threshold of 80.00%. | Write docstrings for the functions missing them to satisfy the coverage threshold. |
| Description check | ❓ Inconclusive | The description 'fix: refactor' is vague and generic, using non-descriptive terms that don't convey meaningful information about the specific changes in the changeset. | Expand the description to explain the refactoring intent, such as: 'Refactor chat workflow to support gpt-oss model by adding memory_mode parameter handling and updating default modelfile resolution.' |

✅ Passed checks (1 passed)

| Check name | Status | Explanation |
| --- | --- | --- |
| Title check | ✅ Passed | The title 'feat: support for gpt-oss in interactive chat' directly aligns with the main change in the changeset, which involves updating default modelfile resolution to use gpt-oss instead of mem-agent. |

✏️ Tip: You can configure your own custom pre-merge checks in the settings.


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.


Comment @coderabbitai help to get the list of available commands and usage tips.

@madclaws madclaws linked an issue Jan 29, 2026 that may be closed by this pull request

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
tiles/src/runtime/mlx.rs (1)

487-533: Answer‑marker detection can miss when the marker is split across stream chunks.

SSE deltas can split **[Answer]** across chunks, so checking only the current delta can leave is_answer_start false and keep all output dimmed. Consider detecting on the accumulated buffer (or a small rolling window) to handle chunk boundaries.

🛠️ Suggested fix

```diff
-                if !run_args.memory && delta.contains("**[Answer]**") {
-                    is_answer_start = true;
-                }
+                if !run_args.memory && !is_answer_start && accumulated.contains("**[Answer]**") {
+                    is_answer_start = true;
+                }
```
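The boundary problem the reviewer describes can be demonstrated in isolation. Below is a minimal sketch (the helper name `marker_seen` and the chunk contents are assumptions for illustration): checking only the current delta never fires when `**[Answer]**` straddles two SSE chunks, whereas checking the accumulated buffer does.

```rust
/// Returns true once `marker` has appeared anywhere in the concatenated
/// stream so far, even when it is split across chunk boundaries.
/// (Hypothetical helper; the real code flips an `is_answer_start` flag.)
fn marker_seen(chunks: &[&str], marker: &str) -> bool {
    let mut accumulated = String::new();
    for chunk in chunks {
        accumulated.push_str(chunk);
        // Checking the accumulated text catches boundary-spanning markers
        // that `chunk.contains(marker)` alone would miss.
        if accumulated.contains(marker) {
            return true;
        }
    }
    false
}

fn main() {
    // "**[Answer]**" is split across the second and third deltas here:
    let chunks = ["reasoning...", "**[Ans", "wer]** final reply"];
    println!("{}", marker_seen(&chunks, "**[Answer]**")); // prints "true"
}
```

A per-delta check over the same chunks would return false for every chunk, which is exactly the dimmed-forever failure mode the comment warns about.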

@madclaws madclaws merged commit 0a23e73 into main Jan 29, 2026
2 checks passed
@madclaws madclaws deleted the harmony-support branch January 29, 2026 10:46


Development

Successfully merging this pull request may close these issues.

Integrate gpt-oss as the default model in Alpha
