link-assistant
diff --git a/‎FREE_MODELS.md‎
Lines changed: 5 additions & 9 deletions b/‎FREE_MODELS.md‎
Lines changed: 5 additions & 9 deletions
diff --git a/‎MODELS.md‎
Lines changed: 13 additions & 15 deletions b/‎MODELS.md‎
Lines changed: 13 additions & 15 deletions
diff --git a/‎README.md‎
Lines changed: 2 additions & 2 deletions b/‎README.md‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎docs/case-studies/issue-242/README.md‎
Lines changed: 111 additions & 0 deletions b/‎docs/case-studies/issue-242/README.md‎
Lines changed: 111 additions & 0 deletions
@@ -9,7 +9,7 @@ This document lists all free AI models currently supported by the agent. Free mo
 Use any free model with the `--model` flag:
 
 ```bash
-echo "hello" | agent --model opencode/qwen3.6-plus-free
+echo "hello" | agent --model opencode/nemotron-3-super-free
 ```
 
 ## OpenCode Zen Free Models
@@ -18,19 +18,15 @@ echo "hello" | agent --model opencode/qwen3.6-plus-free
 
 | Model                   | Model ID                           | Context Window  | Description                                         |
 | ----------------------- | ---------------------------------- | --------------- | --------------------------------------------------- |
-| Qwen 3.6 Plus Free      | `opencode/qwen3.6-plus-free`      | ~1,000,000      | **Default.** Largest context, strong agent performance |
-| Nemotron 3 Super Free   | `opencode/nemotron-3-super-free`   | ~262,144        | NVIDIA hybrid Mamba-Transformer, strong reasoning   |
+| Nemotron 3 Super Free   | `opencode/nemotron-3-super-free`   | ~262,144        | **Default.** NVIDIA hybrid Mamba-Transformer, strong reasoning |
 | MiniMax M2.5 Free       | `opencode/minimax-m2.5-free`      | ~200,000        | Strong general-purpose performance                  |
 | GPT 5 Nano              | `opencode/gpt-5-nano`             | ~400,000        | Reliable OpenAI-powered free option                 |
 | Big Pickle              | `opencode/big-pickle`             | ~200,000        | Stealth model, free during evaluation period        |
 
 ### Usage Examples
 
 ```bash
-# Qwen 3.6 Plus Free (default)
-echo "hello" | agent --model opencode/qwen3.6-plus-free
-
-# Nemotron 3 Super Free
+# Nemotron 3 Super Free (default)
 echo "hello" | agent --model opencode/nemotron-3-super-free
 
 # MiniMax M2.5 Free
@@ -85,6 +81,7 @@ The following models were previously free but are no longer available:
 
 | Model              | Former Model ID               | Status                                   |
 | ------------------ | ----------------------------- | ---------------------------------------- |
+| Qwen 3.6 Plus Free | `opencode/qwen3.6-plus-free`  | Free promotion ended (April 2026) — now requires OpenCode Go subscription. See [issue #242](https://github.com/link-assistant/agent/issues/242) |
 | Kimi K2.5 Free     | `opencode/kimi-k2.5-free`     | Removed from OpenCode Zen (March 2026) — see [issue #208](https://github.com/link-assistant/agent/issues/208) |
 | Grok Code Fast 1   | `opencode/grok-code`          | Discontinued January 2026                |
 | MiniMax M2.1 Free  | `opencode/minimax-m2.1-free`  | Replaced by `opencode/minimax-m2.5-free` |
@@ -98,7 +95,7 @@ The following models were previously free but are no longer available:
 
 ### Use OpenCode Zen when:
 - You want the most tested and reliable free models
-- You prefer `qwen3.6-plus-free` as the default with ~1M context window
+- You prefer `nemotron-3-super-free` as the default with ~262K context window
 - You need a simple, curated list of models
 
 ### Use Kilo Gateway when:
@@ -110,7 +107,6 @@ The following models were previously free but are no longer available:
 
 The agent intelligently routes model requests:
 
-- `qwen3.6-plus-free` without provider prefix → OpenCode Zen (`opencode/qwen3.6-plus-free`)
 - `nemotron-3-super-free` without provider prefix → OpenCode Zen (`opencode/nemotron-3-super-free`)
 - `big-pickle` without provider prefix → OpenCode Zen (`opencode/big-pickle`)
 - `kilo/minimax-m2.5-free` explicitly → Kilo Gateway
 
@@ -30,12 +30,12 @@ Below are the prices per 1M tokens for OpenCode Zen models. Models are sorted by
 | Model                                    | Model ID                      | Input  | Output | Cached Read | Cached Write |
 | ---------------------------------------- | ----------------------------- | ------ | ------ | ----------- | ------------ |
 | **Free Models (Output: $0.00)**          |
-| Qwen 3.6 Plus Free (default)            | `opencode/qwen3.6-plus-free`  | Free   | Free   | Free        | -            |
-| Nemotron 3 Super Free                    | `opencode/nemotron-3-super-free` | Free | Free   | Free        | -            |
+| Nemotron 3 Super Free (default)          | `opencode/nemotron-3-super-free` | Free | Free   | Free        | -            |
 | MiniMax M2.5 Free                        | `opencode/minimax-m2.5-free`  | Free   | Free   | Free        | -            |
 | GPT 5 Nano                               | `opencode/gpt-5-nano`         | Free   | Free   | Free        | -            |
 | Big Pickle                               | `opencode/big-pickle`         | Free   | Free   | Free        | -            |
 | **Discontinued Free Models**             |
+| ~~Qwen 3.6 Plus Free~~                   | `opencode/qwen3.6-plus-free`  | ~~Free~~ | ~~Free~~ | ~~Free~~ | -         |
 | ~~Kimi K2.5 Free~~                       | `opencode/kimi-k2.5-free`     | ~~Free~~ | ~~Free~~ | ~~Free~~ | -         |
 | ~~Grok Code Fast 1~~                     | `opencode/grok-code`          | ~~Free~~ | ~~Free~~ | ~~Free~~ | -         |
 | ~~MiniMax M2.1 Free~~                    | `opencode/minimax-m2.1-free`  | ~~Free~~ | ~~Free~~ | ~~Free~~ | -         |
@@ -60,40 +60,38 @@ Below are the prices per 1M tokens for OpenCode Zen models. Models are sorted by
 
 ## Default Model
 
-The default model is **Qwen 3.6 Plus Free** (`opencode/qwen3.6-plus-free`), which is completely free and offers the largest context window (~1M tokens) among free models.
+The default model is **Nemotron 3 Super Free** (`opencode/nemotron-3-super-free`), which is completely free and offers strong reasoning capabilities with a ~262K token context window (NVIDIA hybrid Mamba-Transformer architecture).
 
-> **Note:** MiniMax M2.5 Free (`opencode/minimax-m2.5-free`) was previously the default free model. Qwen 3.6 Plus Free is now the default due to its superior context window and agent performance. See [issue #232](https://github.com/link-assistant/agent/issues/232).
+> **Note:** Qwen 3.6 Plus Free (`opencode/qwen3.6-plus-free`) was previously the default free model, but OpenCode Zen ended the free promotion in April 2026. The model now requires an OpenCode Go subscription. See [issue #242](https://github.com/link-assistant/agent/issues/242).
+
+> **Note:** MiniMax M2.5 Free (`opencode/minimax-m2.5-free`) was previously the default free model. See [issue #232](https://github.com/link-assistant/agent/issues/232).
 
 > **Note:** Kimi K2.5 Free (`opencode/kimi-k2.5-free`) was previously the default free model, but it was removed from the OpenCode Zen provider in March 2026. See [Case Study #208](docs/case-studies/issue-208/README.md) for details.
 
 > **Note:** Grok Code Fast 1 (`opencode/grok-code`) was previously the default free model, but xAI ended the free tier for this model on OpenCode Zen in January 2026. **The grok-code model is no longer included as a free model in OpenCode Zen subscription.** See [Case Study #133](docs/case-studies/issue-133/README.md) for details.
 
 ### Free Models (in order of recommendation)
 
-1. **Qwen 3.6 Plus Free** (`opencode/qwen3.6-plus-free`) - Default free model (~1M context, strong agent performance)
-2. **Nemotron 3 Super Free** (`opencode/nemotron-3-super-free`) - NVIDIA hybrid Mamba-Transformer (~262K context, strong reasoning)
-3. **MiniMax M2.5 Free** (`opencode/minimax-m2.5-free`) - Strong general-purpose performance (~200K context)
-4. **GPT 5 Nano** (`opencode/gpt-5-nano`) - Reliable OpenAI-powered free option (~400K context)
-5. **Big Pickle** (`opencode/big-pickle`) - Stealth model, free during evaluation (~200K context)
+1. **Nemotron 3 Super Free** (`opencode/nemotron-3-super-free`) - Default free model, NVIDIA hybrid Mamba-Transformer (~262K context, strong reasoning)
+2. **MiniMax M2.5 Free** (`opencode/minimax-m2.5-free`) - Strong general-purpose performance (~200K context)
+3. **GPT 5 Nano** (`opencode/gpt-5-nano`) - Reliable OpenAI-powered free option (~400K context)
+4. **Big Pickle** (`opencode/big-pickle`) - Stealth model, free during evaluation (~200K context)
 
-> **Note:** `opencode/kimi-k2.5-free`, `opencode/minimax-m2.1-free`, and `opencode/glm-4.7-free` are no longer available as free models on OpenCode Zen. See [OpenCode Zen Documentation](https://opencode.ai/docs/zen/) for the current list of free models.
+> **Note:** `opencode/qwen3.6-plus-free`, `opencode/kimi-k2.5-free`, `opencode/minimax-m2.1-free`, and `opencode/glm-4.7-free` are no longer available as free models on OpenCode Zen. See [OpenCode Zen Documentation](https://opencode.ai/docs/zen/) for the current list of free models.
 
 ## Usage Examples
 
 ### Using the Default Model (Free)
 
 ```bash
-# Uses opencode/qwen3.6-plus-free by default
+# Uses opencode/nemotron-3-super-free by default
 echo "hello" | agent
 ```
 
 ### Using Other Free Models
 
 ```bash
-# Qwen 3.6 Plus Free (default)
-echo "hello" | agent --model opencode/qwen3.6-plus-free
-
-# Nemotron 3 Super Free
+# Nemotron 3 Super Free (default)
 echo "hello" | agent --model opencode/nemotron-3-super-free
 
 # MiniMax M2.5 Free
 
@@ -83,7 +83,7 @@ See [rust/README.md](rust/README.md) for full documentation.
 
 We're creating a slimmed-down, public domain version of OpenCode CLI focused on the "agentic run mode" for use in virtual machines, Docker containers, and other environments where unrestricted AI agent access is acceptable. This is **not** for general desktop use - it's for isolated environments where you want maximum AI agent freedom.
 
-**OpenCode Compatibility**: We maintain 100% compatibility with OpenCode's JSON event streaming format, so tools expecting `opencode run --format json --model opencode/qwen3.6-plus-free` output will work with our agent-cli.
+**OpenCode Compatibility**: We maintain 100% compatibility with OpenCode's JSON event streaming format, so tools expecting `opencode run --format json --model opencode/nemotron-3-super-free` output will work with our agent-cli.
 
 ## Why Choose Agent Over OpenCode?
 
@@ -123,7 +123,7 @@ echo '{"message":"hi"}' | agent
 **With custom model:**
 
 ```bash
-echo "hi" | agent --model opencode/qwen3.6-plus-free
+echo "hi" | agent --model opencode/nemotron-3-super-free
 ```
 
 **Direct prompt mode:**
 
@@ -0,0 +1,111 @@
+# Case Study: Replace Deprecated qwen3.6-plus-free Default with nemotron-3-super-free
+
+**Issue:** [#242](https://github.com/link-assistant/agent/issues/242)
+**PR:** [#243](https://github.com/link-assistant/agent/pull/243)
+
+## Problem Statement
+
+OpenCode Zen ended the free promotion for `qwen3.6-plus-free` (Qwen 3.6 Plus Free). The model now requires an OpenCode Go subscription. Since `qwen3.6-plus-free` was the default model for the agent, all users without an OpenCode Go subscription experienced immediate failures when running the agent without specifying an alternative model.
+
+## Timeline of Events
+
+| Timestamp (UTC)        | Event                                                                                         |
+| ---------------------- | --------------------------------------------------------------------------------------------- |
+| April 2026 (early)     | `qwen3.6-plus-free` set as default model in [PR #234](https://github.com/link-assistant/agent/pull/234) (issue #232) |
+| ~April 9, 2026         | OpenCode Zen ends free promotion for Qwen 3.6 Plus Free                                       |
+| 2026-04-09T09:10:31Z   | Agent run fails with `ModelError`: "Free promotion has ended for Qwen3.6 Plus Free"           |
+| 2026-04-09T09:10:31Z   | Compaction cascade also fails: `ProviderModelNotFoundError` for `qwen3.6-plus-free`            |
+| 2026-04-09T09:10:31Z   | Available free models at time of failure: `big-pickle`, `gpt-5-nano`, `nemotron-3-super-free`  |
+
+## Root Cause Analysis
+
+### Primary Cause: External model deprecation
+
+OpenCode Zen removed `qwen3.6-plus-free` from the free tier. The API now returns HTTP 401 with:
+
+```json
+{
+  "type": "error",
+  "error": {
+    "type": "ModelError",
+    "message": "Free promotion has ended for Qwen3.6 Plus Free. You can continue using the model by subscribing to OpenCode Go - https://opencode.ai/go"
+  }
+}
+```
+
+### Secondary Impact: Compaction cascade failure
+
+The default compaction models cascade included `qwen3.6-plus-free` as the largest-context model:
+```
+(big-pickle nemotron-3-super-free minimax-m2.5-free gpt-5-nano qwen3.6-plus-free same)
+```
+
+When the model was removed, the cascade gracefully skipped it (`"skipping unresolvable compaction model in cascade"`), but the configuration was still referencing a non-existent model.
+
+### Tertiary Impact: Provider priority lists
+
+`qwen3.6-plus-free` was listed first in:
+- `getSmallModel()` OpenCode priority list (used for title/summary generation)
+- Global model sort priority list
+
+## Evidence from Logs
+
+Source: [solution-draft-log.txt](solution-draft-log.txt) (full log from failed run on 2026-04-09)
+
+**Key log entries:**
+
+1. **Model resolution succeeds** (line 648): Model resolves to `qwen3.6-plus-free` via default
+2. **Model not in catalog** (line 657): `"model not found - refusing to silently fallback"` — `qwen3.6-plus-free` no longer in available models list
+3. **Compaction cascade skip** (line 665): `"error": "ProviderModelNotFoundError"` for `qwen3.6-plus-free`
+4. **API error** (line 1679): `"Free promotion has ended for Qwen3.6 Plus Free"`
+5. **HTTP 401** (line 1692): `"status": 401, "statusText": "Unauthorized"`
+
+## Available Free Models (Post-Deprecation)
+
+From the log at time of failure (line 649-656):
+
+| Model                  | Provider | Context    | Status                     |
+| ---------------------- | -------- | ---------- | -------------------------- |
+| big-pickle             | opencode | ~200,000   | Available                  |
+| gpt-5-nano             | opencode | ~400,000   | Available                  |
+| nemotron-3-super-free  | opencode | ~262,144   | Available                  |
+| glm-5-free             | kilo     | ~202,752   | Available                  |
+| glm-4.5-air-free       | kilo     | ~131,072   | Available                  |
+| minimax-m2.5-free      | kilo     | ~204,800   | Available (Kilo only)      |
+| qwen3.6-plus-free      | opencode | ~1,000,000 | **Deprecated (paid only)** |
+
+## Solution
+
+### New Default Model: `opencode/nemotron-3-super-free`
+
+**Rationale:** Among remaining free OpenCode models, `nemotron-3-super-free` has the largest context window (~262K tokens) and strong reasoning capabilities (NVIDIA hybrid Mamba-Transformer architecture). While `gpt-5-nano` has a larger context (~400K), `nemotron-3-super-free` is better suited as a primary model due to its stronger general-purpose agent performance.
+
+### Changes Made
+
+1. **`js/src/cli/defaults.ts`**: Changed `DEFAULT_MODEL` from `opencode/qwen3.6-plus-free` to `opencode/nemotron-3-super-free`
+2. **`js/src/cli/defaults.ts`**: Removed `qwen3.6-plus-free` from compaction models cascade
+3. **`js/src/provider/provider.ts`**: Updated `getSmallModel()` and global sort priority lists — removed `qwen3.6-plus-free`
+4. **`js/src/cli/argv.ts`**: Updated compaction models comment
+5. **Documentation**: Moved `qwen3.6-plus-free` to deprecated/discontinued sections in FREE_MODELS.md, MODELS.md, README.md
+6. **Tests**: Updated assertions for new default model and cascade
+
+### Updated Compaction Models Cascade
+
+```
+Old: (big-pickle nemotron-3-super-free minimax-m2.5-free gpt-5-nano qwen3.6-plus-free same)
+New: (big-pickle minimax-m2.5-free nemotron-3-super-free gpt-5-nano same)
+```
+
+Note: `nemotron-3-super-free` is now the default model, so `same` at the end of the cascade effectively includes it. The cascade order remains smallest-to-largest context.
+
+## Lessons Learned
+
+1. **Free model promotions are temporary** — The agent should be resilient to model deprecation. This is the fourth time a default free model has been deprecated (grok-code → kimi-k2.5-free → minimax-m2.5-free → qwen3.6-plus-free).
+2. **Compaction cascade provides resilience** — The cascade correctly skipped the unavailable model and continued with available alternatives, preventing total compaction failure.
+3. **Verbose logging was critical** — The detailed HTTP response body logging (added in previous PRs) immediately revealed the exact error message from OpenCode Zen.
+
+## Related Issues and PRs
+
+- [Issue #232](https://github.com/link-assistant/agent/issues/232) — Original PR that set `qwen3.6-plus-free` as default
+- [Issue #208](https://github.com/link-assistant/agent/issues/208) — Previous default model deprecation (`kimi-k2.5-free`)
+- [Issue #133](https://github.com/link-assistant/agent/issues/133) — First default model deprecation (`grok-code`)