docs: unlist lead/worker blog posts instead of deleting, revert casing churn
Restore deleted blog posts and mark them as unlisted with a danger
admonition pointing to Planning Mode. Revert unrelated Goose→goose
casing changes to keep the PR focused on lead/worker removal.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Angie Jones <jones.angie@gmail.com>
title: "Treating LLMs Like Tools in a Toolbox: A Multi-Model Approach to Smarter AI Agents"
description: How Goose uses multiple LLMs within a single task, optimizing for speed, cost, and reliability in AI agent workflows
unlisted: true
authors:
- mic
- angie
---

:::danger Outdated
Lead/Worker mode has been removed from goose. It has been replaced by [Planning Mode](/docs/guides/creating-plans), which uses a dedicated planner model with the `/plan` command. See the [multi-model guide](/docs/guides/multi-model/) for current workflows.
:::

Not every task needs a genius. And not every step should cost a fortune.

That's something we've learned while scaling Goose, our open source AI agent. The same model that's great at unpacking a planning request might totally fumble a basic shell command, or worse - it might burn through your token budget doing it.

So we asked ourselves: what if we could mix and match models in a single session?

Not just switching based on user commands, but building Goose with an actual system for routing tasks between different models, each playing to their strengths.

This is the gap the lead/worker model is designed to fill.

<!-- truncate -->

## The Problem with Single-Model Sessions

Originally, every Goose session used a single model from start to finish. That worked fine for short tasks, but longer sessions were harder to tune:

* Go too cheap, and the model might miss nuance or break tools.
* Go too premium, and your cost graph starts looking like a ski slope.

There was no built-in way to adapt on the fly.

We saw this tension in real usage where agents would start strong, then stall out when the model struggled to follow through. Sometimes users would manually switch models mid-session. But that's not scalable, and definitely not agent-like.

## Designing the Lead/Worker System

The core idea is simple:

* Start the session with a lead model that's strong at reasoning and planning.
* After a few back and forths between you and the model (what we call "turns"), hand off to a worker model that's faster and cheaper, but still capable.
* If the worker gets stuck, Goose can detect the failure and temporarily bring the lead back in.

You can configure how many turns the lead handles upfront (`GOOSE_LEAD_TURNS`), how many consecutive failures trigger fallback (`GOOSE_LEAD_FAILURE_THRESHOLD`), and how long the fallback lasts before Goose retries the worker.
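
Pulled together, that configuration looks something like this in a shell profile. This is a sketch with illustrative values: the model names and numbers are assumptions, not recommendations, and the fallback-duration setting is omitted since its variable name isn't shown here.

```shell
# Illustrative lead/worker tuning -- adjust models and numbers to taste.
export GOOSE_LEAD_MODEL="gpt-4o"        # lead: handles the planning-heavy early turns
export GOOSE_MODEL="claude-4-sonnet"    # worker: handles routine execution
export GOOSE_LEAD_TURNS=3               # lead runs the first 3 turns of the session
export GOOSE_LEAD_FAILURE_THRESHOLD=2   # 2 consecutive failures bring the lead back
```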

This gives you a flexible, resilient setup where each model gets used where it shines.

One of the trickiest parts of this feature was defining what failure looks like.

We didn't want Goose to swap models just because an API timed out. Instead, we focused on real task failures:

* Tool execution errors
* Syntax mistakes in generated code
* File not found or permission errors
* User corrections like "that's wrong" or "try again"

Goose tracks these signals and knows when to escalate. And once the fallback model stabilizes things, it switches back without missing a beat.
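
To make the escalation rule concrete, here's a toy sketch of that logic (not Goose's actual implementation): a consecutive-failure counter that escalates to the lead once a threshold is hit, and resets back to the worker on the next success.

```shell
# Toy model of the fallback rule -- a sketch, not Goose's real code.
FAILURE_THRESHOLD=2   # assumed value, standing in for GOOSE_LEAD_FAILURE_THRESHOLD
failures=0
model="worker"

record_result() {
  # $1 is "ok" or "fail" for the worker's latest turn
  if [ "$1" = "fail" ]; then
    failures=$((failures + 1))
    # enough consecutive failures: bring the lead back in
    if [ "$failures" -ge "$FAILURE_THRESHOLD" ]; then
      model="lead"
    fi
  else
    # a success resets the streak and hands control back to the worker
    failures=0
    model="worker"
  fi
}
```

The key design choice mirrored here is that only *consecutive* failures count, so one transient hiccup never triggers a swap.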

## The Value of Multi-Model Design

We've found that this multi-model design unlocks new workflows:

* **Cross-provider setups** (Claude for planning, OpenAI for execution)
* **Lower-friction defaults** for teams worried about LLM spend

It also opens the door for even smarter routing in the future with things like switching based on tasks, ensemble voting, or maybe even letting Goose decide which model to call based on tool context.
## Try It Out

Lead/worker mode is already available in Goose. To enable, export these variables with two models that have already been configured in Goose:

```bash
export GOOSE_LEAD_MODEL="gpt-4o"
export GOOSE_MODEL="claude-4-sonnet"
```

From there, Goose takes care of the handoff, the fallback, and the recovery. You just... keep vibing.

If you're curious how it all works under the hood, we've got a [full tutorial](/docs/tutorials/lead-worker).

---

If you're experimenting with multi-model setups, [share what's working and what
<meta property="og:title" content="Treating LLMs Like Tools in a Toolbox: A Multi-Model Approach to Smarter AI Agents" />
<meta property="og:description" content="How Goose uses multiple LLMs within a single task, optimizing for speed, cost, and reliability in AI agent workflows" />
<meta name="twitter:title" content="Treating LLMs Like Tools in a Toolbox: A Multi-Model Approach to Smarter AI Agents" />
<meta name="twitter:description" content="How Goose uses multiple LLMs within a single task, optimizing for speed, cost, and reliability in AI agent workflows" />
description: Dive into Goose's Lead/Worker model where one LLM plans while another executes - a game-changing approach to AI collaboration that can save costs and boost efficiency.
unlisted: true
authors:
- ebony
---

:::danger Outdated
Lead/Worker mode has been removed from goose. It has been replaced by [Planning Mode](/docs/guides/creating-plans), which uses a dedicated planner model with the `/plan` command. See the [multi-model guide](/docs/guides/multi-model/) for current workflows.
:::

Ever wondered what happens when you let two AI models work together like a tag team? That’s exactly what we tested in our latest livestream—putting Goose’s Lead/Worker model to work on a real project. Spoiler: it’s actually pretty great.

The Lead/Worker model is one of those features that sounds simple on paper but delivers some amazing benefits in practice. Think of it like having a project manager and a developer working in perfect harmony - one does the strategic thinking, the other gets their hands dirty with the actual implementation.

<!-- truncate -->

## What's This Lead/Worker Thing All About?

Instead of asking one LLM to do everything, Lead/Worker lets you split the load. Your lead model takes care of the thinking, decision-making, and big-picture planning, while your worker model focuses on execution: writing code, running commands, and making the plan happen. The magic is in the balance: you can put a more powerful (and sometimes more expensive) model in the lead and let a faster, more cost-effective one handle the heavy lifting.

Popular model pairings people are loving:

- GPT-4 + Claude Sonnet – Balanced intelligence and efficiency.
- Claude Opus + GPT-3.5 – Creative planning with quick execution.
- GPT-4o + Local models – Privacy-focused builds where data stays in-house.

## Why You'll Love This Setup

- 💰 **Cost Optimization**
  Use cheaper models for execution while keeping the premium models for strategic planning. Your wallet will thank you.

- ⚡ **Speed Boost**
  Get solid plans from capable models, then let optimized execution models fly through the implementation.

- 🔄 **Mix and Match Providers**
  This is where it gets really cool - you can use Claude for reasoning and OpenAI for execution, or any combination that works for your workflow.

- 🏃‍♂️ **Handle Long Dev Sessions**
  Perfect for those marathon coding sessions where you need sustained performance without breaking the bank.

## [Setting It Up](/docs/tutorials/lead-worker#configuration)

Getting started with the Lead/Worker model is surprisingly straightforward. In the Goose desktop app, you just need to:

1. **Enable the feature** - Look for the enable button in your settings
2. **Choose your lead model** - Pick something powerful for planning (like GPT-4)
3. **Select your worker model** - Go with something efficient for execution (like Claude Sonnet)
4. **Configure the behavior** - Set how many turns the worker gets before consulting the lead

The default settings work great for most people, but you can customize things like:

- **Number of turns**: How many attempts the worker model gets before pulling in the lead
- **Failure handling**: What happens when things don't go as planned
- **Fallback behavior**: How the system recovers from issues
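
If you'd rather work from the CLI than the desktop app, the same choices map onto environment variables. This is a sketch: the model names are placeholders and the turn count is an illustrative value.

```shell
# CLI-side sketch of the desktop setup above; values are placeholders.
export GOOSE_LEAD_MODEL="gpt-4o"        # lead model for planning
export GOOSE_MODEL="claude-4-sonnet"    # worker model for execution
export GOOSE_LEAD_TURNS=3               # lead turns before the hand-off
```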

## Real-World Magic in Action

During our [livestream](https://www.youtube.com/embed/IbBDBv9Chvg), we tackled a real project: adding install buttons to the MCP servers documentation page. What made this interesting wasn't just the end result, but watching how the two models collaborated.

The lead model would analyze the requirements, understand the existing codebase structure, and create a plan. Then the worker model would jump in and start implementing, making the actual code changes and handling the technical details.

### The Project: Documentation Enhancement

We wanted to add install buttons to our MCP server cards, similar to what we already had on our extensions page. We needed to figure out how to add this functionality without breaking existing workflows.

Here's what the Lead/Worker model helped us accomplish:

- **Analyzed the existing documentation structure**
- **Identified the best approach** (creating a custom page vs. modifying existing ones)
- **Implemented the solution** with proper routing and styling
- **Handled edge cases** like maintaining tutorial links while adding install functionality

## The Developer Experience

One thing that really stood out was how natural the interaction felt. You're not constantly switching contexts or managing different tools. You just describe what you want, and the system figures out the best way to divide the work.

The lead model acts as your strategic partner, while the worker model becomes your implementation buddy. It's like pair programming, but with AI models that never get tired or need coffee breaks.

## Pro Tips from Our Session

### Start with Good Goose Hints
We always recommend setting up your [goosehints](/docs/guides/context-engineering/using-goosehints) to give context about your project. It saves you from re-explaining the same things over and over.

### Don't Micromanage
Let the lead model do its planning thing. Sometimes the best results come from giving high-level direction and letting the system figure out the details.

### Use Git for Safety
Always work in a branch when experimenting. The models are smart, but having that safety net means you can be more adventurous with your requests.

### Visual Feedback Helps
While the desktop UI doesn't show the model switching as clearly as the CLI does, you can still follow along by expanding the tool outputs to see what's happening under the hood.

## The Results Speak for Themselves

By the end of our session, we had:

- ✅ Successfully added install buttons to our MCP server documentation
- ✅ Maintained all existing functionality (tutorial links still worked)
- ✅ Improved the user experience with better visual hierarchy
- ✅ Organized content into logical sections (community vs. built-in servers)

The best part? The models made smart decisions we hadn't even thought of, like automatically categorizing the servers and improving the overall page layout.

## Ready to Try It Yourself?

The [Lead/Worker model](/docs/tutorials/lead-worker) is available now in Goose. Whether you're working on documentation, building features, or tackling complex refactoring, having two specialized models working together can be a game changer.

Want to see it in action? Check out the full stream where we built this feature live:

<iframe class="aspect-ratio" width="560" height="315" src="https://www.youtube.com/embed/IbBDBv9Chvg" title="LLM Tag Team: Who Plans, Who Executes?" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>

Got questions or want to share your own Lead/Worker success stories? Join us in our [Discord community](https://discord.gg/goose-oss) - we'd love to hear what you're building!

<head>
<meta property="og:title" content="LLM Tag Team: Who Plans, Who Executes?" />
<meta property="og:description" content="Dive into Goose's Lead/Worker model where one LLM plans while another executes - a game-changing approach to AI collaboration that can save costs and boost efficiency." />
<meta name="twitter:title" content="LLM Tag Team: Who Plans, Who Executes?" />
<meta name="twitter:description" content="Dive into Goose's Lead/Worker model where one LLM plans while another executes - a game-changing approach to AI collaboration that can save costs and boost efficiency." />
</head>