Claude Tasks vs Beads. Does this change the picture? #1887
Replies: 2 comments
-
|
Practitioner perspective — these solve different problems I've been using Beads as the coordination backbone for an agentic development framework (multi-phase implementations, cross-session state, dependency tracking). From that experience, I think the comparison reveals an important distinction that's easy to miss: scheduled jobs and agentic workflows are fundamentally different things, even though both involve "an LLM doing work on your behalf." Claude Tasks = scheduler/trigger ("do X on a schedule, each run is stateless") A scheduled job is fire-and-forget: trigger Claude, it runs, it's done. An agentic workflow is stateful: Phase 2 can't start until Phase 1 passes review, the agent needs to know what gaps the reviewer flagged, and all of this has to survive across sessions because Claude starts each one with zero memory. The gap is agent amnesia, not schedulingClaude Tasks can fire a job every morning, but each run has no memory of what it did yesterday, what's blocked on CI, or which phase of a multi-step implementation it's in the middle of. That's the structural problem Beads solves:
Layers, not alternativesThese aren't competing tools — they're different layers of the same stack: Claude Tasks would slot in as the top trigger layer. A Tasks run could On composability and portabilityYour instinct here is right. Beads is git-backed (Dolt), so the execution state travels with the repo and isn't locked to any provider. A Claude Tasks -> Beads workflow today could become a Gemini/OpenAI -> Beads workflow tomorrow without losing any of the accumulated state. That portability matters more as agentic workflows get longer-lived. TL;DR: Claude Tasks answers "when should work happen?" Beads answers "what work is ready and what does the agent need to know?" Scheduled jobs and stateful agentic workflows are complementary layers, not competing ones. |
Beta Was this translation helpful? Give feedback.
-
|
Follow-up: what about subagent delegation within a single Task run? A fair counterargument: Claude can spawn subagents (parallel workers) within a single session. So could a single scheduled Task run orchestrate a complex multi-step workflow without needing cross-session state? In theory, yes. In practice, single-session orchestration hits hard limits:
Non-trivial agentic work needs to span sessions. In our workflow, each spec phase is designed to be session-sized because one-phase-per-session hits the quality sweet spot — focused enough for good output, bounded enough to checkpoint before things go wrong. Amnesia is fractalThe memory gap actually exists at every boundary, not just between sessions: Subagents start with zero context — no conversation history, no prior tool results. The orchestrator must explicitly pack everything the subagent needs into its prompt. This works within a session because the orchestrator has the context to share. But across sessions or across scheduled runs, there's no orchestrator with context — you need an external state store. That's the structural role Beads fills. Each boundary needs its own memory mechanism, and Claude Tasks adds another boundary (run-to-run) without adding any memory mechanism to bridge it. Beads checkpoints survive all of these boundaries — they're in Dolt, queryable via |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Anthropic recently rolled out Claude Tasks, which lets you schedule recurring or one-off automated jobs directly in Claude, things like daily briefings, monitoring, summarization, etc. On the surface, this covers a lot of the same ground Beads is targeting: giving an LLM a persistent job to do on your behalf without you babysitting it. So the obvious question: does Claude Tasks eliminate the need for Beads, or is there still a meaningful gap? My gut says there's still a gap around composability, portability, and not being locked into one provider's ecosystem, but I'm curious what others think. Are Tasks a "good enough" version of this for most people, or is the Beads model solving a fundamentally different problem?
Beta Was this translation helpful? Give feedback.
All reactions