1.2 Multi-Agent Orchestration
1.2.1 When One Agent Isn't Enough
In Lesson 1.1 you built a single agent: one loop, one model, working through a task. That carries you a long way. But picture a harder job — "research the impact of AI across every creative industry and write me a cited report." A single agent would have to search music, then film, then writing, then visual arts, then design, reading hundreds of articles, one after another, cramming every result into one ever-growing conversation. Two things go wrong: it's slow (everything happens in sequence), and the conversation balloons until the important early findings get buried under everything that came later.
Now picture how a human research team handles the same job. A lead researcher breaks the topic into areas, hands each to a specialist, lets them work in parallel, and then pulls their findings together into one report. Each specialist keeps their own notes; the lead doesn't need to read every article — just the summaries. The work gets done faster, and no single person drowns in detail.
That team is exactly what multi-agent orchestration gives you. One agent — the coordinator — breaks a big task into pieces and delegates each to a specialized subagent (which is itself just an agent running its own loop from Lesson 1.1). The subagents work independently, often at the same time, and report back. Task Statement 1.2 is about designing this team well: how the coordinator divides the work, how the pieces talk to each other, and — most importantly for the exam — how to spot when the team produces a bad result and trace it to the right cause.
A single agent grinds through subtopics sequentially in one swelling context. A coordinated team splits the work across specialists who run in parallel — faster, and no one context overflows.
The one idea to hold onto
Multi-agent orchestration is a coordinator that breaks a big task into pieces and delegates each to a specialized subagent. It buys you parallelism and keeps any one context from overflowing — at the cost of more complexity and far more tokens.
1.2.2 The Hub-and-Spoke Shape
There are many ways to wire a team together, but the one the exam cares about — and the one that works best in practice — is hub-and-spoke. Picture a bicycle wheel: a hub in the centre, spokes radiating out to the rim. The coordinator is the hub. Each subagent is at the end of a spoke. And here's the defining rule: the spokes connect only to the hub, never to each other.
Concretely, that means every piece of communication flows through the coordinator. The search subagent never hands its results directly to the synthesis subagent. It returns them to the coordinator, and the coordinator decides what to pass onward. The subagents are like consultants who report only to the project manager and never email each other directly.
Why force everything through the centre? Because routing through one place buys you three things that a free-for-all can't: observability (the coordinator sees everything that happens, so you can debug it), consistent error handling (one place decides what to do when something fails), and controlled information flow (the coordinator decides exactly what each subagent sees, so nothing leaks or gets duplicated by accident).
Hub-and-spoke: subagents connect only to the coordinator, never directly to one another. That single rule gives you observability, consistent error handling, and controlled information flow.
1.2.2 — Key Concept
In hub-and-spoke, the coordinator manages ALL communication, error handling, and routing. Subagents return results only to the coordinator and never talk to each other. If an exam answer proposes direct subagent-to-subagent communication, it's wrong.
1.2.3 The Isolation Principle — Subagents Start Blank
This is the single most important — and most tested — fact about subagents, and it follows directly from Lesson 1.1's idea that each agent has its own conversation. When the coordinator spawns a subagent, that subagent does NOT inherit the coordinator's conversation. It does not see the coordinator's history, it does not see what other subagents found, and it remembers nothing from any previous time it was invoked. It starts blank.
Think of each subagent as a brand-new contractor who walks in having read none of the project emails. If you want them to know something, you have to tell them — explicitly — in the brief you hand them. There is no shared whiteboard they can glance at, no group memory. This is the isolation principle, and it's a feature, not a bug: it's what keeps each subagent's context clean and focused (and it's what makes the coordinator's job to pass context properly — the subject of all of Lesson 1.3).
The practical upshot for 1.2: a coordinator that assumes its subagents 'just know' the background will get useless results. Every subagent invocation must carry, in its own brief, exactly what that subagent needs to do its job.
A subagent starts with a blank context — no inherited history, no peer results, no memory between invocations. Whatever it needs to know must travel in the brief the coordinator hands it.
1.2.3 — Key Concept
Subagents operate with isolated context: they do NOT automatically inherit the coordinator's history, other subagents' results, or shared state, and they keep no memory between invocations. The coordinator must pass everything explicitly (the whole of Lesson 1.3).
1.2.4 The Coordinator's Four Jobs
If the coordinator is the brain of the operation, what exactly is it responsible for? Four things. When a multi-agent system misbehaves, the cause is almost always a failure in one of these — usually the first one.
| The coordinator's job | What it means, concretely |
|---|---|
| Decompose & select | Break the task into the RIGHT pieces, and invoke only the subagents a given query actually needs — not always the whole pipeline |
| Partition scope | Assign distinct subtopics or sources to each subagent so they don't duplicate each other's work |
| Refine iteratively | After synthesis, check the result for gaps; if some are found, re-delegate targeted queries and synthesize again |
| Route everything | Pass all communication through itself for observability, consistent errors, and controlled flow |
The coordinator's four responsibilities. 'Decompose' is the highest-stakes — get the pieces wrong and even flawless subagents produce an incomplete answer.
Look closely at 'decompose'. The coordinator's first act is to carve the task into subtasks. If it carves badly — too narrowly, missing whole parts of the question — then it hands its subagents the wrong assignments. The subagents will execute those wrong assignments perfectly, and the final report will be confidently incomplete. This is the failure mode the exam tests more than any other, so we'll give it its own section next.
1.2.4 — Key Concept
The coordinator's four jobs are decompose-and-select, partition scope, refine iteratively, and route everything. Most multi-agent failures trace to decomposition — the quality of the whole system is capped by how well the coordinator splits the task.
1.2.5 The Signature Failure: Blame the Decomposition, Not the Workers
Here is the scenario the exam returns to again and again. You run a research system on "the impact of AI on creative industries." Every subagent reports success: the search agent finds good articles, the analysis agent summarises them correctly, the synthesis agent writes a coherent report. And yet the final report covers only visual arts — music, writing, and film are completely missing. Where's the bug?
The instinct is to blame a worker: maybe the search agent's queries weren't broad enough, or the synthesis agent should have noticed the gap. Resist that instinct and look at the coordinator's logs. There you find it: the coordinator decomposed "creative industries" into just three subtasks — "digital art", "graphic design", and "photography." Every one is a visual art. The subagents did their jobs flawlessly; they were simply never ASSIGNED music, writing, or film. The coverage gap was baked in at the moment of decomposition, before any subagent ran.
The lesson generalises: when a multi-agent result is incomplete but every subagent succeeded within its scope, the root cause is upstream — the coordinator's decomposition — not the downstream workers. The fix is to broaden the decomposition so it spans all relevant parts of the topic. Adding more subagents, rewriting the search queries, or giving the synthesis agent a gap-checker all miss the real cause.
Every subagent succeeds — but the coordinator only assigned visual-arts subtasks, so whole branches of the topic never get researched. The gap is in the decomposition, upstream of the workers.
1.2.5 — Exam Trap
✗ Blaming the search/analysis/synthesis agent for a coverage gap they were never assigned to fill. ✗ Adding more subagents, expanding search queries, or bolting on a gap-detector. ✓ Fix the coordinator's decomposition so it covers all relevant domains of the topic. Read the decomposition logs first.
1.2.6 The Price Tag — and When It's Worth Paying
Multi-agent systems are genuinely powerful, but they are not free, and a good architect knows when NOT to reach for one. The numbers are striking. In Anthropic's own research system, a multi-agent setup (an Opus-4 lead coordinating Sonnet-4 subagents) beat a single Opus-4 agent by 90.2% on their internal research evaluation — a huge gain. But it used about 15× more tokens than a normal chat (general agents use about 4×). In fact, token usage alone explains roughly 80% of the variance in how well these systems do. More agents means more parallel reasoning, which means more tokens.
So the deciding question is not "is this task hard?" — it's "what SHAPE is this task?" Multi-agent pays off when the work is naturally parallel, when the information exceeds what one context can hold, when many complex tools are involved, or when the query is breadth-first (explore several independent directions at once). It is the wrong choice when all the agents would need to share the same context with many dependencies between them, and for most coding, where the subtasks usually depend on each other and can't truly run in parallel.
| Reach for multi-agent when… | Stick with a single agent when… |
|---|---|
| The work is heavily parallelizable | Every step depends on the previous one |
| Information exceeds one context window | Everything fits comfortably in one context |
| The query is breadth-first (many directions) | The task is narrow and deep |
| Many complex tools are involved | Agents would all need the same shared state |
Match the architecture to the shape of the work. Given the ~15× token cost, a tightly-coupled task is usually better — and cheaper — as a single agent.
1.2.6 — Key Concept
The decision to go multi-agent is about the SHAPE of the work (parallelizable, context-exceeding, breadth-first), not its difficulty. Scale to complexity: a simple query is 1 agent with 3–10 tool calls; complex research may use 10+ subagents. Each subagent's brief needs an objective, an output format, tool/source guidance, and clear boundaries.
1.2.7 Put It Together: Build a Coordinator
You now have the whole picture: why teams beat soloists for the right tasks, the hub-and-spoke shape, the isolation principle, the coordinator's four jobs, the signature decomposition failure, and the cost trade-off. As with 1.1, the way to lock it in is to build a small version and then deliberately reproduce the failure mode.
1.2.7 — Build Exercise (60 min)
Build a coordinator with a web-search subagent and a document-analysis subagent. (1) Give each subagent everything it needs explicitly in its brief — confirm it works with no inherited context. (2) Have the coordinator emit BOTH delegations in a single response so they run in parallel; time it against running them one after another. (3) Add an iterative-refinement step: after synthesis, have the coordinator check coverage and re-delegate targeted queries for any gap. (4) Now reproduce the signature bug — make the coordinator decompose a broad topic too narrowly, watch the report come back incomplete despite every subagent succeeding, then fix it at the coordinator.
Everything here rests on subagents being able to receive a proper brief — because they start blank. HOW you write that brief, pass findings between stages, and keep citations intact is the entire subject of the next lesson, 1.3.
Where this shows up on the exam
1.2 questions are usually 'the multi-agent system produced a bad result — what's the cause?' If you instinctively check the coordinator's decomposition first, and you remember that subagents are isolated and route through the hub, you'll answer them correctly.
Key Takeaways
- ✓Multi-agent orchestration is a coordinator that splits a big task into pieces and delegates each to a specialized subagent — buying parallelism and avoiding context overflow, at ~15× the token cost.
- ✓Hub-and-spoke: the coordinator manages ALL communication, error handling, and routing; subagents return only to the coordinator and never talk to each other (that buys observability, consistent errors, controlled flow).
- ✓Isolation principle (most tested): subagents start blank — no inherited history, no peers' results, no memory between invocations — so the coordinator must pass everything explicitly in the brief.
- ✓The coordinator has four jobs: decompose-and-select the right subagents, partition scope to avoid duplication, refine iteratively to close gaps, and route everything centrally.
- ✓Signature failure: when a result is incomplete but every subagent succeeded, the cause is the coordinator's overly narrow decomposition — not the downstream workers; read the decomposition logs and broaden them.
- ✓Real numbers: multi-agent (Opus lead + Sonnet subagents) beat single-agent Opus by 90.2% on research but used ~15× the tokens (general agents ~4×); token usage explains ~80% of success variance.
- ✓Choose multi-agent by the SHAPE of the work — parallelizable, context-exceeding, breadth-first, many tools — not by difficulty; tightly-coupled or coding tasks usually belong to a single agent.
Check Your Understanding
Test what you learned in this lesson.
Q1.A research report on "impact of AI on creative industries" covers only visual arts, missing music, writing, and film. Every subagent ran successfully; coordinator logs show it decomposed the topic into "digital art", "graphic design", and "photography". What is the root cause?
Q2.In hub-and-spoke orchestration, how do two subagents share information?
Q3.Which task is the BEST fit for a multi-agent (coordinator-subagent) architecture?
Q4.A coordinator always invokes its full pipeline of five subagents for every query, even trivial ones. Which coordinator responsibility is it neglecting?
Practice This Lesson