📚 Domain 1 · Task Statement 1.6

Design Task Decomposition Strategies for Complex Workflows

📊 Domain Weight: 27% · ⭐ High Exam Priority · 🔗 Scenario: Developer Productivity Tool

Not all complex tasks should be decomposed the same way. Prompt chaining works when you know the steps in advance — a fixed pipeline where each stage feeds the next. Dynamic adaptive decomposition works when you don't — where each step's output determines what comes next. The exam tests your ability to choose the right pattern for the scenario, and to apply the code-review-specific technique of per-file local passes + cross-file integration pass to avoid attention dilution.

📋 Contents

Analogy — The Architect vs The Explorer
The Two Decomposition Patterns
Prompt Chaining — Fixed Sequential Pipelines
Attention Dilution & the Per-File + Integration Pattern
Dynamic Adaptive Decomposition
Decomposing Open-Ended Tasks: Adding Tests to a Legacy Codebase
Decision Framework: When to Use Which Pattern
Anti-Patterns & Pro Tips
Summary & Exam Key Points

🏗️ Analogy — The Architect vs The Explorer

🏗️ Two Ways to Navigate a Complex Project

The Architect receives a blueprint before groundbreaking. They know: pour foundation → frame walls → install plumbing → electrical → drywall → finish. Every step is defined before the first shovel breaks ground. This is prompt chaining — you know the decomposition upfront, each stage feeds the next in a fixed order, and the plan doesn't change based on what you find inside the walls.

The Explorer enters an unmapped cave. They don't know what's inside. They map the entrance chamber first → discover a passage leading left → explore that passage → find a branching junction → decide to send a scout right → based on the scout's report, decide to focus on the left tunnel. Each step's result determines the next plan. This is dynamic adaptive decomposition — the workflow shape emerges from what is discovered.

Code reviews are the Architect. You always need to check security, performance, correctness, and style — the four categories don't change based on the code. Adding tests to an unfamiliar legacy codebase is the Explorer — you must first understand what's there before you can plan what to build.

🔀 The Two Decomposition Patterns

Dimension	Prompt Chaining (Fixed Pipeline)	Dynamic Adaptive Decomposition
When steps are known	Yes — before execution starts	No — determined by intermediate findings
Workflow shape	Linear or parallel, pre-defined	Tree-shaped, emergent
Typical use case	Code reviews, document processing, multi-aspect analysis	Open-ended investigation, legacy codebase exploration, research
AI Fluency 4D Parallel	Heavy emphasis on Diligence (executing fixed steps perfectly)	Heavy emphasis on Description (understanding the unknown before acting)
Subtask generation	Hardcoded in pipeline definition	Generated dynamically from findings at each step
Exam scenario	Code review tool (Scenario 4)	Add comprehensive tests to legacy codebase (Scenario 4)
Failure mode	Rigidity — can't adapt if something unexpected found	Loss of control — may spiral into unbounded exploration

💡 The Exam's Distinguishing Question

When an exam question asks you to choose a decomposition strategy, the key signal is: "Is the structure of the task known before execution begins?" If yes → prompt chaining. If the structure only becomes clear as you investigate → dynamic adaptive decomposition. A code review is always prompt chaining (same aspects every time). "Explore this codebase and find all security vulnerabilities" is dynamic (you don't know what you'll find).

⛓️ Prompt Chaining — Fixed Sequential Pipelines

Prompt chaining breaks a complex task into a predetermined sequence of focused stages. Each stage produces output that becomes the input for the next. Claude sees a smaller, well-scoped problem at each step rather than the full complexity at once.
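
The stage-feeds-stage idea can be sketched in a few lines. This is a minimal illustration under stated assumptions, not an SDK feature: `run_chain`, `Stage`, `STAGES`, and `fake_model` are hypothetical names, and `call_model` stands in for any function that sends a prompt and returns text (for example a thin wrapper around `client.messages.create`), so the chain can be exercised offline with a stub.

```python
from typing import Callable

# Hypothetical type: a stage turns the previous stage's output into a prompt.
Stage = Callable[[str], str]


def run_chain(stages: list[Stage], initial_input: str,
              call_model: Callable[[str], str]) -> list[str]:
    """Run a fixed sequence of stages; each output feeds the next stage."""
    outputs: list[str] = []
    current = initial_input
    for build_prompt in stages:
        current = call_model(build_prompt(current))  # one focused call per stage
        outputs.append(current)
    return outputs


# The stage list is fixed before any data is seen: that is what makes it a chain.
STAGES: list[Stage] = [
    lambda code: f"List the public functions in this code:\n{code}",
    lambda functions: f"Flag risky functions in this list:\n{functions}",
    lambda risks: f"Write a short review summary from these risks:\n{risks}",
]


def fake_model(prompt: str) -> str:
    """Offline stub for demonstration only; echoes the prompt's first line."""
    return "reply to: " + prompt.splitlines()[0]


results = run_chain(STAGES, "def pay(): ...", fake_model)
print(results[-1])
```

Swapping `fake_model` for a real model call changes nothing about the workflow shape, which is the defining property of a fixed pipeline.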

The Code Review Pipeline (Exam Guide's Primary Example)

A multi-file code review is the canonical prompt chaining example. The structure is always the same: analyze each file individually (local passes), then run a cross-file integration analysis (global pass). This structure is defined before any code is seen.

Figure 1 — Code Review Prompt Chain: Per-File + Cross-File Integration

Python — Prompt Chaining: Code Review Pipeline

import anthropic
import asyncio

client = anthropic.Anthropic()

async def review_single_file(filename: str, content: str) -> dict:
    """PHASE 1: Per-file local analysis pass."""
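    # The sync client blocks, so run the call in a worker thread; otherwise
    # asyncio.gather() would execute the reviews one after another.
    # (anthropic.AsyncAnthropic with `await` is the alternative.)
    response = await asyncio.to_thread(
        client.messages.create,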
        model="claude-opus-4-5",
        max_tokens=4096,
        messages=[{
            "role": "user",
            "content": f"""Review this file for LOCAL issues only.
Do NOT attempt cross-file analysis — focus strictly on this file.

File: {filename}
```python
{content}
```

Return JSON with:
- security_issues: list of {{issue, severity, line}}
- logic_errors: list of {{issue, description, line}}
- performance_issues: list of {{issue, impact, line}}
- summary: one-paragraph overview"""
        }]
    )
    return {"file": filename, "findings": response.content[0].text}


async def run_integration_pass(per_file_results: list[dict]) -> str:
    """PHASE 2: Cross-file integration analysis."""
    findings_text = "\n\n".join([
        f"=== {r['file']} ===\n{r['findings']}"
        for r in per_file_results
    ])

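    # Run the blocking SDK call in a worker thread so the event loop stays free.
    response = await asyncio.to_thread(
        client.messages.create,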
        model="claude-opus-4-5",
        max_tokens=4096,
        messages=[{
            "role": "user",
            "content": f"""You are given per-file analysis results for a codebase.
Identify CROSS-FILE issues that individual analysis missed:

{findings_text}

Identify:
1. Data flow issues (data from File A passed unsafely to File B)
2. Inconsistent error handling conventions across files
3. Dependency conflicts (circular imports, version mismatches)
4. Cross-file security vulnerabilities (e.g. auth bypass via API→DB path)
5. Contradictory findings (File A says X is safe; File B shows it isn't)

Return structured JSON with cross_file_issues[] and synthesis_summary."""
        }]
    )
    return response.content[0].text


async def code_review_pipeline(files: dict[str, str]) -> dict:
    """Full prompt-chaining code review: local passes → integration pass."""
    # PHASE 1: Run all per-file analyses in PARALLEL
    per_file_tasks = [
        review_single_file(fname, content)
        for fname, content in files.items()
    ]
    per_file_results = await asyncio.gather(*per_file_tasks)

    # PHASE 2: Integration pass uses per-file results as input
    integration_findings = await run_integration_pass(per_file_results)

    return {
        "per_file_findings": per_file_results,
        "integration_findings": integration_findings
    }

🎯 Attention Dilution & the Per-File + Integration Pattern

Attention dilution occurs when a model reviews many files simultaneously in a single large prompt. The model tends to process content at the beginning and end of a long input more reliably, while findings from the middle sections are under-represented or missed entirely. This is the "lost in the middle" effect applied to code review.

❌ All Files in One Prompt (Anti-Pattern)

Sending 8 files in a single 40,000-token prompt. Claude reviews auth.py well (start), db.py well (end), but api.py (middle) gets shallow analysis. The model's "attention" — its ability to focus on specific content — is diluted across too many concerns simultaneously.

✅ Per-File Local Passes (Correct Pattern)

Each file gets its own dedicated API call with focused instructions: "Analyze THIS file for local issues only." The model's full attention goes to one file at a time. Then a separate integration pass synthesizes cross-file findings from the per-file results.

⚠️ Why the Integration Pass Cannot Be Skipped

Per-file passes only catch local bugs — issues within a single file. Critical cross-file vulnerabilities are invisible to local-only analysis: an authentication bypass requires seeing both auth.py AND api.py's call patterns together; a data flow security issue requires tracing data from ingestion through processing to storage across multiple files. The integration pass is not optional for production code review.

⭐ Pro Tip — Explicit Scope Restriction in Each Pass

Add explicit instructions in per-file passes: "Do NOT attempt cross-file analysis. Focus strictly on this file." Without this, Claude may try to reason about imports, dependencies, or callers it can't see — producing speculative findings that aren't grounded in actual code. Scope restriction focuses each pass and prevents hallucinated cross-file guesses.

🌿 Dynamic Adaptive Decomposition

Dynamic adaptive decomposition lets the workflow shape emerge from what is discovered at each step. Instead of a fixed pipeline, the agent generates the next set of subtasks based on intermediate findings. This is the correct pattern for open-ended investigation tasks where the problem structure is unknown before exploration begins.

The Core Loop

1️⃣ Map

First, understand the structure of the problem space. For a codebase: read directory trees, list modules, identify entry points. Produce a structural map before planning what to investigate.

2️⃣ Identify High-Impact Areas

Based on the map, rank areas by impact. For test coverage: find modules with no tests, highest cyclomatic complexity, or most dependencies. This drives prioritization.

3️⃣ Generate Adaptive Subtasks

Create a prioritized plan from step 2. Crucially: this plan may change as you execute it. Discovering an unexpected dependency in module A may reprioritize or introduce module B into the plan.

4️⃣ Execute & Adapt

Execute subtasks. At each step, check if findings change the remaining plan. A discovered circular dependency might require rerouting the testing strategy for multiple modules.
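
The four steps above can be sketched with a toy example. Everything here is hypothetical and chosen for illustration: `MODULE_MAP` stands in for the output of a mapping pass, `HIDDEN_DEPENDENCIES` stands in for what the agent would find on opening each module, and `rank_by_impact` and `execute_adaptively` are not part of any SDK. The ranking rule (lowest coverage first, then most dependents) is one plausible choice among many.

```python
from collections import deque
from typing import Callable

# Step 1 (map): hypothetical output of a mapping pass over the codebase.
MODULE_MAP = {
    "utils.py": {"coverage": 0.9, "dependents": 2},
    "auth.py": {"coverage": 0.4, "dependents": 7},
    "payment.py": {"coverage": 0.0, "dependents": 5},
}

# Toy stand-in for what the agent finds when it opens each module.
HIDDEN_DEPENDENCIES = {
    "payment.py": ["legacy_db.py"],
    "legacy_db.py": ["config.py"],
}


def rank_by_impact(module_map: dict[str, dict]) -> list[str]:
    """Step 2: lowest coverage first, ties broken by most dependents."""
    return sorted(module_map,
                  key=lambda name: (module_map[name]["coverage"],
                                    -module_map[name]["dependents"]))


def execute_adaptively(plan: list[str],
                       inspect: Callable[[str], list[str]]) -> list[str]:
    """Steps 3-4: work through the plan; new discoveries join the queue."""
    queue = deque(plan)
    seen = set(plan)
    done: list[str] = []
    while queue:
        target = queue.popleft()
        done.append(target)
        for dependency in inspect(target):
            if dependency not in seen:  # the plan changes only for new findings
                seen.add(dependency)
                queue.append(dependency)
    return done


plan = rank_by_impact(MODULE_MAP)[:2]  # step 3: initial prioritized plan
order = execute_adaptively(plan, lambda m: HIDDEN_DEPENDENCIES.get(m, []))
print(order)
```

The initial plan contains two modules, yet four are processed: `legacy_db.py` and `config.py` enter the queue only because they were discovered during execution.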

Figure 2 — Dynamic Adaptive Decomposition vs Fixed Pipeline

🧪 Decomposing Open-Ended Tasks: Adding Tests to a Legacy Codebase

The exam guide's specific open-ended example is: "add comprehensive tests to a legacy codebase." This is a three-phase adaptive process where each phase's output determines the next phase's plan.

The Three-Phase Adaptive Investigation

💡 Exam Guide Language — Memorise This

The exam guide states: "Decomposing open-ended tasks (e.g., 'add comprehensive tests to a legacy codebase') by first mapping structure, identifying high-impact areas, then creating a prioritized plan that adapts as dependencies are discovered."

These three concepts (map structure, identify high-impact areas, prioritized plan that adapts) are the exact language you need to match in answers.

Python — Dynamic Adaptive Decomposition: Legacy Test Generation

import anthropic
import json

client = anthropic.Anthropic()

def run_with_tools(prompt: str, tools: list[dict], handlers: dict,
                   max_tokens: int, max_turns: int = 20) -> str:
    """Call Claude, running any requested tools, until it returns final text.

    `tools` (tool definitions) and `handlers` (tool name -> local callable)
    are supplied by the caller; neither is defined in this sketch.
    """
    messages = [{"role": "user", "content": prompt}]
    for _ in range(max_turns):
        response = client.messages.create(
            model="claude-opus-4-5",
            max_tokens=max_tokens,
            messages=messages,
            tools=tools,
        )
        if response.stop_reason != "tool_use":
            return "".join(b.text for b in response.content if b.type == "text")
        # Echo the assistant turn, then answer every tool_use block it contains.
        messages.append({"role": "assistant", "content": response.content})
        messages.append({"role": "user", "content": [
            {"type": "tool_result", "tool_use_id": b.id,
             "content": str(handlers[b.name](**b.input))}
            for b in response.content if b.type == "tool_use"
        ]})
    raise RuntimeError("Tool loop did not finish within max_turns")


def add_tests_to_legacy_codebase(repo_path: str, tools: list[dict],
                                 handlers: dict, max_modules: int = 20) -> list:
    """Dynamic adaptive decomposition for an open-ended test generation task."""

    # ─── PHASE 1: MAP STRUCTURE ──────────────────────────────────────────
    # Don't plan yet: discover first. The structure is unknown.
    codebase_map = json.loads(run_with_tools(
        f"""Map the structure of this codebase at: {repo_path}

Use the file system tools to explore. Produce a structural map containing:
- Complete module list with file counts
- Modules with ZERO test coverage (critical)
- Modules with partial coverage
- Dependency relationships between modules
- Entry points and critical paths

Return as JSON. Do NOT write any tests yet: mapping only.""",
        tools, handlers, max_tokens=4096,
    ))

    # ─── PHASE 2: IDENTIFY HIGH-IMPACT AREAS ─────────────────────────────
    # Based on map, rank modules by impact before planning any tests.
    prioritization_response = client.messages.create(
        model="claude-opus-4-5",
        max_tokens=2048,
        messages=[{
            "role": "user",
            "content": f"""Given this codebase structure:
{json.dumps(codebase_map, indent=2)}

Identify and rank the TOP 5 modules to test first, based on:
1. Zero current coverage (highest priority)
2. Critical path modules (payment, auth, data processing)
3. High dependency count (testing these blocks others)
4. Complexity (harder to test = more value in doing it right)

Return JSON of the form {{"ranked_modules": [{{"name": "...", "rationale": "..."}}]}}.
Do NOT write tests yet."""
        }]
    )
    priority_modules = json.loads(prioritization_response.content[0].text)

    # ─── PHASE 3: ADAPTIVE EXECUTION ─────────────────────────────────────
    # Generate tests module by module; the queue grows as dependencies are found.
    queue = list(priority_modules["ranked_modules"])
    queued = {m["name"] for m in queue}
    generated_tests = []
    discovered_dependencies = []  # Grows as we explore

    while queue and len(generated_tests) < max_modules:  # cap keeps the plan bounded
        module = queue.pop(0)
        result = json.loads(run_with_tools(
            f"""Generate comprehensive tests for: {module['name']}

Previously discovered dependencies (must mock these):
{json.dumps(discovered_dependencies, indent=2)}

If you discover NEW dependencies during test generation,
report them in a 'new_dependencies' field (a list of module names);
they will be added to the remaining testing plan.

Write pytest tests covering: happy path, edge cases, error conditions.
Return JSON with fields 'tests' and 'new_dependencies'.""",
            tools, handlers, max_tokens=8096,
        ))
        generated_tests.append(result["tests"])

        # KEY: Plan adapts as new dependencies are discovered
        for dep in result.get("new_dependencies", []):
            if dep not in discovered_dependencies:
                discovered_dependencies.append(dep)
            if dep not in queued:  # assumes each dependency is a module name string
                queued.add(dep)
                queue.append({"name": dep})

    return generated_tests
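
The `json.loads` calls in the listing above assume each reply is bare JSON. In practice a model may wrap JSON in a Markdown code fence or add a sentence before it, which makes `json.loads` raise. A small tolerant parser, sketched here under that assumption, can sit between the API call and the rest of the pipeline. `parse_json_reply` and `FENCE` are hypothetical helper names, not part of the Anthropic SDK.

```python
import json
import re
from typing import Any

FENCE = "`" * 3  # a Markdown code fence, built this way to keep the listing readable

# Matches an optional "json" language tag and captures the fenced body.
_FENCED_BLOCK = re.compile(
    re.escape(FENCE) + r"(?:json)?\s*(.*?)\s*" + re.escape(FENCE),
    re.DOTALL,
)


def parse_json_reply(text: str) -> Any:
    """Parse a model reply as JSON, tolerating a surrounding Markdown fence."""
    match = _FENCED_BLOCK.search(text)
    candidate = match.group(1) if match else text
    try:
        return json.loads(candidate.strip())
    except json.JSONDecodeError as exc:
        # Fail loudly so the workflow can retry or skip this subtask.
        raise ValueError(f"Reply was not valid JSON: {text[:80]!r}") from exc


reply = FENCE + 'json\n{"tests": "...", "new_dependencies": ["legacy_db.py"]}\n' + FENCE
print(parse_json_reply(reply)["new_dependencies"])
```

Raising `ValueError` rather than returning an empty result keeps a malformed reply from silently dropping newly discovered dependencies from the plan.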

✅ Why the Plan Must Adapt

When generating tests for payment.py, the agent discovers it calls legacy_db.py which has its own untested logic. A fixed pipeline would miss this — it only tests what was planned upfront. The adaptive plan adds legacy_db.py to the test queue because it was discovered, not because it was pre-planned. This is the core value of adaptive decomposition: it handles unknown unknowns.

🎯 Decision Framework: When to Use Which Pattern

Signal in the Task Description	Pattern to Use	Reasoning
"Review this code for security, performance, and correctness"	Prompt Chaining	Review aspects are known upfront. Same 3 categories regardless of code content.
"Add comprehensive tests to this legacy codebase"	Dynamic Adaptive	Test targets determined by what's discovered in the codebase structure.
"Analyze each file in this PR for quality issues"	Prompt Chaining	Per-file passes are well-defined; integration pass is predictable.
"Investigate why this production system is slow"	Dynamic Adaptive	Investigation path depends entirely on what profiling and logs reveal.
"Generate a report covering legal, financial, and technical risk"	Prompt Chaining	Three defined dimensions. Sequential pipeline: legal → financial → technical → synthesis.
"Refactor this deprecated API to use the new authentication system"	Dynamic Adaptive	Must first map what uses the old API before knowing what to change.
"Extract structured data from these 1,000 invoices"	Prompt Chaining	Same extraction schema for each document. Batch processing pipeline.

💡 The One-Sentence Decision Rule

If you can draw the complete workflow diagram before seeing any data → Prompt Chaining.
If the workflow diagram changes based on what you find → Dynamic Adaptive Decomposition.

⚠️ Anti-Patterns & Pro Tips

❌ All Files in One Prompt for Code Review

Sending a 10-file PR as one 60,000-token prompt. Middle files suffer attention dilution. Per-file passes followed by a separate integration step are the correct pattern for multi-file review.

❌ Fixed Pipeline for Open-Ended Exploration

Using a hardcoded 5-step pipeline to "add tests to a legacy codebase" without ever mapping what exists. Steps 2-5 are planned blind — they'll be wrong when the codebase structure is unexpected.

❌ No Scope Restriction in Per-File Passes

Not telling Claude "focus on this file only." Claude will try to reason about imports and callers it can't see, producing speculative cross-file findings that pollute the per-file analysis.

❌ Skipping the Integration Pass

Running per-file passes and delivering those as the final review. Cross-file vulnerabilities — data flow issues, auth bypass paths — will be entirely absent from the report.

✅ Parallel Per-File Passes

Run all per-file analysis passes simultaneously using asyncio.gather() or parallel subagent spawning. This reduces code review latency from roughly the sum of all per-file calls (sequential) to roughly the duration of the slowest single file (parallel), provided API rate limits allow that many concurrent requests.
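
A minimal sketch of parallel per-file passes with a concurrency cap, which is one way to stay within rate limits. The names are hypothetical: `fake_review` sleeps instead of calling a model so the example runs offline, and `max_concurrent=4` is an arbitrary illustrative value.

```python
import asyncio
import time


async def fake_review(filename: str, seconds: float) -> str:
    """Stand-in for one per-file API call; sleeps instead of calling a model."""
    await asyncio.sleep(seconds)
    return f"{filename}: reviewed"


async def review_all(durations: dict[str, float],
                     max_concurrent: int = 4) -> list[str]:
    """Run per-file reviews concurrently, never more than max_concurrent at once."""
    limiter = asyncio.Semaphore(max_concurrent)

    async def limited(filename: str, seconds: float) -> str:
        async with limiter:
            return await fake_review(filename, seconds)

    # gather() returns results in the same order as the inputs.
    return await asyncio.gather(
        *(limited(name, secs) for name, secs in durations.items())
    )


durations = {"auth.py": 0.1, "api.py": 0.2, "db.py": 0.1}
start = time.perf_counter()
results = asyncio.run(review_all(durations))
elapsed = time.perf_counter() - start
print(results, f"{elapsed:.2f}s")
```

With three files and a cap of four, total time tracks the slowest file rather than the sum. Lowering `max_concurrent` below the file count trades latency for a gentler request rate.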

✅ Map Before You Plan

For open-ended tasks, always invest one API call in mapping structure before committing to a plan. A mapping step is cheap; re-planning mid-execution after discovering a wrong assumption is expensive.

✅ Bounded Adaptive Plans

Cap adaptive plans, for example at 3 rounds of adaptation and 20 generated subtasks. Unbounded adaptive decomposition can spiral: each discovered dependency adds more items, which add more items.
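
One way such caps might be enforced is sketched below. The function name, the `discover` callback, and the choice to return deferred work instead of discarding it are all assumptions made for illustration. The subtask cap is interpreted here as a limit on subtasks executed.

```python
from typing import Callable

MAX_ROUNDS = 3     # cap from the text: at most 3 rounds of adaptation
MAX_SUBTASKS = 20  # cap from the text: at most 20 subtasks


def bounded_adaptive_plan(
    initial: list[str],
    discover: Callable[[str], list[str]],
    max_rounds: int = MAX_ROUNDS,
    max_subtasks: int = MAX_SUBTASKS,
) -> tuple[list[str], list[str]]:
    """Return (executed, deferred): subtasks run vs. subtasks cut by the caps."""
    executed: list[str] = []
    deferred: list[str] = []
    seen = set(initial)
    current_round = list(initial)
    for _ in range(max_rounds + 1):  # round 0 is the original plan
        next_round: list[str] = []
        for task in current_round:
            if len(executed) >= max_subtasks:
                deferred.append(task)  # over budget: record it, do not run it
                continue
            executed.append(task)
            for found in discover(task):
                if found not in seen:
                    seen.add(found)
                    next_round.append(found)
        current_round = next_round
        if not current_round:
            break
    deferred.extend(current_round)  # discoveries beyond the last allowed round
    return executed, deferred


# A task that always uncovers one more task would never finish without caps.
executed, deferred = bounded_adaptive_plan(["a"], lambda task: [task + "+"])
print(executed, deferred)
```

Returning the deferred list makes the cut-off visible, so a human or a later run can decide whether the remaining items deserve another budget.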

✅ Carry Discoveries Forward

In adaptive decomposition, maintain a running discovered_dependencies list and include it in each subsequent subtask prompt. Each step builds on what previous steps discovered.

📝 Summary & Exam Key Points

🎯 Exam Scenario — Developer Productivity Tool (Scenario 4)

The primary exam scenario: "You are building developer productivity tools using the Claude Agent SDK. The agent helps engineers explore unfamiliar codebases, understand legacy systems, generate boilerplate code, and automate repetitive tasks. It uses built-in tools (Read, Write, Bash, Grep, Glob) and integrates with MCP servers."

Questions test: (1) selecting the correct decomposition pattern for a given task description, (2) recognizing that multi-file code reviews require per-file passes + integration pass to avoid attention dilution, and (3) identifying the three phases of open-ended task decomposition: map structure → identify high-impact areas → create prioritized plan that adapts as dependencies are discovered.

Prompt chaining is for workflows with a known structure before execution. Fixed sequential pipelines work when you can define all stages upfront — multi-aspect code reviews, document extraction with defined fields, multi-dimensional reports. The shape doesn't change based on what you find.

Dynamic adaptive decomposition is for open-ended investigation tasks. The workflow shape emerges from what is discovered at each step. Open-ended tasks like "add tests to a legacy codebase" or "investigate why this system is slow" require adaptive plans that change based on intermediate findings.

The code review pattern: per-file local passes + separate cross-file integration pass. This is the exam guide's explicit example. Analyze each file individually (local issues only, run in parallel), then run a single integration pass over all per-file results to find cross-file data flow issues, dependency conflicts, and contradictory findings.

Attention dilution = what happens when you put too many files in one prompt. The model tends to process content at the beginning and end of long inputs reliably but to under-weight or miss middle sections. Per-file passes avoid this: each file gets dedicated, undiluted attention.

The three phases of open-ended task decomposition (exact exam guide language): (1) mapping structure — understand what exists before planning, (2) identifying high-impact areas — rank by criticality, coverage gap, complexity, (3) creating a prioritized plan that adapts as dependencies are discovered — each execution step may add new items to the plan.

Per-file passes should include explicit scope restriction. Instruct Claude: "Focus on this file only. Do NOT attempt cross-file analysis." Without this, Claude speculates about imports and callers it cannot see, producing hallucinated cross-file findings in the per-file pass.

Per-file passes run in parallel; the integration pass runs sequentially after all local passes complete. The integration pass's input is the collected output of all per-file results, so it cannot run until all local passes finish. Per-file → integration is sequential at the workflow level, but within the per-file phase, all files run concurrently.

Decision rule: Can you draw the complete workflow before seeing any data? Yes → Prompt Chaining. No (the structure emerges from discovery) → Dynamic Adaptive Decomposition. This single test correctly classifies every task decomposition scenario on the exam.
