📚 Domain 1 · Task Statement 1.1

Design & Implement Agentic Loops for Autonomous Task Execution

📊 Domain Weight: 27% 🎯 Difficulty: Core Concept 🔗 Scenario: Customer Support Agent

The agentic loop is the heartbeat of every autonomous Claude application. Understanding its lifecycle — how Claude decides what to do, when to stop, and how tool results feed back into reasoning — is the single most critical concept on the exam. This guide breaks it all down with real-world analogies, working code patterns, and the anti-patterns you must avoid.

📋 Contents

The Big Picture: A Real-World Analogy
The Agentic Loop Lifecycle
Deep Dive: stop_reason — The Control Signal
Conversation History as Working Memory
Model-Driven vs Pre-Configured Decision Trees
Implementation: Code Patterns
Anti-Patterns to Avoid
Exam Readiness & Key Takeaways

🍳 The Big Picture: A Real-World Analogy

🧑‍🍳 Analogy — Claude as a Head Chef

Imagine Claude is a head chef in a restaurant kitchen. You (the orchestrator) hand the chef a task: "Prepare a three-course dinner for a guest with a nut allergy."

The chef doesn't complete the whole meal in one step. Instead, they work in a continuous loop: check what ingredients are available → prep the starter → taste and adjust → move to the main → check the allergy list before plating → finish with dessert → announce the meal is ready. At each step, the chef uses tools (the pantry, the stove, the recipe book) and feeds the result of each action back into their next decision.

The loop ends when the chef says "Service!" (end_turn) — not when a timer goes off, not after a fixed number of steps. The goal drives the termination, not an arbitrary countdown.

This is exactly how an agentic loop works with Claude. The model receives a task, decides which tool to call, gets the result, reasons about what to do next, and keeps looping until it decides it's done. Your code is the kitchen manager — you handle the tool execution plumbing, but Claude drives the decision-making.

🔄 The Agentic Loop Lifecycle

Every agentic loop — regardless of complexity — follows the same four-stage cycle. Understanding each stage precisely is what separates architects who pass this exam from those who don't.

Figure 1 — Agentic Loop Lifecycle (Full Cycle)

🤖 AI Fluency: The 3 Modes of Interaction

Before diving into the loop mechanics, it's critical to understand how Agentic Loops fit into the overarching AI Fluency Framework. We interact with AI via three distinct modes:

Automation: AI executes specific tasks based on exact human instruction (e.g., standard API calls).
Augmentation: Humans and AI collaborate as thinking partners.
Agency (The Agentic Loop): Humans configure AI to independently perform future tasks on their behalf. The model drives the decisions dynamically.

Phase 1 — Building and Sending the Request

Every iteration begins by constructing a request containing three things: a system prompt (the agent's role & instructions), the current conversation history (messages array), and a list of available tools. For subsequent iterations, this growing conversation history is the critical mechanism by which Claude maintains context (working memory).

Phase 2 — Claude's Reasoning Process

When Claude receives the request, it processes the entire conversation history along with the tool definitions. This reasoning is holistic — it natively considers what the user originally asked, what it has already attempted, what the results of prior tool calls were, and what logical next step would move the task forward. This invisible reasoning emerges as either a tool_use block, a text response, or both.

Phase 3 — The stop_reason Gate

Claude generates a response returning the stop_reason field. This is the single most important signal your control loop reads to know what to do next. The definitive signal about what to do next is always the stop_reason, not the presence of text content.

"tool_use" — Claude wants to execute one or more tools before continuing. The loop MUST continue.
"end_turn" — Claude has finished reasoning; the response is complete. Extract final text and END the loop.
"max_tokens" — The response was truncated. Treat this as a continuation or error, NOT a completed task.

Phase 4 — Tool Execution and Result Appending

When stop_reason == "tool_use", your orchestrator code reads the tool_use content blocks, calls the actual function/API/database, and collects results. Claude never directly executes tools. Your application performs the exact action. Most critically, you must append the results formatted as tool_result blocks matched to the specific tool_use_id.

Phase 5 — Loop Termination on end_turn

When Claude returns stop_reason = 'end_turn', it decides it has sufficient information to produce a complete, final response. Return this to the user and stop the loop. This is a model-driven decision, where Claude alone decides it is finished.

💡 Key Insight — You Are the Executor

Claude acts as the brain (decides what tool to call and why). Your backend code acts as the hands (actually runs the tool and returns results). Claude trusts your tool results implicitly — this is why input validation in your tool layer matters so much for security.

Stage 4 — Append Results and Loop

You take the tool result and append it to the conversation history as a tool_result message with the matching tool_use_id. Then you send the entire updated history back to Claude as the next API request. Claude reads its own prior reasoning and the fresh tool result to decide the next action.

🚦 Deep Dive: stop_reason — The Control Signal

stop_reason is arguably the most important field in the entire Claude API response for agentic systems. It's the foundation of your loop's control flow. Let's be precise about every possible value:

stop_reason Value	Meaning	Your Action
`"tool_use"`	Claude is requesting one or more tool calls. Inspect `response.content` for `type:"tool_use"` blocks.	Execute tools → append results → loop
`"end_turn"`	Claude decided the task is complete and produced a final answer.	Extract text response → EXIT loop
`"max_tokens"`	The response was cut off because it hit the `max_tokens` limit.	Handle truncation — do NOT treat as end_turn
`"stop_sequence"`	A custom stop sequence was hit (rarely used in agentic loops).	Application-specific handling

🎯 Exam Focus — The Binary Decision

The exam will test whether you know the exact logic: if stop_reason == "tool_use" → continue; if stop_reason == "end_turn" → stop. Any other approach (checking text content, counting iterations, parsing for "I'm done") violates the API contract and is an anti-pattern.

Figure 2 — Sequence: stop_reason Control Flow (Two-Turn Example)

🧠 Conversation History as Working Memory

🧑‍🔬 Analogy — Claude's Short-Term Memory is Stateless

Claude has no persistent memory between API calls. Every request must carry the complete conversation history, like handing a new colleague a printed transcript of everything that happened before they joined the meeting. Claude reads the whole transcript to get up to speed, then contributes the next piece. Your app is the filing cabinet that stores and resends this transcript each time.

The Message Structure: What Gets Appended

After each tool call, your code must append two new entries to the messages array before looping:

The assistant's message (Claude's response, including the tool_use content block with id, name, and input)
A user message containing a tool_result content block (with the matching tool_use_id and the actual result content)

Python — Appending Tool Results

# After Claude responds with stop_reason = "tool_use":

messages.append({
    "role": "assistant",
    "content": response.content  # Contains tool_use block
})

for block in response.content:
    if block.type == "tool_use":
        result = execute_tool(block.name, block.input)
        
        messages.append({
            "role": "user",
            "content": [{
                "type": "tool_result",
                "tool_use_id": block.id,   # MUST match the tool_use block's id
                "content": str(result)
            }]
        })

Why the tool_use_id Matters

Claude can request multiple tools simultaneously in a single response (parallel tool calls). Each tool_use block has a unique id. When you return results, Claude uses the tool_use_id to match each result back to its specific tool request. Getting the ID wrong causes Claude to misinterpret results — a subtle but devastating bug.

💡 Real-World Example — Customer Support Agent

A customer asks: "What's the status of my last three orders?". Claude decides to call get_order_status three times in parallel, each with a different order ID and a different tool_use_id (e.g., tool_01, tool_02, tool_03). Your code executes all three, then returns three tool_result messages, each mapped to its tool_use_id. Claude aggregates and produces a unified answer. This parallel pattern is a performance optimization you should know for the exam.

⚖️ Model-Driven Decision-Making vs Pre-Configured Decision Trees

The exam distinguishes sharply between two architectural philosophies for agent decision-making. Understanding the tradeoffs is essential.

Aspect	Model-Driven (Claude Decides)	Pre-Configured (Static Flowchart)
How "next step" is determined	Claude reasons about tool results, conversation context, and instructions to decide the next action dynamically	Your code checks conditions and routes to the next step via if/else or a state machine
Flexibility	High — handles unexpected inputs gracefully	Low — breaks on edge cases not anticipated in the flowchart
Predictability	Probabilistic — may vary across runs	Deterministic — same input → same path
Best for	Open-ended tasks: research, customer support, code review	Compliance-critical workflows: identity verification before financial ops
Risk	LLM may skip steps if not enforced programmatically	Cannot handle novel situations not encoded in the flowchart
Claude SDK equivalent	Agentic loop with rich system prompt + tools	Hooks + programmatic prerequisites (Task 1.4 topic)

🔁 Analogy — GPS vs Fixed Train Route

Model-driven is like GPS navigation. You state the destination, and the GPS dynamically computes the best route, re-routes if there's traffic, and adapts to road closures — but it might occasionally take a weird detour if its map data is wrong.

Pre-configured is like a fixed train route. It always stops at the same stations in the same order — completely predictable — but if your destination isn't on the line, you can't get there.

The best production systems combine both: use model-driven reasoning for the intelligence layer, and programmatic enforcement (hooks, gates) for critical compliance steps.

💻 Implementation: Complete Agentic Loop Pattern

Below is the canonical agentic loop implementation pattern. Study this structure carefully — the exam tests whether you can identify correct vs incorrect implementations.

Python — Canonical Agentic Loop

import anthropic

client = anthropic.Anthropic()

def run_agent(user_task: str, tools: list) -> str:
    messages = [{"role": "user", "content": user_task}]
    
    while True:  # Loop continues until break
        
        # STEP 1: Send request to Claude with full history
        response = client.messages.create(
            model="claude-opus-4-5",
            max_tokens=8096,  # Set high enough for complex tool use responses
            tools=tools,
            messages=messages
        )
        
        # STEP 2: Append Claude's response to history
        messages.append({
            "role": "assistant",
            "content": response.content
        })
        
        # STEP 3: Inspect stop_reason — the ONLY correct termination check
        if response.stop_reason == "end_turn":
            # Task complete — extract and return text response
            for block in response.content:
                if hasattr(block, "text"):
                    return block.text
        
        elif response.stop_reason == "tool_use":
            # STEP 4: Execute each requested tool
            tool_results = []
            for block in response.content:
                if block.type == "tool_use":
                    result = dispatch_tool(block.name, block.input)
                    tool_results.append({
                        "type": "tool_result",
                        "tool_use_id": block.id,
                        "content": str(result)
                    })
            
            # Append all tool results as a user message
            messages.append({
                "role": "user",
                "content": tool_results
            })
            # Loop continues — back to STEP 1
        
        elif response.stop_reason == "max_tokens":
            # Response was truncated — do NOT treat as end_turn
            # Options: increase max_tokens, summarise context, or raise an error
            raise RuntimeError("Response truncated (max_tokens hit). Increase token budget or reduce prompt size.")
        else:
            # stop_sequence or other unexpected stop reason
            raise RuntimeError(f"Unexpected stop_reason: {response.stop_reason}")

💡 Why "while True" is Fine Here

You'll notice there's no iteration counter. That's intentional — the loop exits exactly when Claude decides the task is done, which is the correct pattern. The task statement explicitly says that setting arbitrary iteration caps as the primary stopping mechanism is an anti-pattern. That said, you may add a safety cap as a secondary failsafe for runaway loops — just not as the primary termination logic.

⛔ Anti-Patterns to Avoid

The exam explicitly calls out three anti-patterns. Memorise these — they appear in distractor answers:

⛔ Anti-Pattern 1: Parsing Natural Language to Detect Completion

Checking if Claude's response text contains phrases like "I have completed the task" or "Done!" to decide when to stop. Claude is probabilistic — it might say "Done" mid-task or omit it at actual completion.

⛔ Anti-Pattern 2: Arbitrary Iteration Caps as Primary Stop

Using if iteration >= 10: break as your main exit condition. The agent may legitimately need 15 steps for complex tasks, and cutting it short produces incomplete results. Caps are only valid as emergency failsafes.

⛔ Anti-Pattern 3: Checking for Text Content as Completion Indicator

Assuming that if Claude's response contains a text block (not a tool_use block), the task must be done. Claude can produce a text explanation alongside a tool_use in the same response.

✅ Correct Pattern 1: Use stop_reason Exclusively

if response.stop_reason == "end_turn" is the only reliable termination signal. It is set by the API, not by Claude's text — so it is deterministic and immune to prompt drift.

✅ Correct Pattern 2: Safety Cap as Secondary Guard

If you want protection against infinite loops, add if iteration > MAX_STEPS: raise LoopLimitError() after the stop_reason check. It's a safety net, not the primary gate.

✅ Correct Pattern 3: Inspect Content Block Types

Iterate over response.content and check block.type — either "tool_use" or "text". Only treat stop_reason == "end_turn" as the terminal state.

Python — ❌ WRONG vs ✅ RIGHT Termination Patterns

## ❌ WRONG — Parsing natural language
if "task complete" in response.content[0].text.lower():
    break  # DON'T DO THIS

## ❌ WRONG — Arbitrary cap as primary stop
for i in range(10):
    response = client.messages.create(...)
    # This exits after 10 rounds regardless

## ❌ WRONG — Checking for text as terminal indicator
if any(b.type == "text" for b in response.content):
    return response.content[0].text  # Text can exist alongside tool_use!

## ✅ CORRECT — Use stop_reason exclusively
while True:
    response = client.messages.create(...)
    if response.stop_reason == "end_turn":
        break     # The API told us we're done
    elif response.stop_reason == "tool_use":
        # Execute tools, append, continue
        ...

🎯 Exam Readiness & Key Takeaways

🎓 Exam Scenario Context — Customer Support Resolution Agent

The primary exam scenario for this task is: "You are building a customer support resolution agent using the Claude Agent SDK. The agent handles high-ambiguity requests like returns, billing disputes, and account issues. It has access to your backend systems through custom MCP tools (get_customer, lookup_order, process_refund, escalate_to_human). Your target is 80%+ first-contact resolution while knowing when to escalate."

Questions test your ability to identify bugs where the agent: (1) stops too early before completing the task, (2) fails to append tool results correctly causing Claude to re-request already-completed tools, (3) uses text parsing instead of stop_reason to detect completion, or (4) exits prematurely on a max_tokens truncation instead of handling it.

stop_reason is your ONLY termination signal. Inspect "tool_use" vs "end_turn" — these are the two values that drive your agentic loop. "max_tokens" must be handled separately (do not treat as end_turn). Never parse text content to decide loop termination.

Tool results are appended to conversation history so the model can reason about the next action. Append both the assistant's tool_use message AND the user's tool_result message with a matching tool_use_id before each loop iteration.

The distinction between model-driven decision-making and pre-configured decision trees is fundamental. Claude reasons about context to decide its next action (model-driven). Hooks and programmatic prerequisites enforce deterministic compliance (pre-configured). The best systems combine both.

Three named anti-patterns to memorise: (1) Parsing natural language signals to determine loop termination, (2) Setting arbitrary iteration caps as the primary stopping mechanism, (3) Checking for assistant text content as a completion indicator. These will appear as distractor answers.

Continuing correctly when stop_reason is "tool_use": Extract all type: "tool_use" content blocks, execute each tool, collect all results, append as a single user message, then loop back. Never respond early without executing all requested tools.

Claude can request multiple tools in parallel in one response (e.g., calling get_customer and lookup_order simultaneously). Handle all tool_use blocks, collect all results, return them in one user message. The tool_use_id links each result to its request.

← Back Back to Homepage

Next → Task 1.2 — Multi-Agent Orchestration