Pick the 4-week plan if…
- You ship with the Claude API or Claude Code in your day job and need 2-3 hours/day on weekdays plus a longer weekend block
- You want to walk in confident about all 5 domains, not just the two you already know cold
- You can hit at least one full 60-question mock before exam day and have a week to close gaps
- You're targeting a scaled score above 800 🟠, not just the 720 floor 🟡
- You need structured day-by-day pacing for week 1 and theme-based weekly summaries after that
- Your exam date is roughly 4 weeks out, give or take a few days
Otherwise see study plans index for the 2-week sprint or 8-week comprehensive variants.
Foundations and diagnostic
Week 1 is about calibration. You take the diagnostic on day 1 to expose your real gaps, then read the foundational frameworks that every later domain references. By Friday you should be able to draw the agentic loop on a whiteboard, name the 4 stop_reason values 🟡, and explain why subagents never inherit context.
Day 1Diagnostic and domain orientation~3 hr
- practiceTake the 30-question diagnostic~60 min
Sit it cold, no notes, 60 minutes max. Note which of the 5 domains 🟡 you bombed. Don't review answers yet; just record raw scores per domain.
- readRead the exam guide overview~30 min
Internalize the format: 60 MCQs, 120 minutes, 720/1000 to pass, no guessing penalty 🟡. Skim the domain weights so you know D1 (27%) and D3 (20%) carry the most leverage 🟠.
- knowledgeRead the AI Fluency framework (4D)~45 min
The 4D framework (Delegation, Description, Discernment, Diligence) is the conceptual spine the exam uses to frame agentic decisions. You'll see it again in scenarios.
- conceptCross-reference the 4D concept page~30 min
Map the framework to your diagnostic mistakes. Most failed D1 questions trace back to a Delegation or Discernment error, not a tool-mechanics error.
Day 2Claude 101 and API foundations~3 hr
- knowledgeRead Claude 101~45 min
Anchors model selection, message structure, and the basic prompt-and-response loop. Even if you've shipped with Claude for a year, do this; the exam tests the canonical Anthropic framing, not your shop's framing.
- knowledgeRead Claude API foundations~60 min
Focus on
messages.create, system prompts, multi-turn structure, and wherestop_reasonlives in the response. This is the bedrock for every D1 question. - conceptDrill the 4 stop_reason values~30 min
`end_turn`, `tool_use`, `max_tokens`, `stop_sequence` 🟡. Memorize what each implies for control flow. The exam will hand you a transcript and ask which value fired.
- conceptReview system prompts~30 min
Understand the system prompt as the deterministic anchor. Note the 'persona vs. instructions vs. case-facts' separation that case-facts-block doctrine builds on.
Day 3Agentic loops and the orchestration spine~3 hr
- conceptMaster the agentic loop~45 min
The standard loop: send messages, check
stop_reason, iftool_usethen execute and appendtool_result, repeat untilend_turn. This is the single most-tested mechanic across D1 (27%) 🟠. - knowledgeRead the subagents intro~45 min
Understand the ==hub-and-spoke== orchestration pattern and the rule that subagents never inherit parent context 🟡. This single fact is worth at least 2 exam questions.
- conceptCross-reference the subagents concept~30 min
Note when to spawn a subagent (parallel research, isolated context) vs. when to keep the work in the main loop (sequential dependency, shared state).
- scenarioWalk the Multi-Agent Research scenario~60 min
Read the P3.3 🟡 walkthrough end to end. Don't take the questions yet; just absorb how the architecture is justified.
Day 4Context window and capabilities~2.5 hr
- knowledgeRead AI Capabilities and Limitations~45 min
Learn where Claude struggles: long arithmetic chains, exact recall from the middle of a 200K context, deterministic JSON without schema enforcement. The exam tests calibrated trust, not Claude evangelism.
- conceptDrill context window mechanics~30 min
Internalize the lost-in-the-middle effect 🟡 and the progressive-summarization trap. Both show up in D5 (15%) 🟠 questions.
- conceptReview prompt caching~30 min
Know when
CacheControlEphemeralsaves cost vs. when it silently invalidates. Cache invalidation is a favorite trap question. - knowledgeRead Claude Cowork intro~30 min
Frames the human-in-the-loop pattern that D5 reliability questions reference. Pay attention to the structured handoff format.
Day 5Week 1 consolidation and mini-mock~2.5 hr
- practiceTake a 15-question D1 quiz~30 min
D1 carries 27% weight 🟠, which is more than any other domain. If you're below 70% here, plan extra D1 time in week 2.
- scenarioWalk the Customer Support scenario (P3.1)~45 min
The canonical 'first scenario' in the official guide 🟡. Notice how the architecture choices map to D1 + D5 reliability concerns.
- readSkim the anti-pattern reference~30 min
Just the headings for now. This becomes mandatory reading in week 4, but seeing the failure-mode catalog early reframes the rest of your prep.
- conceptReview escalation patterns~30 min
Closes the week on a D5 note. Know when to escalate to a human and what the structured handoff packet contains.
D1 Agentic Architecture and D2 Tool Design
Week 2 is the heaviest content week. D1 (27%) + D2 (18%) = 45% 🟠 of the exam, so spend the time. By Friday you should be able to design a tool schema cold, name all 4 isError categories 🟡, and explain why the 18-tool degradation point matters for tool-set design.
Day 1D1 orchestration: loops, subagents, stop control~3 hr
- conceptAgentic loops, deeper~35 min
Re-read with an eye for the loop's exit conditions. Trace a transcript and predict the next
stop_reasonbefore reading on. D1 is 27% of the exam 🟠, the single biggest block. - conceptSubagents and hub-and-spoke~30 min
Internalize that subagents never inherit context 🟡 — the orchestrator passes down only what it explicitly hands over. Most D1 misses are a hidden context-inheritance assumption.
- conceptStop reason as the loop's traffic signal~25 min
Map each of the 4 values to the control-flow decision it forces. The exam hands you a response object and asks what the loop does next.
Day 2D1 scenarios: trace two full agent systems~3 hr
- scenarioMulti-Agent Research (P3.3) walkthrough~40 min
The canonical hub-and-spoke scenario. Note where the orchestrator delegates vs. does the work itself, and how results merge without sharing context.
- scenarioCustomer Support Resolution (P3.1) walkthrough~40 min
A single-agent loop with escalation. Watch how
stop_reasonplus tool errors drive the escalation decision — this pattern recurs across D1 and D5 questions.
Day 3D2 foundations: tool design as a deterministic selector~3 hr
- conceptTool calling end to end~35 min
The tool description is the deterministic selector that drives tool choice 🟡 — description clarity matters more than schema cleverness. Know the 4
isErrorcategories cold. - concepttool_choice modes~25 min
Memorize
auto,any, a specific tool, andnone, plus the exact situation each is for. The exam tests when to forceanyvs. leaveauto. - conceptStructured outputs and schemas~30 min
Schema design plus the validation contract. This sets up the D4 validation-retry loop you drill in Week 3.
Day 4D2 scenarios: tool design and extraction~2 hr
- scenarioAgentic Tool Design (P3.7)~35 min
Watch how tool granularity and description quality change Claude's selection behaviour — the practical face of the deterministic-selector idea.
- scenarioStructured Data Extraction (P3.6)~35 min
The bridge between D2 (tool/schema) and D4 (structured output). Note where validation lives and how errors are fed back to Claude.
Day 5D2 advanced: MCP and the D1+D2 mock~3 hr
- knowledgeMCP Foundations~35 min
MCP appears in roughly a quarter of D2 questions. Understand the client/server model and the tool-discovery handshake.
- knowledgeMCP Advanced~30 min
Drill the 18-tool degradation rule 🟠 and when to split a tool server. Read this back-to-back with Foundations.
- practice30-question D1+D2 mock~45 min
Sit it timed. D1+D2 are 45% of the exam 🟠 — this is your mid-prep checkpoint on the heaviest block.
D3 Claude Code and D4 Prompt Engineering
Another 38% of the exam 🟠 sits in D3 + D4. Week 3 is where Claude Code stops being 'a tool you use' and starts being 'a system you can configure for someone else'. By Friday you should be able to write CLAUDE.md hierarchy from memory, design a Skill with proper YAML frontmatter, and explain the validation-retry loop for structured output.
Day 1D3 mechanics: CLAUDE.md and configuration~3 hr
- knowledgeClaude Code 101~35 min
The configuration model: where Claude Code reads instructions, how sessions work, and the headless
-pflag for pipelines. - conceptCLAUDE.md hierarchy~40 min
Memorize the precedence project > user > enterprise 🟡 and how YAML frontmatter globs scope rules to file paths. Write the hierarchy from memory by end of day.
Day 2D3 mechanics: Plan Mode and hooks~3 hr
- knowledgeClaude Code in Action~35 min
See configuration applied to a real workflow. Note where Plan Mode is triggered and how hooks gate tool use.
- conceptPlan Mode triggers~25 min
Know when Plan Mode engages and what it changes about tool execution. A common D3 trap is assuming it auto-executes the plan.
- conceptPreToolUse and PostToolUse hooks~30 min
Understand the two hook points and what a hook can block or rewrite. The
denied by PreToolUse hookmessage is a tested failure mode.
Day 3D3 advanced: Agent Skills~3 hr
- knowledgeAgent Skills intro~35 min
Skills are the highest-leverage D3 topic. Learn the SKILL.md frontmatter and when a skill is invoked vs. when the work is inlined.
- conceptSkills concept page~30 min
Map the relationship between Skills and subagents, plus the trigger mechanics. Expect at least one Skills-design question.
- scenarioCode Generation with Claude Code (P3.2)~30 min
Skills plus Claude Code applied to codegen. Note the configuration choices that make the workflow repeatable.
Day 4D3 scenarios: CI/CD and developer tooling~2 hr
- scenarioClaude Code in CI/CD (P3.5)~35 min
The headless
-pflag in a pipeline. Watch how non-interactive runs are gated and how output is consumed downstream. - scenarioAgent Skills for Developer Tooling (P3.12)~30 min
Skills packaged as reusable developer tools. Reinforces the invoke-vs-inline decision.
Day 5D4: prompt engineering and the D3+D4 mock~3 hr
- conceptPrompt engineering techniques~30 min
Few-shot examples drive determinism; system-prompt structure anchors behaviour. Learn the canonical Anthropic framing, not your shop's.
- conceptAttention engineering~25 min
Attention engineering drives recall under long context 🟡 — placement of instructions matters as much as their content.
- conceptEvaluation and the validation-retry loop~30 min
The canonical D4 pattern: validate the JSON, on failure send the validator's error back as a
tool_result, then retry. Be able to draw this loop. - practice30-question D3+D4 mock~45 min
Timed. D3+D4 are another 40% of the exam 🟠 — checkpoint before synthesis week.
D5 Reliability, scenario depth, and final mock
Week 4 is synthesis week. D5 is only 15% 🟠 of the exam, but it shows up in nearly every scenario question, so you fold it in while walking all 13 scenarios. By Sunday you sit a full 60-question mock, close any gaps, and lock in your readiness call.
Day 1D5: reliability and context management~3 hr
- conceptCase-facts-block doctrine~30 min
The case-facts-block separates immutable facts from persona and instructions 🟡 so long sessions don't drift. The single most-tested D5 idea.
- conceptSession state management~30 min
Structured state vs. free-form history. Know why the progressive-summarization trap silently corrupts long sessions.
- conceptCheckpoints~25 min
When to checkpoint and what to persist. Pairs with session-state for the reliability questions woven through scenarios.
Day 2D5 deployment and the scenario sweep begins~2 hr
- knowledgeClaude with Bedrock~25 min
The deployment angle D5 sometimes tests: how the platform changes auth, region, and model id, not the agent logic.
- knowledgeClaude with Vertex~25 min
Same lens as Bedrock. Skim for the differences the exam might contrast; you don't need depth here.
- scenarioLong Document Processing (P3.8)~35 min
D5 reliability under long context: chunking, attention placement, and state. Start the 13-scenario sweep here.
Day 3Scenario sweep: beyond-guide and KM~3 hr
- scenarioConversational AI Patterns (P3.10, beyond-guide)~30 min
Not in the official 6 but on the real exam (13 total) 🟢. Multi-turn control and escalation patterns.
- scenarioAgent Skills for Enterprise KM (P3.11, beyond-guide)~30 min
Skills applied to knowledge management. The second confirmed beyond-guide scenario.
- scenarioClaude for Operations (P3.9)~30 min
Operational agent patterns, where reliability and tool design intersect.
Day 4Final scenarios and the full 60-question mock~3 hr
- scenarioAgent Skills with Code Execution (P3.13)~25 min
Skills that run code. Watch the safety and verification framing.
- scenarioDeveloper Productivity Tools (P3.4)~25 min
Closes the 13-scenario sweep. You should now recognize every scenario family the exam draws from.
- practiceFull 60-question mock, timed~120 min
120 minutes, no notes, 720/1000 to pass 🟡. Sit it like the real thing. This is your readiness call.
Day 5Gap closure and readiness lock-in~3 hr
- readAnti-pattern reference: full read~45 min
Review every miss from the 60-question mock against the anti-pattern catalog. Most misses map to a named anti-pattern, not a knowledge gap.
- practiceSecond 30-question mock on your weakest 2 domains~45 min
Targeted. Confirm the gaps you closed actually moved. If you clear 720+ 🟡 here, you're ready to book.
- concept4D framework: final review~20 min
Re-read the spine (Delegation, Description, Discernment, Diligence) one last time; it frames the judgment questions that separate a pass from a borderline.
Which Knowledge mirrors to read each week
The Knowledge pillar mirrors all 13 Anthropic Skilljar courses. We map them across the 4 weeks so you don't over-read foundations or skip deployment material.
| Week | Knowledge pages |
|---|---|
| Week 1 | ai-fluency-framework · claude-101 · claude-api-foundations · ai-capabilities-limitations · claude-cowork-intro · subagents-intro |
| Week 2 | mcp-foundations · mcp-advanced |
| Week 3 | claude-code-101 · claude-code-in-action · agent-skills-intro |
| Week 4 | claude-with-bedrock · claude-with-vertex |
2-week sprint and 8-week comprehensive
The 4-week plan is canonical. The two alternates use the same content at different paces - they are not separate plans.
2-week sprint
You already operate Claude in production across agentic loops, MCP, and Claude Code, and your exam is locked under 14 days out. You need the compressed path that skips redundant fundamentals.
Collapse week 1 into 2-3 days of diagnostic plus targeted reading on weak domains only. Spend the next 5-6 days on D1 + D2 + D3 deep dives plus all 13 scenarios. Reserve the final 3-4 days for one full mock, gap closure, and a second 30-question mock. Skip foundational knowledge pages like AI Fluency and AI Capabilities & Limitations if your diagnostic shows mastery.
8-week comprehensive
You're newer to Claude or coming from a different LLM stack, you want certification plus durable mastery, or you're scheduling around heavy work commitments at 5-8 hours/week.
Stretch each domain across the first six weeks of reading plus building, with a project phase for original work (a working subagent system, a custom MCP server, a real Claude Code workflow with hooks). Reserve weeks 7 and 8 for two full mocks and scenario walkthroughs. The 8-week plan adds a final 'shadow build' phase where you implement one full reference architecture end to end.
Before you book the proctored exam
Don't book the seat until you can clear this gate cold. The exam fee resets and re-takes burn momentum, so spend the extra 2-3 days if anything below is shaky.
- Hit ≥720/1000 🟡 on a fresh 60-question mock taken under timed conditions, with no notes and no second attempt
- Score ≥70% on each individual domain in your most recent mock (no domain dragging the average up)
- Walked all ==13 scenarios== 🟢, including the 2 beyond-guide ones (P3.10 Conversational AI Patterns, P3.11 Agent Skills for Enterprise KM)
- Reviewed every entry in the /reference anti-pattern catalog at least once and can name the 5 highest-leverage failure modes
- Can recite from memory: the 4
stop_reasonvalues, the 4isErrorcategories, the 4tool_choicemodes, and the 5 domain weights - Can draw the agentic loop on a whiteboard and annotate where each
stop_reasonshort-circuits it - Have a written 'first 10 minutes' plan for the exam: which domain to triage first, when to flag, when to skip
- Logistics confirmed: proctor software installed, ID ready, quiet room booked, 2.5-hour block held with a 30-minute buffer
Once your Skilljar entitlement lands
Once you've booked the seat, the prep shape changes. The next two weeks are about ==consolidation, not expansion==: tighten what you know, don't open new fronts.
Lock the calendar
Block the exam window plus 30 minutes either side. Block 2 hours the morning of for a final scenario re-read and a light D1 warmup. Tell your team you're unavailable; protect the slot.
Switch from learning to drilling
Stop opening new knowledge pages. Re-read your own notes and the ==/reference anti-pattern catalog==. Take one 30-question mock every 3 days, focused on whichever domain trailed in your last full mock.
Build a personal cheat-sheet (for studying, not the exam)
One page: 5 domain weights, 4 stop_reason values 🟡, 4 isError categories, 4 tool_choice modes, CLAUDE.md hierarchy, the 13 scenario names. Read it daily until you can rewrite it from memory in under 5 minutes.
Plan the exam-day mechanics
Decide your triage rule. Most candidates do best by skipping any question that needs more than 90 seconds on first pass, then returning. ==No guessing penalty== 🟡 means you answer everything before time expires; never leave a blank.
Plan the post-exam move
Results take ==2 business days per the official guide, often 7-10 in practice== 🟠. Decide now what you'll do with the credential: update LinkedIn the day results land, draft a short post about what you'd tell yourself 4 weeks ago, file the badge artifact in your portfolio.
