Blog · 2026-05-18· 4 min read

The 60% Rubicon: how to prepare for autonomous AI R&D before 2028

Anthropic co-founder Jack Clark gives 60% odds that AI systems will autonomously build their own successor models by end of 2028. Sustained agentic execution went from seconds in 2022 to ~12 hours in 2026. Anthropic models post 93.9% on SWE-Bench Verified and 95.5% on CORE-Bench. The nearer-term risk is not movie-villain AI; it is competent subagents quietly changing org design before governance catches up.

D1D5jack-clarkautomated-ai-rdrubicon-moment
Painterly walnut horologist's atelier. A long brass-marked timeline runs from 2022 (left) to 2028 (right) with a wax-seal milestone at 12 HOURS in the middle. An ink-drawn Rubicon threshold halfway across. Tiny subagent figurines on parallel rails. Loop the chronicler measures with brass calipers.

Quick answer

Anthropic co-founder Jack Clark gives 60% odds that AI systems will autonomously build their own successor models by end of 2028. Sustained agentic execution went from seconds in 2022 to ~12 hours now. Anthropic models post 93.9% on SWE-Bench Verified and 95.5% on CORE-Bench. The nearer-term risk is not movie-villain AI; it is competent subagents quietly changing org design before governance catches up. Track autonomy frontier, not chatbot vibes.

60% by 2028. Your roadmap is not ready.

Ten quick notes on Anthropic co-founder Jack Clark's automated AI R&D warning. The signal is worth stitching together. Most coverage skips the operational implications.

10 notes worth keeping

1. Start with the number

Clark puts a 60% chance on AI systems building their successor models by end of 2028. That is a planning horizon, not science fiction. Roadmaps that do not reflect this probability are roadmaps for the world Clark thinks has a 40% chance of happening.

2. Translate "Rubicon" correctly

AI R&D stops being human-guided and becomes human-optional. A different risk class entirely. The transition is not gradual; it is a phase change. One day the system needs a human to pick the next experiment. The day after, it does not.

3. Track autonomy, not vibes

Most people watch chatbot vibes (the latest hot take on the latest model release). The signal that actually matters is sustained execution: how long an agent can work on a single task without human intervention. Seconds in 2022. ~12 hours now. The leading indicator is when this number crosses into multi-day runs that produce verifiable artifacts.

4. Benchmark creep matters more than hot takes

Anthropic models post 93.9% on SWE-Bench Verified and 95.5% on CORE-Bench. Those are software-engineering and reasoning benchmarks where headroom is shrinking. When a benchmark designed to be hard becomes routine, the underlying capability has moved. Track the benchmark deltas, not the marketing posts.

5. Cap recursive loops

If you are testing agentic workflows, cap max_iterations. The new Outcomes flow can self-grade and iterate; without a cap it will burn credits fast. After Anthropic Agent SDK's June 15, 2026 programmatic-credits switch, that burn shows up on your invoice. max_iterations=5 to 10 is the cheap insurance. Forgetting it is the expensive mistake.

6. Use /goal for long-horizon work

Most people prompt step by step. /goal works better for long tasks because you define the end state, not every move. The agent picks the path. Your job becomes specifying what done looks like, not narrating each step.

7. Do not confuse Agent View with Managed Agents

Agent View is the local workflow UI. Managed Agents is production infrastructure that Anthropic runs for you with managed credits, retries, and observability. They sound similar; they are completely different products. Confusing them is expensive, especially around credit metering. Read the docs for the one you actually have.

8. The leadership-relevant signal

Society and frontier labs are not prepared for an intelligence explosion if the handoff happens fast. Clark says this himself. The prep work is mostly slow institutional change: governance frameworks, board-level autonomy metrics, audit access (see the Mythos restriction post for the closest worked example). Start before the headlines force the conversation.

9. The realistic near-term risk

It is not movie-villain AI. The realistic risk is competent subagents quietly changing org design before governance catches up. An agent that drafts plans, schedules meetings, and writes OKRs is not unsafe in any classical sense. It is just operating faster than the human governance process. Decisions get made before the org chart knows who decided them. That is the governance gap to close first.

10. Add one board metric this quarter

Percent of R&D work that can run without human intervention for 1, 6, and 12 hours. Today this is probably under 20% for the 1-hour band and near zero for the 12-hour band. By end of 2027 those numbers will be materially higher. Boards that track the metric see the curve coming. Boards that wait for the headlines govern in reactive mode.

The bonus tip most people miss

Anthropic's actual control point is not the model. It is orchestration, success criteria, and budget caps. That is where preparedness actually lives. Teams optimising the model selection are optimising the wrong layer. The layer that controls the autonomy frontier is the orchestration layer above the model.

How this shows up on the exam

D1 (Agentic Architecture, 27%) consistently tests sustained-execution patterns: how to structure an agent loop that runs for hours without human intervention while preserving quality. Questions in this family ask about iteration caps, termination conditions, error handling for long-running loops, and recovery from partial failure. The exam reliably rewards architectural answers that bound the loop (max_iterations, explicit success criteria, escalation triggers) over answers that trust the loop (rely on the model to know when to stop). Clark's "Rubicon" framing is the policy-level expression of this engineering pattern.

D5 (Context Management, 15%) tests sustained-execution from the cost and reliability angle. A 12-hour agentic run requires deliberate context budgeting, state checkpointing, and memory consolidation policies. The lost-in-the-middle problem is real at this scale. The exam-correct answer is structural: explicit summarisation checkpoints (the case-facts block pattern), bounded context windows, and policy-driven consolidation between phases. Without these, the long-running agent degrades in quality before it terminates.

Which of these ten was new to you?

The signal worth asking your team about: which one of these you would action this quarter, and which would wait for 2027. The honest answer for most teams is the iteration cap and the board metric should land now; everything else can sequence. The 60% is the number. The work is what comes after the number.

01 · Read next in the pillars

Where this lands in the exam-prep map

Each blog post bridges into the evergreen pillars. These are the most relevant follow-ups for this story.

02 · FAQ

7 questions answered

What is the Rubicon moment Clark is describing?
The Rubicon moment is when AI research and development stops being human-guided and becomes human-optional. Today, a human picks the research question, evaluates the result, and decides what to train next. Past the Rubicon, an AI system does all three on its own, including building its successor. Clark puts 60% odds on this transition completing by end of 2028. That is not science fiction; it is a near-horizon planning input.
What sustained-execution number actually matters?
Sustained agentic execution measures how long an AI agent can work on a single task without human intervention before quality degrades or the loop terminates. In 2022 this was measured in seconds. In 2026 it is reportedly around 12 hours. The leading indicator for the Rubicon is when sustained execution crosses into multi-day runs that produce verifiable artifacts. Track this number, not chatbot vibes.
Why does benchmark creep matter more than hot takes?
Hot takes about model capability are easy to discount. Benchmark numbers are harder to argue with. Anthropic models reportedly post 93.9% on SWE-Bench Verified and 95.5% on CORE-Bench. Those are software-engineering and reasoning benchmarks where headroom is shrinking fast. When a benchmark designed to be hard becomes routine, the underlying capability has moved. The numbers are uncomfortable on purpose.
What is the max_iterations cap and why does it matter?
The `max_iterations` cap sets a ceiling on how many times an agentic loop will recurse before forced termination. Without it, modern Outcomes-flow agents that self-grade and iterate will burn credits fast (and after Anthropic Agent SDK's June 15 2026 programmatic-credits switch, that burn shows up on your invoice). Setting max_iterations=5 to 10 is the cheap insurance policy. Forgetting it is the expensive mistake.
What is the difference between Agent View and Managed Agents?
Agent View is the local workflow UI in Claude Code; the operator sees what their agents are doing. Managed Agents is the production-infrastructure offering; Anthropic runs the agents for you, with managed credits, retries, and observability. They sound similar and they are completely different products. Confusing the two is an expensive mistake, especially around credit metering. Read the docs for whichever one you actually have.
What is the nearer-term risk if the 60% is right?
Not movie-villain AI. The realistic near-term risk is competent subagents quietly changing org design before governance catches up. An agent that drafts a quarterly plan, schedules cross-team meetings, and writes the OKRs is not unsafe in any classical sense. It is just operating faster than the human governance process around it. The organisational result is that decisions get made before the org chart knows who decided them. Track the autonomy frontier inside your own company first; the policy questions cascade out from there.
What is the one metric every board should add this quarter?
Percent of R&D work that can run without human intervention for 1, 6, and 12 hours. Today the answer is probably under 20% for the 1-hour band and near zero for the 12-hour band. By end of 2027 those numbers will be materially higher. Boards that track the metric will see the curve coming; boards that wait for the headlines will be governing in reactive mode. The metric is concrete, measurable, and trivially benchmarked across competitors.

Synthesized from research output on 2026-05-18. LinkedIn cross-post pending.
Last reviewed 2026-05-18.

Blog post · D1 · Blog

The 60% Rubicon: how to prepare for autonomous AI R&D before 2028, complete.

You've covered the full ten-section breakdown for this primitive, definition, mechanics, code, false positives, comparison, decision tree, exam patterns, and FAQ. One technical primitive down on the path to CCA-F.

More platforms →