Is AI About to “Eat Everything”? | AI Reality Check

May 14, 2026

AI Summary

5 min read

The latest METR update to its AI time horizon chart sparked alarm online, with claims of an intelligence explosion leading to superintelligent AI that will "eat everything." Cal Newport examines the chart closely, explaining its methodology and arguing it shows targeted progress in AI coding tools, not a general leap toward world-altering capabilities.

Decoding the METR Chart

METR measures AI performance on a suite of software tasks, defined by how long they take human programmers to complete under timed conditions. Humans—described as "low context" like new hires—tackle tasks such as fixing bugs in a Python library (about one hour) or exploiting a buffer overflow (over two hours), with results averaged geometrically.

AI systems, combining large language models (LLMs) like Claude Opus 4.6 with "coding harnesses" (scaffolds like Claude Code or Cursor), attempt these tasks six times each. METR plots a model's position based on the longest-duration task it completes successfully at least 50% of the time—at its release date. For instance, Claude Opus 4.6 reaches nearly 12 hours, meaning it handles one specific 12-hour human task half the time. At 80% success, top models like Claude Mythos preview top out around three hours. The chart's upward curve from 2025 accelerates in 2026, but it tracks only programming benchmarks, not general intelligence or any 12-hour human work.

Continue reading the full summary in the app — free to try.

Read Full Summary →

Free • No credit card required

Listen to Audio Summary Open in App

Never miss an episode of Deep Questions with Cal Newport

Get every new episode summarized in your inbox — free, ~5 minutes to read.

No spam. Unsubscribe anytime.

What you'll learn

1 (00:00) **METR Time Horizon Chart Intro** - Cal introduces METR's updated chart showing AI progress on software tasks, notes its scary upward trend post-2025.
2 (00:46) **Viral Reactions to Chart** - Examples of tweets claiming ASI threshold crossed, linking chart to superintelligence conquest.
3 (02:15) **AI Reality Check Setup** - Cal launches episode to demystify chart's meaning amid hype.
4 (02:44) **Chart Methodology: Software Tasks** - METR defines tasks solved via code, benchmarks human completion times using geometric mean.
5 (03:56) **Testing LLMs with Coding Harnesses** - Pairs models with scaffolds (e.g., Claude Code, Cursor) that plan, generate code, verify, and iterate.
6 (05:21) **Plot Explanation: Axes and Dots** - Y-axis: task duration (human time); X-axis: model release date; each dot is a model's max duration.
7 (08:10) **Not General Capabilities** - Measures specific programming tasks, not broad AI power or any 12-hour human work.