Dwarkesh and Ilya Sutskever on What Comes After Scaling

December 15, 2025

AI Summary

5 min read

🎙️ The Voices & The Context

The Format: This long-form interview podcast dives into AI's core challenges and future paths through probing questions and thoughtful responses, blending technical depth with philosophical speculation on AGI's trajectory. Technical and reflective.
The Format: An interview between host Dwarkesh Patel and guest Ilya Sutskever.
The Key Players:
- Guest: Ilya Sutskever – Co-founder of Safe Superintelligence Inc. (SSI), pioneering AI researcher behind AlexNet, key contributor to GPT series and OpenAI's early breakthroughs, known for exceptional research intuition.

🗝️ Key Themes & Topics

Deep technical dissection of AI limitations, scaling laws' limits, human-AI learning gaps, and paths to safe superintelligence amid economic diffusion.

Topic 1: Benchmark vs. Real-World Gap – Models excel on evals like coding competitions but falter in practical tasks (e.g., bug-fixing loops), due to RL overfitting to benchmarks and poor generalization.
Topic 2: Scaling Eras & Generalization – Shift from pre-training "scaling" (2012-2020 research age, 2020-2025 scaling age) to RL-heavy compute; humans generalize better via efficient priors, emotions as value functions, and continual learning, not massive data.
Topic 3: Continual Learning & Superintelligence – AGI as a learner like a "super-efficient 15-year-old" deploye

Continue reading the full summary in the app — free to try.

Read Full Summary →

Free • No credit card required

Listen to Audio Summary Open in App

Never miss an episode of a16z Show

Get every new episode summarized in your inbox — free, ~5 minutes to read.

No spam. Unsubscribe anytime.

What you'll learn

1 (00:00) **🎙️ Introduction: Ilya Sutskever**
2 (01:10) **AI Takeoff Feels Normal Despite Sci-Fi Scale**
3 (02:39) **Eval Smarts vs. Economic Impact Gap**
4 (06:10) **Overcoming Narrow RL: Broader Environments or Better Generalization?**
5 (09:11) **Pre-Training Strengths & Limits**
6 (10:49) **Human Learning Analogies: Childhood, Evolution**
7 (14:31) **Value Functions: RL Efficiency Booster**

+ Full timestamped outline available in the app

Show Notes

AI models feel smarter than their real-world impact. They ace benchmarks, yet still struggle with reliability, strange bugs, and shallow generalization. Why is there such a gap between what they can do on paper and in practice

In this episode from The Dwarkesh Podcast, Dwarkesh talks with Ilya Sutskever, cofounder of SSI and former OpenAI chief scientist, about what is actually blocking progress toward AGI. They explore why RL and pretraining scale so differently, why models outperform on evals but underperform in real use, and why human style generalization remains far ahead.

Ilya also discusses value functions, emotions as a built-in reward system, the limits of pretraining, continual learning, superintelligence, and what an AI driven economy could look like.

Resources:

Transcript: https://www.dwarkesh.com/p/ilya-sutsk...

Apple Podcasts: https://podcasts.apple.com/us/podcast...

Spotify: https://open.spotify.com/episode/7naO...

Stay Updated:

If you enjoyed this episode, be sure to like, subscribe, and share with your friends!

Find a16z on X: https://x.com/a16z

Find a16z on LinkedIn: https://www.linkedin.com/company/a16z

Listen to the a16z Podcast on Spotify: https://open.spotify.com/show/5bC65RDvs3oxnLyqqvkUYX

Listen to the a16z Podcast on Apple Podcasts: https://podcasts.apple.com/us/podcast/a16z-podcast/id842818711

Follow our host: https://x.com/eriktorenberg

Please note that the content here is for informational purposes only; should NOT be taken as legal, business,