AI Summary
5 min readDeepSeek launched preview versions of its V4 Pro and V4 Flash models, positioning them as high-performing, low-cost alternatives to US frontier models. These releases undercut competitors like Anthropic's Claude Sonnet 4 by about five times on token pricing—V4 Pro at $1.74 per million input tokens and $3.48 for output, with V4 Flash even cheaper at 14 cents and 28 cents. The models feature a hybrid attention architecture for better long-conversation memory, a one-million-token context window for handling full codebases or documents, and mixture-of-experts scaling to just 37 billion active parameters per task. DeepSeek claims top coding benchmarks and gains in reasoning and agentic tasks, trailing state-of-the-art by three to six months but deployable on cheaper Huawei Ascend chips. Capacity is limited now due to compute shortages, with pricing expected to drop further later this year; the firm is in fundraising talks with Tencent and Alibaba. This builds on DeepSeek's prior R1 model, which shook markets by matching pricier rivals at lower cost, boosting Chinese chip stocks.
Continue reading the full summary in the app — free to try.
Read Full Summary →Free • No credit card required
What you'll learn
- 1 (00:23) **DeepSeek V4 Pro and V4 Flash Release**
- 2 (05:59) **OpenAI Launches GPT-5.5 and GPT-5.5 Pro**
- 3 (13:05) **Meta Announces 8,000 Layoffs**
- 4 (15:05) **Google Commits Up to $40B to Anthropic**
- 5 (16:48) **Weekend Long Reads**
+ Full timestamped outline available in the app
Show Notes
DeepSeek dropped V4 Pro and V4 Flash, undercutting US labs on price by roughly 5x. OpenAI fired back with GPT-5.5, reclaiming benchmark crowns from Claude. Meta confirms 8,000 layoffs on May 20, and Google plans to invest up to $40B in Anthropic.
- DeepSeek releases its new flagship models V4 Pro and V4 Flash in preview, saying V4 Pro trails the performance of state-of-the-art models by about 3 to 6 months (Bloomberg)
- Simon Willison's comparison chart of DeepSeek V4 pricing vs. US frontier models (Simon Willison)
- OpenAI says "GPT-5.5 matches GPT-5.4 per-token latency in real-world serving", performs "at a much higher level of intelligence", and is more capable for Codex (OpenAI)
- Meta plans to cut 10% of its employees, or ~8,000 jobs, on May 20 and won't fill 6,000 open roles, trying to boost efficiency and offset its heavy AI spending (Bloomberg)
- Google plans to invest up to $40B in Anthropic: $10B now at a $350B valuation, with another $30B if Anthropic hits performance targets (Bloomberg)
Weekend Longreads Suggestions
- A look at the AI nonprofit METR, maker of maybe the most important AI benchmark, whose time-horizon metrics are used by researchers and Wall Street to track AI development (NYT)
- The Infinite Machine Olto e-bike review: a more elegant solution for trips too long to walk but too short to drive (The Verge)
Learn more about your ad choices. Visit megaphone.fm/adchoices
More from this podcast
Tech Brew Ride Home →