Machine Learning

Gemini 3 Deep Think Hits 84.6% on ARC-AGI-2 — But Only 6.5% of Real Research Problems Got Useful Answers

Gemini 3 Deep Think Hits 84.6% on ARC-AGI-2 — But Only 6.5% of Real Research Problems Got Useful Answers - Featured Image

Google’s Gemini 3 Deep Think scored 84.6% on ARC-AGI-2 on February 12, 2026 — a 15.8-point lead over Claude Opus 4.6 and a 31.7-point demolition of GPT-5.2 on the hardest reasoning benchmark in AI. Then Google did something unusual: it…

Read MoreGemini 3 Deep Think Hits 84.6% on ARC-AGI-2 — But Only 6.5% of Real Research Problems Got Useful Answers

AI Is Jevons’ Paradox for Knowledge Work: A Berkeley Study Proves Efficiency Creates More Work, Not Less

AI Is Jevons' Paradox for Knowledge Work: A Berkeley Study Proves Efficiency Creates More Work, Not Less - Featured Image

A randomized controlled trial published by METR in July 2025 tested 16 experienced open-source developers across 246 real tasks. The developers predicted AI tools would save them 24% of their time. After each task, they estimated a 20% speedup. The…

Read MoreAI Is Jevons’ Paradox for Knowledge Work: A Berkeley Study Proves Efficiency Creates More Work, Not Less

Get the Daily Pulse

Sharp AI analysis, daily. Two minutes, every morning.

Get the Daily PulseTwo minutes, every morning