ByteDance ran AI agents for 38,000 hours across 134 real-world tasks, from gravitational-wave detection to CPU design.
One clean mathematical curve predicted their progress almost perfectly across five frontier models.
Agents with memory beat fresh restarts by a wide margin,
The place to track, rank, and understand the entire AI industry in real time. Used by 300,000+ developers.





