The AI Super Bowl starts early
Anthropic and OpenAI are battling for attention with $8M Super Bowl ads while launching competing AI models.
Anthropic and OpenAI are battling for attention with $8M Super Bowl ads while launching competing AI models.

OpenAI just released GPT-5.3-Codex — the first model the company claims “helped build itself” — and they weren’t subtle about the timing: hours after Anthropic’s Opus 4.6 debut. Both releases landed February 5, each backed by provocative marketing. GPT-5.3-Codex achieved…

Anthropic launched Claude Opus 4.6 today with a 144-point Elo lead over GPT-5.2 on GDPval-AA, 500+ zero-day vulnerability discoveries, and agent teams that coordinate parallel work autonomously. Two days after triggering a $285 billion software stock rout, Anthropic is doubling…

On February 2, 2026, StepFun released Step 3.5 Flash—a sparse mixture-of-experts model that achieves frontier-class reasoning with only 11B active parameters per token. The full model contains 196B parameters, but thanks to intelligent expert routing, it runs on consumer hardware…

259 PRs. 497 commits. 40,000 lines of code. Zero IDE sessions. All in 30 days. All by Claude Code. Those aren’t hypothetical metrics. They’re what Boris Cherny—the engineer who created Claude Code as a side project at Anthropic in September…

Here’s a data point that should make you uncomfortable: experienced developers using AI coding assistants are actually 19% slower than those coding without them. That’s not a typo. A rigorous July 2025 study by METR found that developers with an…

Non-technical users just got access to some of the most powerful AI automation capabilities ever released. Claude Cowork, launched January 12, 2026, brings the agentic workflows developers have been using with Claude Code to everyone—no terminal required. Cowork is what…

Google dropped the Gemini Deep Research API on December 11, 2025—the same day OpenAI released GPT-5.2. While OpenAI was responding to Sam Altman’s early December “code red” memo (issued around December 2nd after Gemini 3’s benchmark dominance), Google quietly opened…

LangGraph hit 23.7k GitHub stars and over 22 million monthly downloads in January 2026—and for good reason. While LangChain handles simple chains and RAG pipelines, LangGraph gives you graph-based workflows with explicit state management, conditional routing, and built-in human-in-the-loop patterns.…