The State Of LLMs 2025: Progress, Problems, and Predictions

A 2025 review of large language models, from DeepSeek R1 and RLVR to inference-time scaling, benchmarks, architectures, and predictions for 2026.

LLM Research Papers: The 2025 List (July to December)

A curated list of LLM research papers from July–December 2025, organized by reasoning models, inference-time scaling, architectures, training efficiency, and diffusion.

From Random Forests to RLVR: A Short History of ML/AI Hello Worlds

Two years ago, I posted a list of Hello World examples for machine learning and AI on social media. Here, "Hello World" means a beginner-friendly example that showcases a method. I set a biennial calendar ...

From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates

As with DeepSeek V3, the team released their new flagship model over a major US holiday weekend. Given DeepSeek V3.2's strong performance (on par with GPT-5 and Gemini 3.0 Pro), and the fact t...

Recommendations for Getting the Most Out of a Technical Book

This short article compiles a few notes I previously shared when readers asked how to get the most out of my books on building large language models from scratch. I follow a similar approach when I read ...