Hello, I'm Sebastian Raschka, PhD
I am an LLM Research Engineer with over a decade of experience in artificial intelligence. My work bridges academia and industry, including roles as senior engineer at Lightning AI and as a statistics professor at the University of Wisconsin-Madison.
I am also the author of Build a Large Language Model (From Scratch).
My expertise lies in LLM research and the development of high-performance AI systems, with a deep focus on practical, code-driven implementations. (For my most up-to-date CV details, please visit my LinkedIn profile.)
Recent Notes and Blog Entries
The State Of LLMs 2025: Progress, Problems, and Predictions
Dec 30, 2025
A 2025 review of large language models, from DeepSeek R1 and RLVR to inference-time scaling, benchmarks, architectures, and predictions for 2026.
LLM Research Papers: The 2025 List (July to December)
Dec 30, 2025
A curated list of LLM research papers from July–December 2025, organized by reasoning models, inference-time scaling, architectures, training efficiency, and diffusion.
From Random Forests to RLVR: A Short History of ML/AI Hello Worlds
Dec 8, 2025
Two years ago, I posted a list of Hello World examples for machine learning and AI on social. Here, the Hello World means beginner-friendly examples to showcase a method. I set a biennial calendar ...
From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates
Dec 3, 2025
Similar to DeepSeek V3, the team released their new flagship model over a major US holiday weekend. Given DeepSeek V3.2's really good performance (on GPT-5 and Gemini 3.0 Pro) level, and the fact t...
Recommendations for Getting the Most Out of a Technical Book
Nov 12, 2025
This short article compiles a few notes I previously shared when readers ask how to get the most out of my building large language model from scratch books. I follow a similar approach when I read ...