SemiAnalysis
2,456 posts
- The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of
- Congrats to @vllm_project & @lmsysorg for releasing MiniMax M3 428B on both the CUDA & ROCm stack on day 0! MiniMax M3 includes: 🟠 Block sparse attention which is 9x faster prefill over M2.7 🟠 Day 0 open MXFP8 weights 🟠 and Furthermore @inferact released Day-0 EAGLE3 open
- The concept of “80,000 hours” career consulting doesn’t even make sense. If someone wants to have a high-impact life, they would be working more than 80,000 hours, i.e. more than 40 hours a week. They should rename themselves to 160,000 Hours. If you want to have a high-impact
- Replying to @SemiAnalysis_Interestingly, the public market is positioned in the opposite direction, with neocloud names trading like the cycle is about to roll over. Our read, which we lay out in the piece, is that the scarcity is real, the long-dated rental floor is much higher than the equity setup
- Replying to @SemiAnalysis_What we walk through in the article is why this isnt a repeat of the 2023 squeeze. The demand side is no longer training-led, its agentic, and our internal SemiAnalysis usage is one of many examples where token spend has moved from a curiosity to a real line item. When
- Replying to @SemiAnalysis_The index has H100 one-year rentals running from $1.70 per hour per GPU in October 2025 to about $2.35 in March 2026, which is a 40% move off the bottom, with another 15-20% step from late January through February alone. Underneath that, on-demand capacity for the major models is
- Alongside the launch of our H100 1-Click Rental Index, we wrote up what the GPU rental market actually looks like in early 2026, and the headline is that the spot market for compute has gone from "finally cooling off" in October to a hard squeeze again, in roughly five months.
- Pretraining fundamentally does not make sense anymore for anyone other than frontier labs. Although there are a lot of people at enterprises & startups who have "Pretrainitis" to show “impact” and get promotions, fundamentally, it doesn’t make sense. There is probably higher ROI
- GPU Racks hitting 400kW? Legacy data centers wont be able to handle it and the grid WILL get throttled. Radiant's 12 month, dirt to AI production, was made possible by bypassing the grid. Head of Infrastructure, Patrick Wohlschlegel tells @JordanNanos
- Intel Should Raise Capital Intel's woes are behind them. The heavy spending is ahead of them. Why an equity issuance in a hot equity market could make Intel so much better sooner.
- SLOP ALERT: Claude Code UI is complete slop. In the in-app file tree, when u click on a .png, it opens it as a base64-encoded file instead of rendering the image. We’d rather Anthropic not release the desktop app than release an L desktop App. Tons of bugs.













