Michael L. Chen (@miclchen) / X

Michael L. Chen

524 posts

Michael L. Chen

@miclchen

AI governance, prev @METR_Evals @stripe Tweets only represent my personal views

Berkeley

Joined November 2012

Pinned
Michael L. Chen
@miclchen
May 19
We made a chart of 44 documented incidents of AI agents acting against user intent – sometimes subverting routine security and deceptively hiding evidence of their actions.
3.6K
Michael L. Chen
@miclchen
Nov 3, 2025
I've started a PhD program at the University of Oxford, researching AI governance! I'll be doing this program part-time while continuing my role as a Member of Policy Staff at METR in Berkeley. I'm in the Department of Engineering Science, advised by Professors Phil Torr and
26K
Michael L. Chen
@miclchen
Aug 7, 2025
Replying to @GayBearRes
Criminal liability for not being a helicopter parent is crazy. Starting in 4th or 5th grade, I was taking the NYC subway home by myself for an hour multiple times a week.
3.6K
Michael L. Chen
@miclchen
Aug 8, 2025
Actually GPT-5 is WAY more capable than expectations! Best model we'll ever need! Everyone working on AI capabilities should pat themselves on the back and go on indefinite vacation.
2.1K
Michael L. Chen
@miclchen
Oct 24, 2024
From the White House's National Security Memorandum on AI: Automated AI R&D might pose a threat to national security. The U.S. AI Safety Institute will be evaluating AI R&D capabilities pre-deployment.
2.3K
Michael L. Chen
@miclchen
Sep 16, 2024
Replying to @hendrycks @cais and @scale_AI
Can I be a co-author now? /s
2.5K
Michael L. Chen
@miclchen
Mar 4, 2025
Replying to @RishiBommasani @METR_Evals and @scale_AI
1/ METR has never taken money from AI labs 2/ METR has had anywhere from a few days to a few months to evaluate a deployment candidate model, prior to deployment (examples: metr.org/blog/2025-02-2…, metr.github.io/autonomy-evals…) 3/ METR is able to share information to the extent it
4.4K
Michael L. Chen
@miclchen
May 18, 2024
Replying to @KelseyTuoc
How is OpenAI’s statement compatible with Daniel K forfeiting equity? “We have never canceled any current or former employee’s vested equity nor will we if people do not sign a release or nondisparagement agreement when they exit.”
2.1K
Michael L. Chen
@miclchen
Dec 30, 2024
had a great interview with Time about AI benchmarks!
AI Models Are Getting Smarter. New Tests Are Racing to Catch Up
From time.com
970
Michael L. Chen
@miclchen
Nov 14, 2025
Replying to @nathan_w_henry
It should be possible to set up a system to automatically detect the "best of n" strategy, especially if the n papers are submitted by the same authors? Even before LLMs, it wouldn't have been hard to write multiple variations/paraphrases of the same paper.
6.2K
Michael L. Chen
@miclchen
May 6, 2025
GDM adopted @METR_Evals' open-source RE-Bench for Gemini 2.5 Pro ML R&D critical capability evaluations
Anca Dragan
@ancadianadragan
Apr 29, 2025
Per our Frontier Safety Framework, we continue to test our models for critical capabilities. Here’s the updated model card for Gemini 2.5Pro with frontier safety evaluations + explanation of how our safety buffer / alert thresholds approach applies to 2.0, 2.5, and what’s coming.
866
Michael L. Chen
@miclchen
Oct 31, 2025
Replying to @AndyMasley
> Airports seem like the single most likely place you’ll get sick per unit of time spent there no, clubbing is definitely higher risk
1.5K
Michael L. Chen
@miclchen
Oct 13, 2024
Curious about how the Japan AI Safety Institute (ＡＩセーフティ・インスティテュート) is thinking about AI safety evaluation and red-teaming? They've put out two English-language reports: aisi.go.jp/assets/pdf/ai_…, aisi.go.jp/assets/pdf/ai_…
2.4K
Michael L. Chen
@miclchen
Aug 5, 2025
There are a lot of papers + model cards out there related to dangerous capability evals! I've made an Airtable to compile them. Definitely not comprehensive but hope this is useful.
292