Michael L. Chen
524 posts
@miclchen
AI governance, prev @METR_Evals @stripe. Tweets only represent my personal views
Berkeley
miclchen.com
Joined November 2012
800 Following
1,062 Followers

  • Pinned
    Michael L. Chen
    @miclchen
    May 19
    We made a chart of 44 documented incidents of AI agents acting against user intent – sometimes subverting routine security and deceptively hiding evidence of their actions.
    Chart with title, "In documented incidents, AI agents have subverted security measures and hidden evidence from users"
    3.6K
  • Michael L. Chen
    @miclchen
    Nov 3, 2025
    I've started a PhD program at the University of Oxford, researching AI governance! I'll be doing this program part-time while continuing my role as a Member of Policy Staff at METR in Berkeley. I'm in the Department of Engineering Science, advised by Professors Phil Torr and
    26K
  • Michael L. Chen
    @miclchen
    Aug 7, 2025
    Replying to @GayBearRes
    Criminal liability for not being a helicopter parent is crazy. Starting in 4th or 5th grade, I was taking the NYC subway home by myself for an hour multiple times a week.
    3.6K
  • Michael L. Chen
    @miclchen
    Aug 8, 2025
    Actually GPT-5 is WAY more capable than expectations! Best model we'll ever need! Everyone working on AI capabilities should pat themselves on the back and go on indefinite vacation.
    2.1K
  • Michael L. Chen
    @miclchen
    Oct 24, 2024
    From the White House's National Security Memorandum on AI: Automated AI R&D might pose a threat to national security. The U.S. AI Safety Institute will be evaluating AI R&D capabilities pre-deployment.
    2.3K
  • Michael L. Chen
    @miclchen
    Sep 16, 2024
    Replying to @hendrycks @cais and @scale_AI
    Can I be a co-author now? /s
    2.5K
  • Michael L. Chen
    @miclchen
    Mar 4, 2025
    Replying to @RishiBommasani @METR_Evals and @scale_AI
    1/ METR has never taken money from AI labs
    2/ METR has had anywhere from a few days to a few months to evaluate a deployment candidate model, prior to deployment (examples: metr.org/blog/2025-02-2…, metr.github.io/autonomy-evals…)
    3/ METR is able to share information to the extent it
    4.4K
  • Michael L. Chen
    @miclchen
    May 18, 2024
    Replying to @KelseyTuoc
    How is OpenAI’s statement compatible with Daniel K forfeiting equity? “We have never canceled any current or former employee’s vested equity nor will we if people do not sign a release or nondisparagement agreement when they exit.”
    2.1K
  • Michael L. Chen
    @miclchen
    Dec 30, 2024
    had a great interview with Time about AI benchmarks!
    AI Models Are Getting Smarter. New Tests Are Racing to Catch Up
    From time.com
    970
  • Michael L. Chen
    @miclchen
    Nov 14, 2025
    Replying to @nathan_w_henry
    It should be possible to set up a system to automatically detect the "best of n" strategy, especially if the n papers are submitted by the same authors? Even before LLMs, it wouldn't have been hard to write multiple variations/paraphrases of the same paper.
    6.2K
  • Michael L. Chen
    @miclchen
    May 6, 2025
    GDM adopted @METR_Evals' open-source RE-Bench for Gemini 2.5 Pro ML R&D critical capability evaluations
    Anca Dragan
    @ancadianadragan
    Apr 29, 2025
    Per our Frontier Safety Framework, we continue to test our models for critical capabilities. Here’s the updated model card for Gemini 2.5Pro with frontier safety evaluations + explanation of how our safety buffer / alert thresholds approach applies to 2.0, 2.5, and what’s coming.
    866
  • Michael L. Chen
    @miclchen
    Oct 31, 2025
    Replying to @AndyMasley
    > Airports seem like the single most likely place you’ll get sick per unit of time spent there
    no, clubbing is definitely higher risk
    1.5K
  • Michael L. Chen
    @miclchen
    Oct 13, 2024
    Curious about how the Japan AI Safety Institute (AIセーフティ・インスティテュート) is thinking about AI safety evaluation and red-teaming? They've put out two English-language reports: aisi.go.jp/assets/pdf/ai_…, aisi.go.jp/assets/pdf/ai_…
    2.4K
  • Michael L. Chen
    @miclchen
    Aug 5, 2025
    There are a lot of papers + model cards out there related to dangerous capability evals! I've made an Airtable to compile them. Definitely not comprehensive but hope this is useful.
    292