Log inSign up
Jerry Wei
300 posts
user avatar
Jerry Wei
@JerryWeiAI
Aligning AIs at @AnthropicAI ⏰ Past: @GoogleDeepMind, @Stanford, @Google Brain
San Francisco, CA
jerrywei.net
Joined June 2015
490
Following
9,017
Followers
  • Pinned
    user avatar
    Jerry Wei
    @JerryWeiAI
    Jun 20, 2024
    Life update: After ~2 years at @Google Brain/DeepMind, I joined @AnthropicAI! I'm deeply grateful to @quocleix and @yifenglou for taking a chance on me and offering me to join their team before I even finished my undergrad at Stanford. Because of their trust in my potential,
    180K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Mar 8, 2023
    New @GoogleAI paper: How do language models do in-context learning? arxiv.org/abs/2303.03846 Large language models (GPT-3.5, PaLM) can follow in-context exemplars, even if the labels are flipped or semantically unrelated. This ability wasn’t present in small language models. 1/
    322K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Nov 16, 2023
    *tries to anonymize my paper* reviewers:
    195K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Dec 8, 2023
    even though you're gone, you'll always be my brother
    user avatar
    Jerry Wei
    @JerryWeiAI
    Dec 8, 2023
    Great to spend time with my brother @_jasonwei!
    190K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    May 16, 2023
    New @GoogleAI+@Stanford paper!📜 Symbol tuning is a simple method that improves in-context learning by emphasizing input–label mappings. It improves robustness to prompts without instructions/relevant labels and boosts performance on algorithmic tasks. arxiv.org/abs/2305.08298
    383K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Aug 9, 2023
    New @GoogleAI paper! 📜 Language models repeat a user’s opinion, even when that opinion is wrong. This is more prevalent in instruction-tuned and larger models. Finetuning with simple synthetic-data (github.com/google/sycopha…) reduces this behavior. arxiv.org/abs/2308.03958 1/
    193K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Sep 3, 2024
    One of the most valuable lessons I learned during my time at Google DeepMind was how to not be shy about asking others for help. Many researchers feel that asking others for help makes them look weak and thus may be inclined to try to solve everything themselves. However, I've
    76K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Jun 11, 2025
    Today marks my one-year anniversary at Anthropic, and I've been reflecting on some of the most impactful lessons I've learned during this incredible journey. One of the most striking realizations has been just how much a small, talent-dense team can accomplish. When I first
    43K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Mar 28, 2024
    New @GoogleDeepMind+@Stanford paper! 📜 How can we benchmark long-form factuality in language models? We show that LLMs can generate a large dataset and are better annotators than humans, and we use this to rank Gemini, GPT, Claude, and PaLM-2 models. arxiv.org/abs/2403.18802
    167K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Dec 12, 2023
    Never give up on your research aspirations. About five years ago, I presented a high school science fair project on a simple RNN that could predict political biases in news articles. Since then, I: - published work on AI for medical image analysis - graduated high school - went
    65K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Aug 1, 2023
    Personal news: I've joined @GoogleDeepMind full-time as a researcher in @quocleix's and @yifenglou's team! I've enjoyed the past eight months as a student researcher at Google Brain/DeepMind, and I'm excited to continue working on large language models and alignment! 😁
    86K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Jan 7, 2025
    My holiday side quest at @AnthropicAI: How well can Claude play Geoguessr? 🗺️ I had Claude look at 200K+ Street View images and guess the location. The results? Claude-3 models aren't that good, but Claude-3.5 models match or beat the average human! jerrywei.net/blog/claude-pl…
    30K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Apr 12, 2024
    Fun fact: our paper was put on hold by arxiv for a while because arxiv detected that we used the phrase "time travel," which is a topic that arxiv frequently gets bad submissions for. When we Ctrl-F'd "time travel" in our paper, we had actually just cited a paper called "Time
    user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Apr 12, 2024
    Google presents Best Practices and Lessons Learned on Synthetic Data for Language Models Provides an overview of synthetic data research, discussing its applications, challenges, and future directions arxiv.org/abs/2404.07503
    95K
  • user avatar
    Jerry Wei
    @JerryWeiAI
    Jan 25, 2024
    Today marks my first year at Google (DeepMind). One year ago today, I joined Google Brain as a student researcher and first started working on large language models. During my time as a student researcher, I investigated how larger language models can do in-context learning
    user avatar
    Jerry Wei
    @JerryWeiAI
    Aug 9, 2023
    New @GoogleAI paper! 📜 Language models repeat a user’s opinion, even when that opinion is wrong. This is more prevalent in instruction-tuned and larger models. Finetuning with simple synthetic-data (github.com/google/sycopha…) reduces this behavior. arxiv.org/abs/2308.03958 1/
    69K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up