Jerry Wei (@JerryWeiAI) / X

300 posts

Jerry Wei

@JerryWeiAI

Aligning AIs at @AnthropicAI ⏰ Past: @GoogleDeepMind, @Stanford, @Google Brain

San Francisco, CA

Joined June 2015

Pinned
Jerry Wei
@JerryWeiAI
Jun 20, 2024
Life update: After ~2 years at @Google Brain/DeepMind, I joined @AnthropicAI! I'm deeply grateful to @quocleix and @yifenglou for taking a chance on me and offering me to join their team before I even finished my undergrad at Stanford. Because of their trust in my potential,
180K
Jerry Wei
@JerryWeiAI
Mar 8, 2023
New @GoogleAI paper: How do language models do in-context learning? arxiv.org/abs/2303.03846 Large language models (GPT-3.5, PaLM) can follow in-context exemplars, even if the labels are flipped or semantically unrelated. This ability wasn’t present in small language models. 1/
322K
Jerry Wei
@JerryWeiAI
Nov 16, 2023
*tries to anonymize my paper* reviewers:
195K
Jerry Wei
@JerryWeiAI
Dec 8, 2023
even though you're gone, you'll always be my brother
Jerry Wei
@JerryWeiAI
Dec 8, 2023
Great to spend time with my brother @_jasonwei!
190K
Jerry Wei
@JerryWeiAI
May 16, 2023
New @GoogleAI+@Stanford paper!📜 Symbol tuning is a simple method that improves in-context learning by emphasizing input–label mappings. It improves robustness to prompts without instructions/relevant labels and boosts performance on algorithmic tasks. arxiv.org/abs/2305.08298
383K
Jerry Wei
@JerryWeiAI
Aug 9, 2023
New @GoogleAI paper! 📜 Language models repeat a user’s opinion, even when that opinion is wrong. This is more prevalent in instruction-tuned and larger models. Finetuning with simple synthetic-data (github.com/google/sycopha…) reduces this behavior. arxiv.org/abs/2308.03958 1/
193K
Jerry Wei
@JerryWeiAI
Sep 3, 2024
One of the most valuable lessons I learned during my time at Google DeepMind was how to not be shy about asking others for help. Many researchers feel that asking others for help makes them look weak and thus may be inclined to try to solve everything themselves. However, I've
76K
Jerry Wei
@JerryWeiAI
Jun 11, 2025
Today marks my one-year anniversary at Anthropic, and I've been reflecting on some of the most impactful lessons I've learned during this incredible journey. One of the most striking realizations has been just how much a small, talent-dense team can accomplish. When I first
43K
Jerry Wei
@JerryWeiAI
Mar 28, 2024
New @GoogleDeepMind+@Stanford paper! 📜 How can we benchmark long-form factuality in language models? We show that LLMs can generate a large dataset and are better annotators than humans, and we use this to rank Gemini, GPT, Claude, and PaLM-2 models. arxiv.org/abs/2403.18802
167K
Jerry Wei
@JerryWeiAI
Dec 12, 2023
Never give up on your research aspirations. About five years ago, I presented a high school science fair project on a simple RNN that could predict political biases in news articles. Since then, I: - published work on AI for medical image analysis - graduated high school - went
65K
Jerry Wei
@JerryWeiAI
Aug 1, 2023
Personal news: I've joined @GoogleDeepMind full-time as a researcher in @quocleix's and @yifenglou's team! I've enjoyed the past eight months as a student researcher at Google Brain/DeepMind, and I'm excited to continue working on large language models and alignment! 😁
86K
Jerry Wei
@JerryWeiAI
Jan 7, 2025
My holiday side quest at @AnthropicAI: How well can Claude play Geoguessr? 🗺️ I had Claude look at 200K+ Street View images and guess the location. The results? Claude-3 models aren't that good, but Claude-3.5 models match or beat the average human! jerrywei.net/blog/claude-pl…
30K
Jerry Wei
@JerryWeiAI
Apr 12, 2024
Fun fact: our paper was put on hold by arxiv for a while because arxiv detected that we used the phrase "time travel," which is a topic that arxiv frequently gets bad submissions for. When we Ctrl-F'd "time travel" in our paper, we had actually just cited a paper called "Time
Aran Komatsuzaki
@arankomatsuzaki
Apr 12, 2024
Google presents Best Practices and Lessons Learned on Synthetic Data for Language Models Provides an overview of synthetic data research, discussing its applications, challenges, and future directions arxiv.org/abs/2404.07503
95K
Jerry Wei
@JerryWeiAI
Jan 25, 2024
Today marks my first year at Google (DeepMind). One year ago today, I joined Google Brain as a student researcher and first started working on large language models. During my time as a student researcher, I investigated how larger language models can do in-context learning
Jerry Wei
@JerryWeiAI
Aug 9, 2023
New @GoogleAI paper! 📜 Language models repeat a user’s opinion, even when that opinion is wrong. This is more prevalent in instruction-tuned and larger models. Finetuning with simple synthetic-data (github.com/google/sycopha…) reduces this behavior. arxiv.org/abs/2308.03958 1/
69K