PhD applicant for Fall '26
Interested in LLMs & AI4Science
Highlights
- Pro
Pinned Loading
-
Cross-Lingual-Pitfalls
Cross-Lingual-Pitfalls Public[ACL 2025] Cross-Lingual Pitfalls: Automatic Probing Cross-Lingual Weakness of Multilingual Large Language Models
Python 42
-
SocialMaze
SocialMaze PublicSocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Python 4
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
