Postdoc at Johns Hopkins University. PhD at Chinese University of Hong Kong. BS at Peking University. Previous: USC, Tencent, SenseTime
Pinned Loading
-
open-compass/VLMEvalKit
open-compass/VLMEvalKit PublicOpen-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
-
CUHK-ARISE/PsychoBench
CUHK-ARISE/PsychoBench PublicCode and data for the paper: On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs
-
CUHK-ARISE/EmotionBench
CUHK-ARISE/EmotionBench PublicCode and data for the paper: Apathetic or Empathetic? Evaluating LLMs' Emotional Alignments with Humans
-
CUHK-ARISE/GAMABench
CUHK-ARISE/GAMABench PublicCode and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments
-
CUHK-ARISE/MAS-Resilience
CUHK-ARISE/MAS-Resilience PublicCode and data for the paper: On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

