Skip to content

yunlong10/MMPerspective

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

27 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

✨ MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness

🌐 Homepage | πŸ”¬ Paper | πŸ‘©β€πŸ’» Code | πŸ“Š Dataset | πŸ“ˆ Evaluation | πŸ† Leaderboard

What is MMPerspective?

MMPerspective is a comprehensive benchmark designed to systematically evaluate the understanding of perspective geometry by Multimodal Large Language Models (MLLMs). It comprises 10 diverse tasks across three key dimensions: Perspective Perception, Reasoning, and Robustness, with 2,711 real-world and synthetic image instances.

alt text

MMPerspective enables researchers and practitioners to uncover the strengths, limitations, and potential areas for improvement in MLLMs, offering valuable insights into the challenges of understanding perspective geometry.

πŸ† Leaderboard

Link

πŸ“‰ Statistics

alt text

Link

Data Curation Pipeline

alt text

πŸ‘€ Visualization Results

alt text

✏️ Citation

@article{tang2025mmperspective,
  title = {MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness},
  author = {Tang, Yunlong and Liu, Pinxin and Feng, Mingqian and Tan, Zhangyun and Mao, Rui and Huang, Chao and Bi, Jing and Xiao, Yunzhong and Liang, Susan and Hua, Hang and Vosoughi, Ali and Song, Luchuan and Zhang, Zeliang and Xu, Chenliang},
  journal = {arXiv preprint arXiv:2505.20426},
  year = {2025}
}

About

[NeurIPS 2025] A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •