π Paper β’ π Quick Start β’ π Results β’ π Citation
π This paper has been accepted at ICCV 2025! π
We're excited to share our work on geometry problem solving with the community!
Junbo Zhao1β β’ Ting Zhang1β β β’ Jiayu Sun1 β’ Mi Tian2 β’ Hua Huang1β
1Beijing Normal Universityγγ2TAL
β Equal contributionγγβ Corresponding author
We propose Pi-GPS, a novel framework for geometry problem solving that leverages diagrammatic information to resolve textual ambiguities. Our approach combines multi-modal understanding with symbolic reasoning to achieve state-of-the-art performance.
| Component | Description |
|---|---|
| π§ Rectifier | Utilizes multi-modal language models (MLLMs) to disambiguate text by incorporating diagrammatic context |
| β Verifier | Ensures refined text adheres to geometric rules, effectively reducing model hallucinations |
| π§ Symbolic Solver | Combines neural parsing with symbolic reasoning for robust problem solving |
π Pi-GPS achieves state-of-the-art results, outperforming previous neural-symbolic methods by nearly 10% on standard benchmarks!
| Category | Method | Geometry3K | PGPS9K | ||
|---|---|---|---|---|---|
| Completion | Choice | Completion | Choice | ||
| MLLMs | Qwen-VL | 22.1 | 26.7 | 20.1 | 23.2 |
| GPT-4o | 34.8 | 58.6 | 33.3 | 51.0 | |
| Claude 3.5 Sonnet | 32.0 | 56.4 | 27.6 | 45.9 | |
| Gemini 2 | 38.9 | 60.7 | 38.2 | 56.8 | |
| Neural Methods | NGS | 35.3 | 58.8 | 34.1 | 46.1 |
| Geoformer | 36.8 | 59.3 | 35.6 | 47.3 | |
| SCA-GPS | - | 76.7 | - | - | |
| GOLD* | - | 62.7 | - | 60.6 | |
| PGPSNet-v2-S* | 65.2 | 76.4 | 60.3 | 69.2 | |
| LANS (Diagram GT)* | 72.1 | 82.3 | 66.7 | 74.0 | |
| Neural-symbolic Methods | Inter-GPS | 43.4 | 57.5 | - | - |
| GeoDRL | 57.9 | 68.4 | 55.6 | 66.7 | |
| E-GPS | - | 67.9 | - | - | |
| Pi-GPS (ours) | 70.6 | 77.8 | 61.4 | 69.8 | |
# Clone the repository
git clone https://github.com/hellozting/Pi-GPS.git
cd Pi-GPS
# Install dependencies
pip install -r requirements.txtWe use two standard geometry datasets:
| Dataset | Link | Description |
|---|---|---|
| π Geometry3K | Download | 3K geometry problems with diagrams |
| π PGPS9K | Download | 9K plane geometry problems |
Download and extract datasets:
# Extract images for unified processing
python data/extract.pypython Solver/test.py --label finalClick to expand step-by-step instructions
python Parser/text_parser.pyFollow PGDPNet setup instructions.
# Add points to images
python Disambiguation_module/addPointsToImage.py
# Set your MLLM API in Disambiguation_module/align.py
python Disambiguation_module/GuidedAlign.py# Set your LLM API in Theorem_predictor/LLMPredict.py
python Theorem_predictor/LLMPredict.pypython Solver/test.py --label final_result \
--text_logic_form_path Disambiguation_module/disambiguated_text_logic_forms_pred.json \
--diagram_logic_form_path Parser/diagram_parser/PGDP.json \
--predict_path Theorem_predictor/LLM_pred_seq.jsonIf you find our work helpful, please consider citing:
@InProceedings{Zhao_2025_ICCV,
author = {Zhao, Junbo and Zhang, Ting and Sun, Jiayu and Tian, Mi and Huang, Hua},
title = {Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2025},
pages = {1526-1536}
}We welcome contributions! Please feel free to:
- π Report bugs
- π‘ Suggest features
- π Improve documentation
- π§ Submit pull requests
Questions or suggestions?
π§ [email protected]
π Open an Issue
Made with β€οΈ by the Pi-GPS Team
