CostNav introduces a paradigm shift in how we evaluate navigation systems: from purely technical metrics to actual economic cost and revenue.
Our key contributions are:
- High-Fidelity Physics Simulation with Dynamics for effective Real-World Economic Scenarios.
- Supporting Segway E1 delivery robot, food cargo dynamics with popcorn, detailed collision dynamics, pedestrians
- Real-world referenced Cost-Revenue Model with Break-Even Point Analysis.
- Supporting Energy Cost, Pedestrian Safety Cost, Property Damage Cost, Repair Cost
- Rule-Based and Learning-Based Navigation Evaluation with Multiple IL Baselines
- Comparing Profitability between Nav2 with GPS and Nav2 with AMCL localization
- IL Baselines: ViNT, GNM, NoMaD, NavDP, and CANVAS
You can find more details in our technical report.
The full cost benchmark formula with real world references is available in our google drive: https://drive.google.com/drive/folders/1j1MXm6NMkd6HHBwTJi_nSde7-shv4laX?usp=sharing
Side-by-side comparison of rule-based and learning-based navigation methods in CostNav's urban sidewalk environment.
CostNav's high-fidelity physics simulation enables the modeling of real-world economic scenarios, including critical failures like food spoilage and robot rollovers.
Comparison of existing navigation benchmarks (UnrealZoo, OpenBench, Arena-RosNav, Urban-Sim, DeliveryBench) that focus on task-oriented metrics versus CostNav's integration of physics simulation with comprehensive economic cost modeling.
CostNav's framework linking navigation performance to business value through profit-per-run measurement.
Full documentation is available at worv-ai.github.io/CostNav.
| Guide | Description |
|---|---|
| Quick Reference | Installation, commands, and project structure |
| Assets Setup | Download and configure Omniverse USD assets |
| Isaac Sim Integration | Mission manager, ROS2 topics, and launch.py reference |
| Nav2 Baseline | Rule-based navigation with ROS2 Nav2 |
| IL Baselines | ViNT, NoMaD, GNM, NavDP, and CANVAS |
| Teleoperation | Joystick-based robot control for data collection |
| Evaluation | Unified eval script, metrics, and log output |
| Cost Model | CAPEX, OPEX, revenue, and break-even analysis |
| Contributing | How to contribute, roadmap, and PR guidelines |
Help us build a large-scale, ever-expanding benchmark! We highly encourage contributions via issues and pull requests, especially adding more navigation baselines!
To Cite CostNav, please use the following bibtex citation
@misc{seong2026costnavnavigationbenchmarkrealworld,
title={CostNav: A Navigation Benchmark for Real-World Economic-Cost Evaluation of Physical AI Agents},
author={Haebin Seong and Sungmin Kim and Yongjun Cho and Myunchul Joe and Geunwoo Kim and Yubeen Park and Sunhoo Kim and Yoonshik Kim and Suhwan Choi and Jaeyoon Jung and Jiyong Youn and Jinmyung Kwak and Sunghee Ahn and Jaemin Lee and Younggil Do and Seungyeop Yi and Woojin Cheong and Minhyeok Oh and Minchan Kim and Seongjae Kang and Samwoo Seong and Youngjae Yu and Yunsung Lee},
year={2026},
eprint={2511.20216},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2511.20216},
}





