RoadBench evaluates MLLMs across six distinct tasks that are fundamental to road network understanding:
- Task 1: BEV Lane Counting - Count lanes from an aerial/satellite perspective with reference line guidance
- Task 2: BEV Lane Designation Recognition - Identify lane purposes from a bird's-eye view with directional annotations
- Task 3: BEV Road Network Correction - Detect and correct road network topology errors
- Task 4: FPV Lane Counting - Determine the number of lanes available for vehicle travel from a first-person view
- Task 5: FPV Lane Designation Recognition - Identify turning directions and lane purposes (straight, left-turn, right-turn, u-turn)
- Task 6: FPV Road Type Classification - Distinguish main roads from service roads
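As a quick reference, the six tasks map onto the `bev_*`/`fpv_*` evaluation scripts listed later in this README (the mapping below is a convenience sketch, not part of the library):

```python
# Task ID -> (task name, evaluation script), script names taken from this README.
TASK_SCRIPTS = {
    1: ("BEV Lane Counting", "bev_lane_counting.py"),
    2: ("BEV Lane Designation Recognition", "bev_lane_designations.py"),
    3: ("BEV Road Network Correction", "bev_roadnet_correction.py"),
    4: ("FPV Lane Counting", "fpv_lane_counting.py"),
    5: ("FPV Lane Designation Recognition", "fpv_lane_designations.py"),
    6: ("FPV Road Type Classification", "fpv_road_type_classification.py"),
}

for task_id, (name, script) in TASK_SCRIPTS.items():
    print(f"Task {task_id}: {name} -> python {script}")
```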
- `roadnetbenchmark/` - Core benchmark library
  - `vlm_client.py` - Universal VLM client supporting OpenAI-compatible APIs
  - `image.py` - Image processing and annotation utilities
  - `metric.py` - Evaluation metrics including F1 scores and geometric distance measures
  - `coord.py` - Coordinate transformation utilities
  - `satellite.py` - Satellite imagery processing
  - `concurrent_jsonl_writer.py` - Efficient concurrent data I/O
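Since `vlm_client.py` targets OpenAI-compatible APIs, the request shape such a client produces can be sketched as follows. This is a standalone illustration of the standard OpenAI chat-completions wire format with an inline image, not the benchmark's actual code; the model name and prompt are placeholders:

```python
import base64
import json

def build_vision_request(image_bytes: bytes, prompt: str, model: str = "gpt-4o") -> dict:
    """Build an OpenAI-compatible chat-completions payload with one inline image.

    The image is sent as a base64 data URL, which is what OpenAI-compatible
    endpoints accept for vision inputs. Model name here is a placeholder.
    """
    b64 = base64.b64encode(image_bytes).decode("ascii")
    return {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {
                        "type": "image_url",
                        "image_url": {"url": f"data:image/png;base64,{b64}"},
                    },
                ],
            }
        ],
    }

# Example: serialize a request for a lane-counting prompt
payload = build_vision_request(b"\x89PNG...", "How many lanes are visible?")
print(json.dumps(payload)[:60])
```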
Each task includes three types of scripts:
- Evaluation Scripts (`*_*.py`) - Run VLM inference on datasets
- Metrics Scripts (`*_*_metrics.py`) - Calculate performance metrics
- Shell Scripts (`*_*.sh`) - Automated execution with various configurations
RoadBench employs multiple sophisticated metrics tailored to different task types:
- Junction Point Distance - Assesses accuracy of road segment termination points
- Fréchet Distance - Evaluates similarity of road line trajectories
- Buffer F1 Score - Measures overlap between predicted and ground truth road geometries
- Weighted F1 Score - Balanced precision and recall weighted by class frequency
- Weighted Precision/Recall - Class-weighted precision and recall metrics
- RMSE - Root Mean Square Error for lane counting regression tasks
- Hamming Loss - Proportion of incorrect label predictions across all lane designations
- Exact Match Ratio (Accuracy) - Percentage of samples where all lane designations are correctly predicted
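The last three metrics have simple closed forms; a minimal sketch of RMSE, Hamming loss, and exact match ratio as they apply to the lane tasks (the designation labels in the example are illustrative):

```python
import math

def rmse(pred_counts, true_counts):
    """Root Mean Square Error over per-sample lane-count predictions."""
    sq_err = sum((p - t) ** 2 for p, t in zip(pred_counts, true_counts))
    return math.sqrt(sq_err / len(true_counts))

def hamming_loss(pred_labels, true_labels):
    """Fraction of individual lane designations predicted incorrectly."""
    wrong = total = 0
    for pred, true in zip(pred_labels, true_labels):
        wrong += sum(p != t for p, t in zip(pred, true))
        total += len(true)
    return wrong / total

def exact_match_ratio(pred_labels, true_labels):
    """Fraction of samples whose lane designations are all correct."""
    return sum(p == t for p, t in zip(pred_labels, true_labels)) / len(true_labels)

# Two samples, each a per-lane designation sequence
true = [["straight", "left-turn"], ["straight", "right-turn", "u-turn"]]
pred = [["straight", "left-turn"], ["straight", "straight", "u-turn"]]
print(rmse([2, 3], [2, 4]))           # one count off by 1 across 2 samples
print(hamming_loss(pred, true))       # → 0.2 (1 wrong designation out of 5)
print(exact_match_ratio(pred, true))  # → 0.5 (1 of 2 samples fully correct)
```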
- Python 3.12 or higher
- UV package manager (recommended) or pip
- Clone the repository
- Install dependencies:
# Using UV (recommended)
uv sync
# Or using pip
pip install -e .
- Set up environment variables:
# Create .env file with your API keys
cp .env.example .env
# Edit .env with your VLM API credentials
Create a .env file with the following variables:
# OpenAI API
OPENAI_API_KEY=your_openai_api_key
# For other VLM providers, add respective API keys
ANTHROPIC_API_KEY=your_anthropic_key
GOOGLE_API_KEY=your_google_key
Run the evaluation script for each task:
python bev_lane_counting.py
python bev_lane_designations.py
python bev_roadnet_correction.py
python fpv_lane_counting.py
python fpv_lane_designations.py
python fpv_road_type_classification.py
After running evaluations, compute performance metrics:
# Calculate BEV lane counting metrics
python bev_lane_counting_metrics.py
# Calculate BEV lane designations metrics
python bev_lane_designations_metrics.py
# Calculate BEV road network correction metrics
python bev_roadnet_correction_metrics.py
# Calculate FPV lane counting metrics
python fpv_lane_counting_metrics.py
# Calculate FPV lane designations metrics
python fpv_lane_designations_metrics.py
# Calculate FPV road type classification metrics
python fpv_road_type_classification_metrics.py
The dataset follows this layout:
data/
├── task_1_2/ # BEV Lane Counting & Designations
│ └── dataset/
│ ├── *.png # BEV road images
│ └── labels.jsonl
├── task_3/ # BEV Road Network Correction
├── task_4_5/ # FPV Lane Tasks
└── task_6/ # FPV Road Type Classification
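Each task directory pairs its images with a `labels.jsonl` file holding one JSON record per line; a minimal loader sketch (the fields inside each record vary by task, so none are assumed here):

```python
import json
from pathlib import Path

def load_labels(dataset_dir: str):
    """Yield one parsed record per non-empty line of labels.jsonl."""
    path = Path(dataset_dir) / "labels.jsonl"
    with path.open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if line:  # tolerate blank lines between records
                yield json.loads(line)

# e.g. records = list(load_labels("data/task_1_2/dataset"))
```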