Skip to content

VBVR-DataFactory/G-12_grid_obtaining_award_data-generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

G-12: Grid Obtaining Award Data Generator

Generates synthetic datasets for training and evaluating vision models on grid navigation and path planning tasks. Each sample contains a grid with a start point, end point, and scattered reward items, requiring the agent to find the shortest path that collects all rewards before reaching the end.

Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.


📌 Basic Information

Property Value
Task ID G-12
Task Grid Obtaining Award
Category Spatiality
Resolution 1024×1024 px
FPS 16 fps
Duration ~6 seconds
Output PNG images + MP4 video

🚀 Usage

Installation

# 1. Clone the repository
git clone https://github.com/VBVR-DataFactory/G-12_grid_obtaining_award_data-generator.git
cd G-12_grid_obtaining_award_data-generator

# 2. Create and activate virtual environment
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# 3. Install dependencies
pip install --upgrade pip
pip install -r requirements.txt
pip install -e .

Generate Data

# Generate 50 samples
python examples/generate.py --num-samples 50

# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_dataset

# Reproducible generation with seed
python examples/generate.py --num-samples 50 --seed 42

# Without videos (faster)
python examples/generate.py --num-samples 50 --no-videos

Command-Line Options

Argument Description
--num-samples Number of tasks to generate (required)
--output Output directory (default: data/questions)
--seed Random seed for reproducibility
--no-videos Skip video generation (images only)

📖 Task Example

Prompt

The scene shows a 10x10 grid with a green start point, a red end point, and 3 triangle reward items scattered across it. A circular agent starts at the green start point and can move to adjacent cells (up, down, left, right). The agent collects a reward by moving to its cell, and once collected, the reward disappears. Find the shortest path that collects all 3 triangle rewards before reaching the red end point.

Visual

Initial Frame
Agent at start, rewards scattered
Animation
Agent collects rewards via shortest path
Final Frame
All rewards collected, agent at end

📖 Task Description

Objective

Navigate a grid to collect all reward items using the shortest path, then reach the end point.

Task Setup

  • Grid: 10×10 grid of cells
  • Start point: Green filled cell (agent starts here)
  • End point: Red filled cell (final destination)
  • Rewards: 3-4 reward items (diamond, circle, triangle, or star shapes) with 30 different colors, scattered across grid
    • Each reward has a unique ID, position, and color
    • Rewards are color-filled (not just outlines)
  • Agent: Orange circular character
  • Movement: Can move up, down, left, right to adjacent cells
  • Background: White grid with black borders
  • Goal: Collect all rewards via shortest path before reaching end

Key Features

  • Rewards disappear when collected (visual feedback synchronized with agent arrival)
  • Path must visit all reward cells before end cell
  • Shortest path optimization required (TSP-like problem)
  • Grid-based movement (no diagonal movement)
  • Clear color coding (green=start, red=end, 30-color palette for rewards, orange=agent)
  • Smooth continuous movement (agent moves pixel-by-pixel between cells, not jumping)
  • Each reward has metadata: ID, position, color, and collection order

📦 Data Format

data/questions/grid_obtaining_award_task/grid_obtaining_award_00000000/
├── first_frame.png      # Initial grid with agent at start
├── final_frame.png      # Agent at end, all rewards collected
├── prompt.txt           # Navigation task instruction
├── ground_truth.mp4     # Animation of optimal path
└── question_metadata.json # Task metadata

File specifications:

  • Images: 1024×1024 PNG format
  • Video: MP4 format, 16 fps, ~6 seconds duration

🏷️ Tags

spatiality path-planning grid-navigation optimization reward-collection


About

This is the data generator for grid: obtaining award task

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages