Skip to content

VBVR-DataFactory/G-141_identify_polygon_with_most_sides_data-generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

G-141: Identify Polygon with Most Sides Data Generator

Generates synthetic datasets for training and evaluating vision models on geometric shape analysis and comparison tasks. Each sample contains multiple polygons where the one with the most sides must be identified and circled.

Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.


📌 Basic Information

Property Value
Task ID G-141
Task Identify Polygon with Most Sides
Category Perception
Resolution 1024×1024 px
FPS 16 fps
Duration ~2-3 seconds
Output PNG images + MP4 video

🚀 Usage

Installation

# 1. Clone the repository
git clone https://github.com/VBVR-DataFactory/G-141_identify_polygon_with_most_sides_data-generator.git
cd G-141_identify_polygon_with_most_sides_data-generator

# 2. Create and activate virtual environment
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# 3. Install dependencies
pip install --upgrade pip
pip install -r requirements.txt
pip install -e .

Generate Data

# Generate 50 samples
python examples/generate.py --num-samples 50

# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_dataset

# Reproducible generation with seed
python examples/generate.py --num-samples 50 --seed 42

# Without videos (faster)
python examples/generate.py --num-samples 50 --no-videos

Command-Line Options

Argument Description
--num-samples Number of tasks to generate (required)
--output Output directory (default: data/questions)
--seed Random seed for reproducibility
--no-videos Skip video generation (images only)

📖 Task Example

Prompt

Circle the shape with the most sides among several shapes placed side by side.

Visual

Initial Frame
Multiple polygons side by side
Animation
Polygon with most sides circled
Final Frame
Circle around polygon with most sides

📖 Task Description

Objective

Identify and circle the polygon with the maximum number of sides among multiple shapes.

Task Setup

  • Shapes: Several polygons placed side by side
  • Polygon variety: Different shapes (triangle, square, pentagon, hexagon, etc.)
  • Comparison criterion: Number of sides (edges)
  • Task: Count sides for each polygon, find maximum
  • Marking: Circle the shape with most sides
  • Background: White with clear visibility
  • Goal: Circle the polygon with the highest side count

Key Features

  • Geometric shape side counting
  • Maximum value identification across multiple elements
  • Polygon classification and comparison
  • Tests understanding of geometric properties
  • Requires counting and comparison
  • Shape analysis and reasoning

📦 Data Format

data/questions/identify_polygon_with_most_sides_task/identify_polygon_with_most_sides_00000000/
├── first_frame.png      # Multiple polygons without marking
├── final_frame.png      # Polygon with most sides circled
├── prompt.txt           # Maximum sides identification instruction
├── ground_truth.mp4     # Animation of circling process
└── question_metadata.json # Task metadata

File specifications:

  • Images: 1024×1024 PNG format
  • Video: MP4 format, 16 fps
  • Duration: ~2-3 seconds

🏷️ Tags

visual-reasoning geometric-analysis polygon-classification counting comparison shape-properties


About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages