Skip to content

VBVR-DataFactory/G-160_circle_largest_numerical_value_data-generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

G-160: Circle Largest Numerical Value Data Generator

Generates synthetic datasets for training and evaluating vision models on numerical comparison and maximum identification tasks. Each sample contains multiple numbers where the largest numerical value must be identified and circled.

Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.


📌 Basic Information

Property Value
Task ID G-160
Task Circle Largest Numerical Value
Category Knowledge
Resolution 1024×1024 px
FPS 16 fps
Duration ~5 seconds
Output PNG images + MP4 video

🚀 Usage

Installation

# 1. Clone the repository
git clone https://github.com/VBVR-DataFactory/G-160_circle_largest_numerical_value_data-generator.git
cd G-160_circle_largest_numerical_value_data-generator

# 2. Create and activate virtual environment
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# 3. Install dependencies
pip install --upgrade pip
pip install -r requirements.txt
pip install -e .

Generate Data

# Generate 50 samples
python examples/generate.py --num-samples 50

# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_dataset

# Reproducible generation with seed
python examples/generate.py --num-samples 50 --seed 42

# Without videos (faster)
python examples/generate.py --num-samples 50 --no-videos

Command-Line Options

Argument Description
--num-samples Number of tasks to generate (required)
--output Output directory (default: data/questions)
--seed Random seed for reproducibility
--no-videos Skip video generation (images only)

📖 Task Example

Prompt

The scene shows 5 numbers on a white canvas. First compare the numerical values of all numbers, then draw one red circle around the single largest number. Do not circle any other numbers. Show the complete circling process step by step.

Visual

Initial Frame
6 numbers on canvas
Animation
Largest number circled
Final Frame
Red circle around largest number

📖 Task Description

Objective

Compare numerical values of all displayed numbers and circle the single largest number.

Task Setup

  • Numbers: 6 numbers displayed on canvas
  • Number range: Various numerical values
  • Comparison criterion: Numerical magnitude
  • Task: Read and compare all numbers, identify maximum
  • Marking: Draw one red circle around largest number
  • Constraint: Circle only the largest number
  • Background: White with clear visibility
  • Goal: Circle the single number with highest numerical value

Key Features

  • Numerical value recognition (OCR)
  • Quantitative comparison across multiple numbers
  • Maximum identification among numerical values
  • Tests understanding of numerical ordering
  • Requires reading and comparing magnitudes
  • Single target selection based on maximum value

📦 Data Format

data/questions/circle_largest_numerical_value_task/circle_largest_numerical_value_00000000/
├── first_frame.png      # 6 numbers without marking
├── final_frame.png      # Largest number circled
├── prompt.txt           # Maximum value identification instruction
├── ground_truth.mp4     # Animation of circling process
└── question_metadata.json # Task metadata

File specifications:

  • Images: 1024×1024 PNG format
  • Video: MP4 format, 16 fps
  • Duration: ~5 seconds

🏷️ Tags

visual-reasoning numerical-comparison maximum-identification ocr quantitative-reasoning number-recognition


About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages