Skip to content

VBVR-DataFactory/O-5_symbol_deletion_data-generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

O-5: Symbol Deletion Data Generator

Generates synthetic tasks where a sequence of colored geometric symbols is shown with one marked by a red border. The goal is to delete the marked symbol while maintaining the sequential order of all remaining symbols.

Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.


📌 Basic Information

Property Value
Task ID O-5
Task Symbol Deletion
Category Transformation
Resolution 1024×1024 px
FPS 16 fps
Duration ~3.1 seconds
Output PNG images + MP4 video

🚀 Usage

Installation

# Clone the repository
git clone https://github.com/VBVR-DataFactory/O-5_symbol_deletion_data-generator.git
cd O-5_symbol_deletion_data-generator

# Install dependencies
pip install -r requirements.txt

Generate Data

# Generate 100 samples
python examples/generate.py --num-samples 100

# Generate with specific seed
python examples/generate.py --num-samples 100 --seed 42

# Generate without videos
python examples/generate.py --num-samples 100 --no-videos

# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_output

Command-Line Options

Argument Type Description Default
--num-samples int Number of samples to generate 100
--seed int Random seed for reproducibility Random
--output str Output directory data/questions
--no-videos flag Skip video generation False

📖 Task Example

Prompt

The scene shows a horizontal sequence of colored geometric symbols arranged from left to right. Each symbol is a distinct geometric shape with a specific color, and exactly one symbol is marked as the deletion target by a red rectangular border surrounding it. In symbol deletion tasks, the target symbol identified by the red border must be deleted from the sequence while all remaining symbols maintain their original sequential order. First identify the symbol marked with the red border, then delete it from the sequence. The final state must show the remaining symbols in their original order, with the target symbol completely removed. The deletion operation affects only the single marked symbol, leaving all other symbols unchanged in their shapes, colors, and sequential positions.

Visual

Initial Frame
Symbol sequence with red-bordered target
Animation
Target symbol disappears from sequence
Final Frame
Target removed, order preserved

📖 Task Description

Objective

Delete the symbol marked with a red border from the sequence while maintaining the original sequential order of all remaining symbols.

Task Setup

  • Symbol Sequence: Horizontal arrangement of colored geometric symbols
  • Target Marking: Exactly one symbol marked with red rectangular border
  • Deletion: Remove marked symbol completely from sequence
  • Order Preservation: All remaining symbols maintain their original sequential order
  • Position Preservation: Remaining symbols stay at their original positions (no movement)
  • Single Target: Only one symbol is removed per task

Key Features

  • Target identification: Finding the symbol marked with red border
  • Selective deletion: Removing only the marked symbol
  • Sequence preservation: Maintaining order of remaining elements
  • Position preservation: All remaining symbols retain their original positions
  • Unchanged properties: All remaining symbols retain original shape, color, and position
  • Instruction following: Executing precise deletion as directed

📦 Data Format

data/questions/symbol_deletion_task/symbol_deletion_00000000/
├── first_frame.png      # Initial state (sequence with marked target)
├── final_frame.png      # Goal state (target deleted)
├── prompt.txt           # Task instructions
├── ground_truth.mp4     # Solution video (16 fps)
└── question_metadata.json # Task metadata (objects, target_index, etc.)

File specifications: Images are 1024×1024 PNG. Videos are MP4 at 16 fps, approximately 3.1 seconds long.

Metadata Format

The question_metadata.json file contains structured information about each task:

{
  "version": "1.0",
  "task_id": "symbol_worlds_symbol_editing_00000000",
  "generator": "symbol_worlds_symbol_editing",
  "timestamp": "2026-02-15T01:02:32.575886",
  "parameters": {
    "target_index": 2,
    "objects": [
      {
        "symbol": "diamond",
        "center": [377, 512],
        "color": [230, 200, 50]
      },
      {
        "symbol": "square",
        "center": [512, 512],
        "color": [180, 50, 180]
      },
      ...
    ]
  },
  "param_hash": "...",
  "generation": {
    "seed": 42,
    "generator_version": "1.0"
  }
}
  • target_index: Index of the object to be deleted (0-based)
  • objects: Array of all objects in the sequence, each containing:
    • symbol: Shape name (e.g., "circle", "square", "triangle", "star", "diamond", "hexagon")
    • center: Center position [x, y] in pixels
    • color: RGB color [R, G, B]

🏷️ Tags

sequential-reasoning symbol-manipulation deletion target-identification sequence-preservation instruction-following selective-removal


About

Symbol deletion task data generator

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages