Generates synthetic datasets for training and evaluating vision models on chart analysis and extreme value identification tasks. Each sample contains a chart with visible data where the extreme point (maximum or minimum) must be identified and highlighted.
Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.
| Property | Value |
|---|---|
| Task ID | G-29 |
| Task | Chart Extreme With Data |
| Category | Abstraction |
| Resolution | 1024×1024 px |
| FPS | 16 fps |
| Duration | ~3 seconds |
| Output | PNG images + MP4 video |
# 1. Clone the repository
git clone https://github.com/VBVR-DataFactory/G-29_chart_extreme_with_data_data-generator.git
cd G-29_chart_extreme_with_data_data-generator
# 2. Create and activate virtual environment
python3 -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
# 3. Install dependencies
pip install --upgrade pip
pip install -r requirements.txt
pip install -e .# Generate 50 samples
python examples/generate.py --num-samples 50
# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_dataset
# Reproducible generation with seed
python examples/generate.py --num-samples 50 --seed 42
# Without videos (faster)
python examples/generate.py --num-samples 50 --no-videos| Argument | Description |
|---|---|
--num-samples |
Number of tasks to generate (required) |
--output |
Output directory (default: data/questions) |
--seed |
Random seed for reproducibility |
--no-videos |
Skip video generation (images only) |
The scene shows a bar chart titled 'Monthly Product Sales Statistics 2024' with Month on the x-axis and Sales on the y-axis. Find the month with the lowest sales and draw a red rectangular border around the corresponding bar to highlight it.
![]() |
![]() |
![]() |
| Initial Frame Line chart with data points visible |
Animation Red border appears around extreme point |
Final Frame Highest temperature point highlighted |
Analyze charts with visible data to find extreme values (maximum or minimum) and highlight the corresponding point or bar.
- Chart types: Line charts, bar charts (horizontal/vertical)
- Data visibility: All data points and values are clearly visible
- Axes: Labeled x-axis and y-axis with units
- Title: Descriptive chart title
- Extreme type: Maximum (highest) or minimum (lowest) value
- Highlight method: Red rectangular border around target point/bar
- Background: White with clear chart elements
- Goal: Identify correct extreme point and highlight it
- Visual data analysis with explicit values shown
- Extreme value identification (max/min)
- Multiple chart type support (line, bar)
- Chart reading comprehension (understanding axes, titles, data)
- Red border annotation for visual highlighting
- Tests quantitative reasoning and chart literacy
data/questions/chart_extreme_with_data_task/chart_extreme_with_data_00000000/
├── first_frame.png # Chart without highlighting
├── final_frame.png # Chart with extreme point highlighted
├── prompt.txt # Instruction to find and highlight extreme value
├── ground_truth.mp4 # Animation of highlight appearing
└── question_metadata.json # Task metadata
File specifications:
- Images: 1024×1024 PNG format
- Video: MP4 format, 16 fps
- Duration: ~3 seconds
logic-symbols chart-analysis data-visualization extreme-values quantitative-reasoning visual-highlighting


