Generates synthetic datasets for training and evaluating vision models on pattern recognition and sequence prediction tasks. Each sample contains a color sequence where the next color must be predicted based on the pattern.
Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.
| Property | Value |
|---|---|
| Task ID | G-51 |
| Task | Predict Next Color |
| Category | Abstraction |
| Resolution | 1024×1024 px |
| FPS | 16 fps |
| Duration | ~4 seconds |
| Output | PNG images + MP4 video |
# 1. Clone the repository
git clone https://github.com/VBVR-DataFactory/G-51_predict_next_color_data-generator.git
cd G-51_predict_next_color_data-generator
# 2. Create and activate virtual environment
python3 -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
# 3. Install dependencies
pip install --upgrade pip
pip install -r requirements.txt
pip install -e .# Generate 50 samples
python examples/generate.py --num-samples 50
# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_dataset
# Reproducible generation with seed
python examples/generate.py --num-samples 50 --seed 42
# Without videos (faster)
python examples/generate.py --num-samples 50 --no-videos| Argument | Description |
|---|---|
--num-samples |
Number of tasks to generate (required) |
--output |
Output directory (default: data/questions) |
--seed |
Random seed for reproducibility |
--no-videos |
Skip video generation (images only) |
Predict the next color in the sequence.
![]() |
![]() |
![]() |
| Initial Frame Sequence of colored elements |
Animation Next color revealed in sequence |
Final Frame Complete sequence with predicted color |
Analyze a color sequence to identify the pattern and predict the next color in the series. This generator involves color pattern recognition and prediction tasks, requiring understanding of color sequences and sequential logic.
- Sequence: Series of colored elements following a pattern
- Colors: 30 distinct colors (red, blue, green, yellow, purple, orange, pink, cyan, lime, indigo, teal, amber, rose, violet, emerald, sky, fuchsia, coral, mint, navy, maroon, olive, turquoise, salmon, lavender, crimson, gold, silver, bronze, plum)
- Pattern types: Repetition, alternation, gradual change, etc.
- Task: Identify pattern rules and predict next color
- Elements: Visual representations (squares, circles, etc.)
- Background: White with clear visibility
- Goal: Correctly predict the next color based on sequence logic
- Color sequence pattern recognition
- Inductive reasoning from examples
- Various pattern types (cyclic, alternating, progressive)
- Tests understanding of sequential logic
- Requires identifying underlying color rules
- Symbolic reasoning with visual elements
data/questions/predict_next_color_task/predict_next_color_00000000/
├── first_frame.png # Color sequence without next element
├── final_frame.png # Sequence with predicted color revealed
├── prompt.txt # Color prediction instruction
├── ground_truth.mp4 # Animation of next color appearing
└── question_metadata.json # Task metadata
File specifications:
- Images: 1024×1024 PNG format
- Video: MP4 format, 16 fps
- Duration: ~4 seconds
logic-symbols pattern-recognition sequence-prediction color-reasoning inductive-reasoning sequential-logic


