Generates synthetic tasks where multiple characters (letters, numbers, symbols, and one Chinese character) are displayed. The task is to identify the single Chinese character among them and circle it with a red line.
Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.
| Property | Value |
|---|---|
| Task ID | G-247 |
| Task | Identify Chinese Character |
| Category | Knowledge |
| Resolution | 1024×1024 px |
| FPS | 16 fps |
| Duration | ~3 seconds |
| Output | PNG images + MP4 video |
# Clone the repository
git clone https://github.com/Jiaqi-Gong/Gong_VBVR_Data.git
cd Gong_VBVR_Data/G-247_identify_chinese_character_data-generator
# Install dependencies
pip install -r requirements.txt# Generate 100 samples
python examples/generate.py --num-samples 100
# Generate with specific seed
python examples/generate.py --num-samples 100 --seed 42
# Generate without videos
python examples/generate.py --num-samples 100 --no-videos
# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_output| Argument | Type | Description | Default |
|---|---|---|---|
--num-samples |
int | Number of samples to generate | 100 |
--seed |
int | Random seed for reproducibility | Random |
--output |
str | Output directory | data/questions |
--no-videos |
flag | Skip video generation | False |
Find and circle the Chinese character among the displayed characters. Only one character is Chinese. Draw a red circle around it.
![]() |
![]() |
![]() |
| Initial Frame Mixed characters displayed |
Animation Red circle drawn around Chinese character |
Final Frame Chinese character identified and circled |
Identify the single Chinese character among a group of mixed characters (letters, numbers, symbols) and mark it with a red circle.
- Character Count: 3-8 characters per image (at least one must be Chinese)
- Character Types:
- One Chinese character from a pool of 100 common characters
- Multiple non-Chinese characters (letters, numbers, symbols)
- Character Placement: Random positions with collision detection to prevent overlap
- Font Size: Large, clearly visible characters
- Circle Properties: 5-pixel thick red circle (RGB: 255, 0, 0)
- Layout Seed Range: 0-499 for diverse placement patterns
- Non-overlapping characters: Collision detection ensures all characters are clearly separated
- Single correct answer: Exactly one Chinese character per image
- Diverse character pool: 100 common Chinese characters for variety
- Smooth animation: Circle progressively draws around the target character
- Scalable generation: Supports 10,000+ unique samples with <0.5% repetition rate
- Deduplication: Task signatures ensure unique character combinations and layouts
data/questions/identify_chinese_character_task/identify_chinese_character_00000000/
├── first_frame.png # Initial state (characters without circle)
├── final_frame.png # Goal state (Chinese character circled)
├── prompt.txt # Task instructions
├── ground_truth.mp4 # Solution video (16 fps)
└── question_metadata.json # Task metadata
File specifications: Images are 1024×1024 PNG. Videos are MP4 at 16 fps, approximately 3 seconds long.
character-recognition language-identification visual-search circle-marking chinese-language


