Skip to content

VBVR-DataFactory/G-247_identify_chinese_character_data-generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

G-247: Identify Chinese Character Data Generator

Generates synthetic tasks where multiple characters (letters, numbers, symbols, and one Chinese character) are displayed. The task is to identify the single Chinese character among them and circle it with a red line.

Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.


📌 Basic Information

Property Value
Task ID G-247
Task Identify Chinese Character
Category Knowledge
Resolution 1024×1024 px
FPS 16 fps
Duration ~3 seconds
Output PNG images + MP4 video

🚀 Usage

Installation

# Clone the repository
git clone https://github.com/Jiaqi-Gong/Gong_VBVR_Data.git
cd Gong_VBVR_Data/G-247_identify_chinese_character_data-generator

# Install dependencies
pip install -r requirements.txt

Generate Data

# Generate 100 samples
python examples/generate.py --num-samples 100

# Generate with specific seed
python examples/generate.py --num-samples 100 --seed 42

# Generate without videos
python examples/generate.py --num-samples 100 --no-videos

# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_output

Command-Line Options

Argument Type Description Default
--num-samples int Number of samples to generate 100
--seed int Random seed for reproducibility Random
--output str Output directory data/questions
--no-videos flag Skip video generation False

📖 Task Example

Prompt

Find and circle the Chinese character among the displayed characters. Only one character is Chinese. Draw a red circle around it.

Visual

Initial Frame
Mixed characters displayed
Animation
Red circle drawn around Chinese character
Final Frame
Chinese character identified and circled

📖 Task Description

Objective

Identify the single Chinese character among a group of mixed characters (letters, numbers, symbols) and mark it with a red circle.

Task Setup

  • Character Count: 3-8 characters per image (at least one must be Chinese)
  • Character Types:
    • One Chinese character from a pool of 100 common characters
    • Multiple non-Chinese characters (letters, numbers, symbols)
  • Character Placement: Random positions with collision detection to prevent overlap
  • Font Size: Large, clearly visible characters
  • Circle Properties: 5-pixel thick red circle (RGB: 255, 0, 0)
  • Layout Seed Range: 0-499 for diverse placement patterns

Key Features

  • Non-overlapping characters: Collision detection ensures all characters are clearly separated
  • Single correct answer: Exactly one Chinese character per image
  • Diverse character pool: 100 common Chinese characters for variety
  • Smooth animation: Circle progressively draws around the target character
  • Scalable generation: Supports 10,000+ unique samples with <0.5% repetition rate
  • Deduplication: Task signatures ensure unique character combinations and layouts

📦 Data Format

data/questions/identify_chinese_character_task/identify_chinese_character_00000000/
├── first_frame.png      # Initial state (characters without circle)
├── final_frame.png      # Goal state (Chinese character circled)
├── prompt.txt           # Task instructions
├── ground_truth.mp4     # Solution video (16 fps)
└── question_metadata.json # Task metadata

File specifications: Images are 1024×1024 PNG. Videos are MP4 at 16 fps, approximately 3 seconds long.


🏷️ Tags

character-recognition language-identification visual-search circle-marking chinese-language


About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages