Description: Repository for generating and evaluating code for various types of tasks
- text-input
  - generate (single-round / multi-round)
  - edit
  - repair
- image-input
  - generate (single image / multiple images)
  - edit (single image / multiple images)
  - repair (single image / multiple images)
- video-input
  - generate
- LLM-as-a-judge (visual: aesthetics / instruction following)
- agent-as-a-judge (web agent / LLM watching video)
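
As an illustration only, the task list above could be encoded as a small Python data structure so that valid task combinations can be enumerated or checked. This is a minimal sketch, not the repository's actual code: the names `InputModality`, `TaskKind`, `TaskSpec`, and `all_task_specs` are hypothetical, and the `multi` flag is an assumed way to represent "multi-round" (text) or "multiple images" (image).

```python
from dataclasses import dataclass
from enum import Enum


class InputModality(Enum):
    TEXT = "text-input"
    IMAGE = "image-input"
    VIDEO = "video-input"


class TaskKind(Enum):
    GENERATE = "generate"
    EDIT = "edit"
    REPAIR = "repair"


# Task kinds listed under each input modality in the description above.
SUPPORTED = {
    InputModality.TEXT: {TaskKind.GENERATE, TaskKind.EDIT, TaskKind.REPAIR},
    InputModality.IMAGE: {TaskKind.GENERATE, TaskKind.EDIT, TaskKind.REPAIR},
    InputModality.VIDEO: {TaskKind.GENERATE},
}

# Combinations that the description lists with a single/multi variant
# (multi-round for text, multiple images for image input).
HAS_MULTI_VARIANT = {
    (InputModality.TEXT, TaskKind.GENERATE),
    (InputModality.IMAGE, TaskKind.GENERATE),
    (InputModality.IMAGE, TaskKind.EDIT),
    (InputModality.IMAGE, TaskKind.REPAIR),
}


@dataclass(frozen=True)
class TaskSpec:
    """One task configuration (hypothetical structure for illustration)."""

    modality: InputModality
    kind: TaskKind
    multi: bool = False  # assumed flag: multi-round or multiple images

    def __post_init__(self):
        if self.kind not in SUPPORTED[self.modality]:
            raise ValueError(
                f"{self.kind.value} is not listed for {self.modality.value}"
            )
        if self.multi and (self.modality, self.kind) not in HAS_MULTI_VARIANT:
            raise ValueError(
                f"{self.modality.value}/{self.kind.value} has no multi variant"
            )


def all_task_specs():
    """Enumerate every combination that appears in the task list."""
    specs = []
    for modality in InputModality:
        for kind in TaskKind:
            if kind not in SUPPORTED[modality]:
                continue
            specs.append(TaskSpec(modality, kind, multi=False))
            if (modality, kind) in HAS_MULTI_VARIANT:
                specs.append(TaskSpec(modality, kind, multi=True))
    return specs
```

Validating in `__post_init__` means an unsupported combination, such as a video-input edit task, fails when the spec is constructed rather than later during generation or evaluation.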