FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation


FIFA

This is the official release accompanying our paper, FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation. FIFA will also be available as a pip package.

If you find FIFA useful, please cite:

```bibtex
@misc{jing2025fifaunifiedfaithfulnessevaluation,
      title={FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation},
      author={Liqiang Jing and Viet Lai and Seunghyun Yoon and Trung Bui and Xinya Du},
      year={2025},
      eprint={2507.06523},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2507.06523},
}
```

Process

*(Figure: the FIFA evaluation process.)*

Install

1. Clone the repo:

```bash
git clone https://github.com/du-nlp-lab/FIFA.git
cd FIFA/fifa
```

2. Install the requirements:

- pytorch
- transformers
- openai
- tqdm
- any requirements for your LLM / multimodal LLM

3. See the evaluation example in `fifa/evaluate.py`.

Step 1: Define your LLM

- To use an OpenAI LLM:

```python
from llm_api import OpenaiLLM
llm = OpenaiLLM(model_name="gpt-4o", api_key="your-api-key", NUM_SECONDS_TO_SLEEP=10)
```

- To use Qwen:

```python
from llm_api import Qwen3LLM
llm = Qwen3LLM(model_name="Qwen/Qwen3-32B")
```

- To use another LLM, implement your own subclass of `BaseLLM` in `llm_api.py`.
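As a rough illustration of what such a subclass might look like, here is a minimal sketch. `BaseLLM`'s real interface lives in `llm_api.py` and the method name used below is an assumption, so a stand-in base class is defined to keep the sketch self-contained; check `llm_api.py` for the actual signatures before adapting it.

```python
# Hypothetical sketch: plugging a custom LLM into FIFA.
# The stand-in BaseLLM below only mimics what llm_api.BaseLLM is assumed
# to look like; replace it with the real base class from llm_api.py.

class BaseLLM:  # stand-in for llm_api.BaseLLM (assumed interface)
    def generate(self, prompt: str) -> str:
        raise NotImplementedError


class MyLocalLLM(BaseLLM):
    """Wraps an arbitrary local model behind the interface FIFA expects."""

    def __init__(self, model_name: str):
        self.model_name = model_name  # load your model/tokenizer here

    def generate(self, prompt: str) -> str:
        # Replace with a real inference call; a canned reply for illustration.
        return f"[{self.model_name}] answer to: {prompt}"


llm = MyLocalLLM("my-local-model")
```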

Step 2: Define your VideoQA model

- To use InternVL2.5:

```python
from video_qa_internvl import InternVLGenerator
vqamodel = InternVLGenerator(model_name="OpenGVLab/InternVL2_5-8B")
```

- To use Qwen2.5-VL:

```python
from video_qa_qwen import Qwen25VLGenerator
vqamodel = Qwen25VLGenerator(model_name="Qwen/Qwen2.5-VL-32B-Instruct")
```

- To use another Video LLM, implement your own subclass of `BaseVQAGenerator` in `videoqa.py`.
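Analogously, a custom VideoQA model would wrap video-question answering behind the base class. The method name and argument order below are assumptions (a stand-in base class is used so the sketch runs on its own); consult `videoqa.py` for the real `BaseVQAGenerator` interface.

```python
# Hypothetical sketch: a custom VideoQA model for FIFA.
# The stand-in BaseVQAGenerator only mimics what videoqa.BaseVQAGenerator
# is assumed to look like; replace it with the real base class.

class BaseVQAGenerator:  # stand-in for videoqa.BaseVQAGenerator (assumed)
    def generate(self, video_path: str, question: str) -> str:
        raise NotImplementedError


class MyVQAModel(BaseVQAGenerator):
    """Answers a question about a video; plug a real Video LLM in here."""

    def __init__(self, model_name: str):
        self.model_name = model_name  # load weights/processor here

    def generate(self, video_path: str, question: str) -> str:
        # Replace with real frame sampling + inference; canned reply here.
        return f"yes ({self.model_name} on {video_path}: {question})"


vqamodel = MyVQAModel("my-video-llm")
```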

Step 3: Run your evaluation

- Example for Text2Video:

```python
from eval_text2video import eval_text2video

# Data format: one dict per example.
data = [
    {"prompt": "text", "video_path": "xx.mp4"},
    {"prompt": "text", "video_path": "xx.mp4"},
]
eval_text2video(data, save=False, n_parallel_workers=1, cache_dir="./results", llm=llm, vqamodel=vqamodel)
```

- Example for Video2Text:

```python
from eval_video2text import eval_video2text

# Data format: one dict per example.
data = [
    {"prompt": "model response", "video_path": "xx.mp4", "question": "question"},
    {"prompt": "model response", "video_path": "xx.mp4", "question": "question"},
]
eval_video2text(data, save=False, n_parallel_workers=1, cache_dir="./results", llm=llm, vqamodel=vqamodel)
```
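In practice you will usually assemble the data list programmatically rather than by hand. A minimal sketch for the Text2Video case, assuming a flat directory of `.mp4` files and a `{file-stem: prompt}` mapping (the helper name and layout are illustrative, not part of the FIFA API):

```python
# Hypothetical helper for building the Text2Video data list.
from pathlib import Path


def build_text2video_data(video_dir: str, prompts: dict) -> list:
    """Pair each .mp4 under video_dir with its prompt, keyed by file stem."""
    data = []
    for video in sorted(Path(video_dir).glob("*.mp4")):
        if video.stem in prompts:
            data.append({"prompt": prompts[video.stem], "video_path": str(video)})
    return data
```

The resulting list can be passed straight to `eval_text2video` as shown above.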

Running FIFA via the pip Package

TODO

Running FIFA from the Command Line

You can also run the evaluation from the command line.

TODO

Data

Annotation Data

The data is provided as a JSON file.
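As a rough sketch, an entry might look like the following, reusing the field names from the `evaluate.py` examples above (the exact annotation schema may differ from this assumption):

```json
[
  {
    "prompt": "model response",
    "video_path": "xx.mp4",
    "question": "question"
  }
]
```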

You can download our annotation dataset.

Automatic Evaluation Benchmarks

You can download our automatic evaluation benchmarks.

Leaderboard

TODO
