End-to-end BLIP + LLaVA project for image captioning and VQA with COCO/VQAv2, standard metrics (CIDEr/BLEU/SPICE), and a Gradio demo
-
Updated
Sep 5, 2025 - Python
End-to-end BLIP + LLaVA project for image captioning and VQA with COCO/VQAv2, standard metrics (CIDEr/BLEU/SPICE), and a Gradio demo
🖼️ Enhance image understanding with this project for image captioning and visual question answering using BLIP and LLaVA, complete with reproducible setup and demos.
Add a description, image, and links to the pycocoevalcap topic page so that developers can more easily learn about it.
To associate your repository with the pycocoevalcap topic, visit your repo's landing page and select "manage topics."