Skip to content

Latest commit

 

History

History
12 lines (8 loc) · 319 Bytes

File metadata and controls

12 lines (8 loc) · 319 Bytes

vqa

Light weight "Visual Question Answering" - providing "instruct" capabilities on an image.

analyzer.py demonstrates usage.

python analyzer.py https://upload.wikimedia.org/wikipedia/commons/thumb/3/37/Small_USPS_Truck.jpg/640px-Small_USPS_Truck.jpg

mail truck

Runs on 6GB of VRAM (tested on A2000 6GB).