A modular RAG-based framework for image retrieval and context-aware generation using visual and textual queries. Combines pretrained encoders, vector search, and generative models. Evaluated on Flickr30k for captioning and retrieval tasks.
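
As a rough illustration of how the three stages named above (encoder, vector search, generator) could fit together, here is a minimal sketch in plain Python. It is not this repository's code: `VectorIndex`, `build_prompt`, `retrieve_and_generate`, and the `generate` callable are hypothetical names, the embeddings are assumed to come from some pretrained encoder that is not shown, and a brute-force cosine search stands in for whatever vector store the project actually uses.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

Vector = List[float]


def normalize(v: Sequence[float]) -> Vector:
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        return [0.0 for _ in v]
    return [x / norm for x in v]


class VectorIndex:
    """Hypothetical brute-force index; a real system would likely use an ANN library."""

    def __init__(self) -> None:
        self._items: List[Tuple[str, Vector]] = []

    def add(self, item_id: str, embedding: Sequence[float]) -> None:
        # Store unit-length embeddings so search only needs dot products.
        self._items.append((item_id, normalize(embedding)))

    def search(self, query: Sequence[float], k: int = 3) -> List[Tuple[str, float]]:
        # Score every stored item against the query and keep the top k.
        q = normalize(query)
        scored = [
            (item_id, sum(a * b for a, b in zip(q, emb)))
            for item_id, emb in self._items
        ]
        scored.sort(key=lambda pair: pair[1], reverse=True)
        return scored[:k]


def build_prompt(query_text: str, retrieved_captions: List[str]) -> str:
    """Assemble retrieved captions into a context block for the generator."""
    context = "\n".join(f"- {caption}" for caption in retrieved_captions)
    return f"Context:\n{context}\n\nQuery: {query_text}"


def retrieve_and_generate(
    query_text: str,
    query_embedding: Sequence[float],
    index: VectorIndex,
    captions: Dict[str, str],
    generate: Callable[[str], str],
    k: int = 2,
) -> str:
    """Retrieve the k nearest items, then pass their captions to the generator."""
    hits = index.search(query_embedding, k=k)
    prompt = build_prompt(query_text, [captions[item_id] for item_id, _ in hits])
    return generate(prompt)
```

In use, each image (or caption) embedding would be added once with `index.add(...)`, and `generate` would wrap the generative model. Keeping the encoder and generator behind plain callables is one way to keep the stages swappable, which is in the spirit of the "modular" description, though the repository may organize this differently.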