Skip to content

๐Ÿ‘จโ€๐ŸซThis project was built under the mentorship of Mr. Lokesh Sir as part of the AI & ML Capstone Project. It combines computer vision and deep learning for practical image understanding.

Notifications You must be signed in to change notification settings

deepakshroff/Image-captioning-app

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

5 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ–ผ๏ธ AI Image Captioning App

The AI Image Captioning App is a Flask-based web application that uses a deep learning model to generate intelligent captions for uploaded images. It analyzes the image and provides the best possible description along with a confidence score.


๐Ÿ” Screenshots


๐Ÿš€ Features

  • ๐Ÿ“ท Upload any JPG/PNG image
  • ๐Ÿง  Automatically generates a descriptive caption
  • ๐Ÿ“Š Displays prediction confidence percentage
  • ๐ŸŽจ Smooth, responsive, and aesthetic UI
  • ๐Ÿ’ก Built using Flask, Python, and a custom ML model

๐Ÿงช Tech Stack

๐Ÿ’ป Frontend

  • HTML5
  • CSS3
  • Jinja2 (Flask template rendering)

๐Ÿง  Backend

  • Python
  • Flask
  • Pre-trained image captioning model
  • Custom module: caption_model.py

๐Ÿ’ก How to Run Locally

๐Ÿ›  Prerequisites

  • Python 3.x
  • Pip
  • Virtualenv (optional but recommended)

โš™๏ธ Steps

  1. Clone the repository
    git clone https://github.com/your-username/AI-Image-Captioning.git
    cd AI-Image-Captioning
    

๐Ÿ” How It Works

  • User uploads an image through the UI
  • The image is saved to the /static/uploads/ directory
  • The model (in caption_model.py) processes the image
  • The generated caption and confidence score are returned and displayed

๐Ÿ”ฎ Future Enhancements

  • ๐Ÿ–ผ๏ธ Support for drag-and-drop image upload
  • ๐Ÿง  Use more powerful models like BLIP or Vision Transformers
  • ๐ŸŒ Add language translation for captions
  • โ˜๏ธ Deploy on Render or HuggingFace Spaces

About

๐Ÿ‘จโ€๐ŸซThis project was built under the mentorship of Mr. Lokesh Sir as part of the AI & ML Capstone Project. It combines computer vision and deep learning for practical image understanding.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published