The AI Image Captioning App is a Flask-based web application that uses a deep learning model to generate intelligent captions for uploaded images. It analyzes the image and provides the best possible description along with a confidence score.
![]() |
![]() |
- ๐ท Upload any JPG/PNG image
- ๐ง Automatically generates a descriptive caption
- ๐ Displays prediction confidence percentage
- ๐จ Smooth, responsive, and aesthetic UI
- ๐ก Built using Flask, Python, and a custom ML model
- HTML5
- CSS3
- Jinja2 (Flask template rendering)
- Python
- Flask
- Pre-trained image captioning model
- Custom module:
caption_model.py
- Python 3.x
- Pip
- Virtualenv (optional but recommended)
- Clone the repository
git clone https://github.com/your-username/AI-Image-Captioning.git cd AI-Image-Captioning
- User uploads an image through the UI
- The image is saved to the /static/uploads/ directory
- The model (in caption_model.py) processes the image
- The generated caption and confidence score are returned and displayed
- ๐ผ๏ธ Support for drag-and-drop image upload
- ๐ง Use more powerful models like BLIP or Vision Transformers
- ๐ Add language translation for captions
- โ๏ธ Deploy on Render or HuggingFace Spaces

