This repository contains both the research and the project implementation of an Image Captioning System that received an Honourable Mention at the 30th Congress of Scientific Initiation (UnB) and the 21st Congress of the Federal District, Brazil. The project combines natural language processing (NLP) and computer vision techniques to generate descriptive captions for images. It explores the latest methodologies in the field, such as Convolutional Neural Networks (CNNs) for image feature extraction and Recurrent Neural Networks (RNNs), particularly Long Short-Term Memory networks (LSTMs), for text generation.
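As an illustration of this encoder-decoder pattern, a minimal PyTorch sketch is shown below. The class names, the ResNet-50 backbone, and the layer sizes are assumptions made for exposition and do not necessarily match the models implemented in this repository.

```python
import torch
import torch.nn as nn
from torchvision import models

class EncoderCNN(nn.Module):
    """Extract a fixed-size feature vector from an image with a pretrained CNN."""
    def __init__(self, embed_size: int):
        super().__init__()
        resnet = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
        # Drop the classification head; keep the convolutional feature extractor.
        self.backbone = nn.Sequential(*list(resnet.children())[:-1])
        self.fc = nn.Linear(resnet.fc.in_features, embed_size)

    def forward(self, images: torch.Tensor) -> torch.Tensor:
        with torch.no_grad():  # keep the pretrained backbone frozen
            features = self.backbone(images).flatten(1)
        return self.fc(features)

class DecoderRNN(nn.Module):
    """Generate a caption token-by-token with an LSTM conditioned on image features."""
    def __init__(self, embed_size: int, hidden_size: int, vocab_size: int):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_size)
        self.lstm = nn.LSTM(embed_size, hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, vocab_size)

    def forward(self, features: torch.Tensor, captions: torch.Tensor) -> torch.Tensor:
        # Prepend the image feature as the first "token" of the caption sequence.
        embeddings = torch.cat([features.unsqueeze(1), self.embed(captions)], dim=1)
        hidden, _ = self.lstm(embeddings)
        return self.fc(hidden)
```

In this sketch the encoder output is fed to the decoder as the first step of the sequence, which is the classic "Show and Tell" formulation of image captioning.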
The image captioning system is divided into two main parts:
- Research: The research section includes a detailed literature review and experiments exploring various image captioning models, architectures, and techniques.
- Project: The implementation part focuses on building a working prototype using state-of-the-art deep learning models.
To run the project locally, follow these steps:

- Clone the repository:

  ```bash
  git clone https://github.com/loioladev/cnpq-caption-ia.git
  cd cnpq-caption-ia
  ```

- Create a virtual environment and install the dependencies:

  ```bash
  python -m venv venv
  source venv/bin/activate
  ./requirements.sh
  ```

- Download and prepare the datasets before training the models.
The project is divided into two main parts: YOLO Object Detection and LSTM Text Generation. Each part has its own set of scripts and notebooks for training and evaluation.
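For reference, a minimal detection example using the Ultralytics YOLO API is sketched below. The checkpoint name and image path are placeholders, and the repository's own scripts may load and configure the model differently.

```python
from ultralytics import YOLO

# Illustrative usage only; the repository's training and evaluation scripts may differ.
model = YOLO("yolov8n.pt")              # load a pretrained checkpoint (placeholder name)
results = model("path/to/image.jpg")    # run inference on a single image

for result in results:
    for box in result.boxes:
        class_name = model.names[int(box.cls)]
        confidence = float(box.conf)
        print(f"{class_name}: {confidence:.2f}")
```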
The YOLO datasets are used to train the object detection model. They are available at the following links:
- COCO Dataset
- Pascal VOC Dataset
- Open Images Dataset
- ImageNet Dataset
- LVIS Dataset
- Exclusively Dark Dataset
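As a sketch of the preparation step, the snippet below converts COCO-style bounding-box annotations into the per-image `.txt` label files that YOLO expects (one `class x_center y_center width height` line per box, with coordinates normalised to the image size). The function name and paths are illustrative; the repository may ship its own conversion scripts.

```python
import json
from pathlib import Path

def coco_to_yolo_labels(annotation_file: str, output_dir: str) -> None:
    """Convert COCO bounding-box annotations into YOLO .txt label files (sketch)."""
    coco = json.loads(Path(annotation_file).read_text())
    images = {img["id"]: img for img in coco["images"]}
    # Remap COCO category ids (not contiguous) to 0-based class indices.
    class_index = {cat["id"]: i for i, cat in enumerate(coco["categories"])}

    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)

    for ann in coco["annotations"]:
        img = images[ann["image_id"]]
        x, y, w, h = ann["bbox"]  # COCO: top-left corner plus width/height, in pixels
        # YOLO: normalised centre coordinates plus normalised width/height.
        line = (
            f"{class_index[ann['category_id']]} "
            f"{(x + w / 2) / img['width']:.6f} {(y + h / 2) / img['height']:.6f} "
            f"{w / img['width']:.6f} {h / img['height']:.6f}\n"
        )
        label_file = out / (Path(img["file_name"]).stem + ".txt")
        with label_file.open("a") as f:
            f.write(line)
```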
The LSTM datasets are used to train the text generation model. They are available at the following links:
This project is licensed under the MIT License. See the LICENSE file for details.
