Warehouse Shelf Classifier API Service

I created this project before starting an internship with Nokia Bell Labs' AIMS (Autonomous Inventory Monitoring Service) team. Using camera-equipped drones, with computer vision segmentation and classification models applied during postprocessing, the team built a monitoring service for warehouses that lets clients keep better track of their inventory and recover items that were lost.

Project Overview

This project implements an end-to-end machine learning pipeline for automated warehouse inventory monitoring. The system trains ResNet and EfficientNet-B0 convolutional neural networks to classify warehouse shelf images as either "empty" or "filled", then deploys the trained model as a REST API service using FastAPI and Docker. The workflow includes data preprocessing with augmentation techniques, model training with class-weighted loss to handle imbalanced datasets, and containerized deployment for real-time inference on new warehouse images.

This project served as preparation for my internship. The sections below give more context on the results, along with directions for training the vision models and deploying them to classify new images of warehouse shelves. Enjoy!

Author

Name: Alexander Romanus

Email: aromanus@gmail.com

Codebase Structure

Model and Training Scripts: Located in the src/ directory.
API Service: Implemented in src/app/main.py.
Model Weights: Should be stored in src/checkpoints/warehouse_classifier.pth after the model is trained in the associated Colab notebook.
Dockerfile: In the project root directory.

Model Architecture

The classification model is based on EfficientNet-B0, a pre-trained convolutional neural network from the torchvision library.

Motivation for Choosing EfficientNet-B0

EfficientNet-B0 offers a good balance between accuracy and computational efficiency, making it suitable for deployment in resource-constrained environments.

Dataset and Data Augmentation

Dataset Overview

Total Images: 1,006 images.
Classes:
- Empty Shelves: 325 images.
- Filled Shelves: 681 images.
Class Imbalance: During EDA, I found that the dataset is imbalanced, with roughly twice as many images of filled shelves as empty ones.

Impressions

Because of the strong imbalance, I trained EfficientNet-B0 with a loss weighted by the class-label proportions.
The images are clear and well-centered, which was beneficial for model training.
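As a rough illustration of the class weighting described above, the sketch below derives one weight per class from the image counts in the Dataset Overview. The inverse-frequency formula, total / (num_classes * count), is an assumption: it is one common choice, and the notebook may compute its weights differently. `class_weights` is a hypothetical helper name.

```python
def class_weights(counts):
    """Return inverse-frequency weights: total / (num_classes * count).

    Assumption: this "balanced" formula is one common choice, not
    necessarily the one used in the training notebook.
    """
    total = sum(counts.values())
    num_classes = len(counts)
    return {label: total / (num_classes * n) for label, n in counts.items()}


# Image counts taken from the Dataset Overview section.
counts = {"empty": 325, "filled": 681}
weights = class_weights(counts)

for label, weight in weights.items():
    print(f"{label}: {weight:.3f}")
```

Under this formula the empty class gets a weight of about 1.55 and the filled class about 0.74, so each class contributes the same total weight to the loss.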

Data Modifications and Augmentation

To improve model generalization and robustness, the following augmentations were applied to the training data:

Normalization
Random Horizontal Flips
Color Jittering:
- Brightness Adjustment
- Contrast Adjustment
- Saturation Adjustment
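To make the list above concrete, here is a dependency-free sketch of what a horizontal flip, a brightness adjustment, and normalization do to pixel values. It is an illustration only: in a PyTorch workflow these steps would more likely come from torchvision.transforms (for example RandomHorizontalFlip, ColorJitter, and Normalize), and whether the notebook uses them is an assumption. The ImageNet mean and standard deviation, the jitter range, and all function names here are also assumptions; contrast and saturation are left out for brevity.

```python
import random

# Assumption: ImageNet channel statistics, which are commonly paired with
# pre-trained torchvision models. The notebook may use other values.
MEAN = (0.485, 0.456, 0.406)
STD = (0.229, 0.224, 0.225)


def horizontal_flip(image):
    """Mirror the image left to right by reversing every row."""
    return [list(reversed(row)) for row in image]


def adjust_brightness(image, factor):
    """Scale every channel by `factor`, clamping to the [0, 1] range."""
    return [[tuple(min(1.0, max(0.0, c * factor)) for c in px) for px in row]
            for row in image]


def normalize(image):
    """Shift and scale each channel by the assumed dataset statistics."""
    return [[tuple((c - m) / s for c, m, s in zip(px, MEAN, STD)) for px in row]
            for row in image]


def augment(image, rng, flip_prob=0.5, brightness=0.2):
    """Apply a random flip and brightness jitter, then normalize."""
    if rng.random() < flip_prob:
        image = horizontal_flip(image)
    factor = rng.uniform(1.0 - brightness, 1.0 + brightness)
    return normalize(adjust_brightness(image, factor))


# A tiny 1x2 RGB "image" with channel values in [0, 1].
image = [[(0.2, 0.4, 0.6), (0.8, 0.6, 0.4)]]
augmented = augment(image, random.Random(0))
```

The random steps belong only in the training pipeline; validation and inference images would normally receive just the normalization.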

Effect of Modifications

Improved Generalization: The augmentations help the model perform better on unseen data by simulating variations.
Reduced Overfitting: The data augmentation increases the effective size of the dataset, which reduces overfitting.
Handled Class Imbalance: The class-weighted loss during training makes the model give more importance to the minority class (empty shelves).
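The effect of a class-weighted loss can be seen in a small worked example. The sketch below implements a per-sample weighted cross-entropy in plain Python and compares the penalty for an equally wrong prediction on each class. The weights are hypothetical values chosen so that the minority class counts twice as much; PyTorch's torch.nn.CrossEntropyLoss accepts a `weight` tensor for the same purpose, though whether the notebook uses it is an assumption.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def weighted_cross_entropy(logits, target, weights):
    """Cross-entropy for one sample, scaled by the weight of its true class."""
    probs = softmax(logits)
    return -weights[target] * math.log(probs[target])


# Class indices: 0 = empty (minority), 1 = filled (majority).
# Hypothetical weights, with the minority class weighted twice as much.
weights = [1.5, 0.75]

# The same degree of error for each class: the true-class logit is lower.
loss_empty = weighted_cross_entropy([0.0, 1.0], target=0, weights=weights)
loss_filled = weighted_cross_entropy([1.0, 0.0], target=1, weights=weights)

print(f"missed empty shelf:  {loss_empty:.3f}")
print(f"missed filled shelf: {loss_filled:.3f}")
```

With these weights, a missed empty shelf costs exactly twice as much as an equally confident miss on a filled shelf, which pushes training to pay more attention to the minority class.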

Model Metrics

The final model is EfficientNet-B0 trained with the class-weighted loss.

Performance Metrics

Training Accuracy: ~99%
Validation Accuracy: 100%

To view training results, go here

Requirements

Docker installed on your system
Access to Google Colab for model training

Setup Instructions

1. Generate Model Weights

IMPORTANT: Before running the application, you must first generate the trained model weights:

Open the Google Colab notebook
Run all cells in the notebook to train the EfficientNet model
The notebook will generate a warehouse_classifier.pth file
Download this file and place it in the src/checkpoints/ directory of this project
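Before moving on to the Docker build, it can help to confirm that the weights file landed in the right place. The following is a minimal sketch of such a check; `checkpoint_ready` is a hypothetical helper and not part of this repository.

```python
from pathlib import Path

# Path taken from the setup steps, relative to the project root.
CHECKPOINT = Path("src/checkpoints/warehouse_classifier.pth")


def checkpoint_ready(project_root="."):
    """Return True if the weights file exists and is not empty."""
    path = Path(project_root) / CHECKPOINT
    return path.is_file() and path.stat().st_size > 0


if __name__ == "__main__":
    if checkpoint_ready():
        print(f"Found {CHECKPOINT}")
    else:
        print(f"Missing {CHECKPOINT}: download it from the Colab notebook first")
```

Run it from the project root so the relative path resolves correctly.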

2. Building the Docker Image

To build the image:

Ensure you are in the root directory of this project
Ensure the warehouse_classifier.pth file is in src/checkpoints/
Run the following command to build the Docker image:

docker build -t warehouse-classifier .

3. Running the Docker Container

To run the Docker container, enter the following command:

docker run -d --name warehouse-classifier -p 8080:8080 warehouse-classifier

If you're running on a device with an NVIDIA GPU and you'd like to run the image with GPU support, use the following command:

docker run -d --name warehouse-classifier --gpus all -p 8080:8080 warehouse-classifier

API Usage

Once the container is running, you can classify warehouse images using curl:

Classify an empty warehouse image:

curl -X POST -F "file=@path/to/your/image.jpg" http://localhost:8080/classify

Example response:

{
  "confidence": 0.5755093097686768,
  "prediction": "empty",
  "probabilities": {
    "empty": 0.5755093097686768,
    "full": 0.42449072003364563
  }
}
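The response above can be consumed programmatically. Here is a small sketch that parses it and flags low-confidence predictions; the `summarize` helper and its 0.8 threshold are hypothetical and not part of the API.

```python
import json

# The example response shown above, as returned by POST /classify.
raw = """
{
  "confidence": 0.5755093097686768,
  "prediction": "empty",
  "probabilities": {
    "empty": 0.5755093097686768,
    "full": 0.42449072003364563
  }
}
"""


def summarize(response_text, threshold=0.8):
    """Return (label, confidence, is_confident) for a /classify response.

    `threshold` is a hypothetical cutoff for flagging uncertain predictions.
    """
    data = json.loads(response_text)
    label = data["prediction"]
    confidence = data["confidence"]
    return label, confidence, confidence >= threshold


label, confidence, is_confident = summarize(raw)
print(f"{label} ({confidence:.1%}), confident: {is_confident}")
```

In this example the confidence equals the largest entry in "probabilities", and the two probabilities sum to approximately 1. Note that the response labels the non-empty class "full", whereas the rest of this README calls it "filled".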

Available endpoints:

GET /author - Returns author information
POST /classify - Classifies uploaded warehouse images
GET /health - Health check endpoint
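For callers who prefer Python to curl, the sketch below sends the same multipart upload to POST /classify using only the standard library. The form field name "file" and port 8080 come from the commands above; the helper names `build_multipart` and `classify` are hypothetical, and the code assumes the service returns JSON as in the example response.

```python
import json
import mimetypes
import urllib.request
import uuid
from pathlib import Path

BASE_URL = "http://localhost:8080"  # port published by `docker run -p 8080:8080`


def build_multipart(field, filename, payload):
    """Encode one file as multipart/form-data; return (body, content_type)."""
    boundary = uuid.uuid4().hex
    mime = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        f"Content-Type: {mime}\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def classify(image_path, base_url=BASE_URL, timeout=30):
    """POST an image to /classify and return the decoded JSON response."""
    path = Path(image_path)
    body, content_type = build_multipart("file", path.name, path.read_bytes())
    request = urllib.request.Request(
        f"{base_url}/classify",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

With the container running, `classify("path/to/your/image.jpg")` should return a dictionary with the same keys as the example response.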

Troubleshooting

If you encounter issues:

Ensure the warehouse_classifier.pth file exists in src/checkpoints/
Check Docker container logs: docker logs warehouse-classifier
Verify the container is running: docker ps
