Verizon Real-Time Accessories Classification

This is a collaboration project with Verizon made possible by Break Through Tech AI.

The main goal is to build a Real Time Vision Algorithm that is aimed to enhance enhancing the retail shopping experience by identifying accessories in a retail store, improving customer experience and reducing wait times and staff dependency for compatibility checks. By leveraging state-of-the-art machine learning techniques, this solution ensures accurate recommendations and efficient image processing, streamlining the shopping process for both customers and retailers. Our project is currenly only able to identify phone cases using 3 robust models:

An object detection model for phone case design recognition
A segmentation model for camera cutout recognition
A segmentation model for phone detection

Features

Real-Time Object Detection: Utilizes YOLOv8l for precise detection of phone case designs in real-time.
Efficient Image Preprocessing: Incorporates convex hulls and binary masks to reduce noise and improve feature extraction for accurate model input calibration.
High-Fidelity Object Segmentation: Utilizes two YOLOv11-seg models — one for phone camera recognition and another for phone segmentation - achieving high segmentation precision.
Optimized Data Pipeline: Built a robust pipeline using Ultralytics YOLO and PyTorch to process and analyze images efficiently.

Technologies Used

Python 3.9 and later
OpenCV for real-time image processing and data preparation
PyTorch as a main framework the models are based on
Ultralytics YOLOv8l and YOLOv11-seg for real-time object detection, segmentation and prediction
Google Colab as a main collaborative environment for this project
Makesense.ai for data annotation

Dataset Contents and Sources

Phone Case Detection

The dataset cosists of 266 hand-picked images containing phones at vaious angles with 25 unique case designs, each of the images annotated by hand using makesense.ai.

Sources

All of the case design examples have been gathered from the internet, mainly the Verizon Store Website, and Amazon as well various case manufacturers' websites (for examples with different models of the same design that are absent from the aforementioned sources).

Format Example

Each image and label follows this convention:

DesignName_PhoneModel_ImageNumber.png/.jpg/.txt

DesignName stands for the name of the case design in the image. It is written without any spaces or underscores.
PhoneModel stands for model of the phone in the image. It is written without any spaces or underscores.
ImageNumber stands for the sequential number for multiple instances of the same design and phone model combination.

Each image has a .txt file that contains a YOLO format label. For more information, please see YOLO Label Documentation. Here's an example of what it can look like:

12 0.496733 0.498562 0.982945 0.993289

The numbers respresent the following, in that order:

Class ID
The x-coordinate of the bounding box center
The y-coordinate of the boundting box center
Width of the bounding box
Height of the bounding box

File Structure Example

Here's a visualization of the file structure within root, assuming files are sorted in alphabetical order:

/root
├── dataset
│   ├── AfternoonInVersailles_GalaxyS23Ultra_1.png // Case design image
│   ├── AfternoonInVersailles_GalaxyS23Ultra_1.txt // YOLO-format label
│   ├── BowPosiePink_iPhone15ProMax_4.png
│   └── BowPosiePink_iPhone15ProMax_4.txt

...

Phone Model Detection

We utilized two models: Camera Cutout Segmentation Model for camera recognition and Phone Segmentation Model for phone detection.
YOLOv11-seg was used for achieving high segmentation precision.
The models were used jointly for accurate phone type prediction.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
AI Studio Verizon _ Final Project Presentation.pdf		AI Studio Verizon _ Final Project Presentation.pdf
DataPreparation.ipynb		DataPreparation.ipynb
README.md		README.md
Tuning.ipynb		Tuning.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Verizon Real-Time Accessories Classification

Table of Contents

Features

Technologies Used

Dataset Contents and Sources

Phone Case Detection

Sources

Format Example

File Structure Example

Phone Model Detection

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Verizon Real-Time Accessories Classification

Table of Contents

Features

Technologies Used

Dataset Contents and Sources

Phone Case Detection

Sources

Format Example

File Structure Example

Phone Model Detection

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages