Skip to content

dev-pa5an/perceptiveAI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Object depth estimation of color Images using a depth camera and an image semantic segmentation model in Computer Vision with minimum latency.

The key twist is using a specific type of neural network called a "semantic segmentation network" to improve the accuracy of the distance estimation. This network learns to categorize different parts of the image (like walls, furniture, people) and uses that information to estimate the average depth to those objects. This method is useful for a robot to enhance its perception of the environment while it is moving, getting depth estimations for particular objects in the environment. When the environment is dynamic and complex, indoor navigation tasks can also utilize this method.

Features

  • YOLOV8 Image Semantic Segmentation Model

    This model is trained using a custom dataset of over 400 images. Ultralytics yolov8's semantic segmentation model parameters are used to train the model. The model itself reached up to 100% confidence level for the validation dataset but only 93% in practice.

    ps: You can use your own semantic segmentation model in this application.

  • Microsoft KinectV2 with python

    Microsoft KinectV2 has both RGB and depth camera. The modified version of pykinect2 library is used to recieve both RGB and depth frames simultaneously using a python program. Please use the modified version of pykinect2 otherwise the program will not run as exspected. Modified version of pykinect2 can be found here. (https://github.com/dev-pa5an/pykinect.git)

  • Depth Estimation Algorithm

    When mapping the pixel coordinates of the image frame to the depth frame, the frame size difference results in the inclusion of depth values from areas outside the identified object in the depth array. This error can be clearly identified in the following figure.

Run Locally

  git clone https://github.com/dev-pa5an/perceptiveAI
  python perceptiveEye.py

About

Emphasizes the use of depth data in AI projects

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published