3D Scene Reconstruction for Autonomous Robot Navigation

This project aims to develop a comprehensive framework for 3D scene reconstruction, enabling effective autonomous robot navigation in dynamic environments. It integrates advanced object detection, depth mapping, and path planning techniques to enhance robot perception and decision-making capabilities.

Abstract

This project constructs a detailed 3D representation of environments using video data, enabling robots to classify and localize obstacles accurately. The system integrates depth mapping, object detection, and optimized path planning for safe and efficient navigation in complex settings.

Dataset

We used the ScanNet sensor dataset, specifically scene0000, containing:

5,578 frames of RGB images
Depth maps
Camera pose information

Objectives

The project is structured around the following objectives:

Depth Estimation: Using MiDAS for accurate depth mapping from RGB images.
3D Scene Reconstruction: Integrating RGB-D data and camera poses.
Object Detection: Leveraging YOLOv8 Nano for real-time object detection.
Instance Segmentation: Using Mobile SAM for segmenting and tracking individual objects.
3D Object Mapping: Projecting objects into the 3D scene for spatial context.
Bird’s-Eye View Generation: Simplifying 3D data into a 2D representation.
Optimal Path Planning: Computing obstacle-free paths in the 3D environment.

Methodology

1. Depth Estimation

Model Used: MiDAS
Outcome: Predicted depth maps compared against true depth values, demonstrating the accuracy of the approach.

2. 3D Scene Reconstruction

Used true depth images for higher accuracy.
Integrated RGB-D data and camera poses into a point cloud and mesh representation.

3. Object Detection

Model Used: YOLOv8 Nano
Classes Detected: Common objects like chairs, tables, sofas, etc.
Techniques: Confidence thresholding and Non-Maximum Suppression.

4. Instance Segmentation

Model Used: Mobile SAM
Generated binary masks aligned with object shapes for use in 3D mapping.

5. 3D Object Mapping

Projected segmented objects into the 3D scene with unique visual indicators.

6. Bird’s-Eye View Generation

Created 2D occupancy grids from 3D point clouds for simplified spatial visualization.

7. Optimal Path Planning

Pathfinding algorithm to compute the optimal path between points of interest.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
3D_reconstruction		3D_reconstruction
annotate_3d_map		annotate_3d_map
data		data
depth_estimation		depth_estimation
optimal_path_finding		optimal_path_finding
pointcloud_to_birdseye_view		pointcloud_to_birdseye_view
results		results
.gitignore		.gitignore
Project Presentation.pdf		Project Presentation.pdf
README.md		README.md
methods.txt		methods.txt
reqirements.txt		reqirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

3D Scene Reconstruction for Autonomous Robot Navigation

Table of Contents

Abstract

Dataset

Objectives

Methodology

1. Depth Estimation

2. 3D Scene Reconstruction

3. Object Detection

4. Instance Segmentation

5. 3D Object Mapping

6. Bird’s-Eye View Generation

7. Optimal Path Planning

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

3D Scene Reconstruction for Autonomous Robot Navigation

Table of Contents

Abstract

Dataset

Objectives

Methodology

1. Depth Estimation

2. 3D Scene Reconstruction

3. Object Detection

4. Instance Segmentation

5. 3D Object Mapping

6. Bird’s-Eye View Generation

7. Optimal Path Planning

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages