ObjectDetect4Blind

A multi-threaded pipeline that runs three vision models in parallel on images or video: Depth Estimation (Depth-Anything-V2), Object Detection (YOLO), and Segmentation. Outputs are saved to ./output.

Features

Orchestrates depth, detection, and segmentation together via MAIN.py
Supports image and video inputs
Simple, folder-based model checkpoint placement

Project Layout

Recommended: mirror the folder layout shown here. Otherwise, you will need to update the file and model paths in the code to match your setup.


C:/Python/
├────ObjectDetect4Blind/
│    ├── .vscode
│    ├── MAIN.py                         # Launches the 3 models in parallel threads
│    ├── assets/                         # Example input images
│    ├── output/                         # Outputs produced by MAIN.py
│    ├── Depth-Anything-V2-main/         # Depth estimation module
│    │   ├── app.py
│    │   ├── run.py
│    │   └── run_video.py                # Not developed yet
│    ├── Object detection/               # Object detection module
│    │   └── main.py                     # Usage entrypoint for detection
│    └── Segmentation/                   # Segmentation module
│        └── test_model.py               # Usage entrypoint for segmentation
└────ObjectDetectRequireFile/
     ├── put-in-depth-anything
     ├── put-in-obj-detect                   
     ├── put-in-segment                   
     └── output/
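Because the code depends on this layout, a quick check before the first run can save some debugging. The following is a minimal sketch, not part of the repository: the helper name `missing_paths` and the list `EXPECTED_PATHS` are hypothetical, and the entries are copied from the tree above, so adjust them if your checkout differs.

```python
from pathlib import Path

# Entries copied from the layout tree above (relative to the parent folder).
EXPECTED_PATHS = [
    "ObjectDetect4Blind/MAIN.py",
    "ObjectDetect4Blind/assets",
    "ObjectDetect4Blind/Depth-Anything-V2-main/run.py",
    "ObjectDetect4Blind/Object detection/main.py",
    "ObjectDetect4Blind/Segmentation/test_model.py",
    "ObjectDetectRequireFile",
]


def missing_paths(base_dir, expected=EXPECTED_PATHS):
    """Return the expected entries that do not exist under base_dir."""
    base = Path(base_dir)
    return [rel for rel in expected if not (base / rel).exists()]


if __name__ == "__main__":
    # Assumption: C:/Python is the parent folder, as in the tree above.
    for rel in missing_paths("C:/Python"):
        print(f"missing: {rel}")
```

If nothing is printed, every listed file and folder was found under the parent folder.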

Setup

Model file

https://drive.google.com/file/d/1DwhseV8bqV_qw7CIuWS7pnMRJ9au9lpE/view

Python versions (important)

This repo currently expects two Python interpreters when running MAIN.py:

YOLO / Object detection and Segmentation: Python 3.11
Depth estimation: Python 3.13

MAIN.py starts each model using the interpreter paths you configure.
Edit MAIN.py and replace the hard-coded interpreter/virtual-env paths with the ones on your machine (see comments in the file).

✅ Tip: If you manage multiple Python versions, set up two virtual environments (e.g., with Conda or pyenv) and point MAIN.py to their python executables.
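The idea of starting each model with its own interpreter from a single launcher can be sketched roughly as follows. This is an illustration under assumptions, not the actual contents of MAIN.py: the function names `run_job` and `run_all` are hypothetical, and `sys.executable` stands in for the real Python 3.11 and 3.13 executables, which you would replace with the paths on your machine.

```python
import subprocess
import sys
import threading


def run_job(name, interpreter, args, results):
    """Run one script with the given interpreter and record its exit code."""
    completed = subprocess.run([interpreter, *args])
    results[name] = completed.returncode


def run_all(jobs):
    """Start one thread per job, wait for all of them, return {name: exit code}."""
    results = {}
    threads = [
        threading.Thread(target=run_job, args=(name, interpreter, args, results))
        for name, (interpreter, args) in jobs.items()
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results


if __name__ == "__main__":
    # Placeholder jobs: swap sys.executable for your 3.11 / 3.13 interpreter
    # paths and the "-c" snippets for the real script paths and arguments.
    jobs = {
        "detection": (sys.executable, ["-c", "print('detection placeholder')"]),
        "depth": (sys.executable, ["-c", "print('depth placeholder')"]),
    }
    print(run_all(jobs))
```

Each thread only waits on its child process, so the models themselves run in separate processes and can use different Python versions and package sets.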

Quickstart

A) Run the full pipeline (multithreading)

Open MAIN.py and update the Python paths/env activation commands for:
- Detection, Segmentation (Python 3.11)
- Depth (Python 3.13)

Then, from the project root, run:
- python MAIN.py
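Before launching, it can help to confirm that each configured path really points at the Python version you expect. The sketch below is one way to do that; `interpreter_version` is a hypothetical helper, not something MAIN.py provides, and `sys.executable` is used only so the example runs anywhere.

```python
import subprocess
import sys


def interpreter_version(python_path):
    """Ask the interpreter at python_path for its (major, minor) version."""
    output = subprocess.run(
        [python_path, "-c",
         "import sys; print(sys.version_info[0], sys.version_info[1])"],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    major, minor = output.split()
    return int(major), int(minor)


if __name__ == "__main__":
    # Replace sys.executable with the interpreter paths you set in MAIN.py,
    # then compare the result with (3, 11) or (3, 13) as appropriate.
    print(interpreter_version(sys.executable))
```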

B) Run Depth-Anything-V2 only

Local demo server
- python app.py
Single image (depth estimation only)
- python run.py --encoder vits --precision int8 --img-path "C:\Python\ObjectDetect4Blind\assets\demo01.jpg" --outdir depth_vis --pred-only
Single image (side-by-side input + depth)
- python run.py --encoder vits --precision int8 --img-path "C:\Python\ObjectDetect4Blind\assets\demo01.jpg" --outdir depth_vis
Video
- python run_video.py --encoder vitl --video-path assets/examples_video --outdir video_depth_vis
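To process several images in one go, the single-image command above could be wrapped in a small loop. This is a sketch under assumptions: `build_depth_command` and `run_on_folder` are hypothetical helpers, only the flags already shown in this section are used, the script is assumed to run from the Depth-Anything-V2-main folder, and `sys.executable` should be replaced by your depth interpreter if it differs.

```python
import subprocess
import sys
from pathlib import Path


def build_depth_command(img_path, outdir="depth_vis", pred_only=False):
    """Build the run.py argument list using the flags shown above."""
    cmd = [
        sys.executable, "run.py",
        "--encoder", "vits",
        "--precision", "int8",
        "--img-path", str(img_path),
        "--outdir", outdir,
    ]
    if pred_only:
        cmd.append("--pred-only")  # depth map only, no side-by-side input
    return cmd


def run_on_folder(folder, **kwargs):
    """Call run.py once per .jpg file in folder, in sorted order."""
    for img in sorted(Path(folder).glob("*.jpg")):
        subprocess.run(build_depth_command(img, **kwargs), check=True)


if __name__ == "__main__":
    # Only prints the command; call run_on_folder(<your folder>) to execute.
    print(build_depth_command("demo01.jpg", pred_only=True))
```

Passing the arguments as a list avoids shell quoting problems with Windows paths that contain spaces or backslashes.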
