This repository implements a lightweight adaptive scaling framework that corrects scale ambiguity in monocular depth estimation. The method learns to predict an image-specific scaling factor that, when applied to relative depth predictions (e.g., from Depth Anything V2), significantly improves metric depth accuracy, all without extra sensors or modifications to the base depth model.
```bash
git clone --recursive https://github.com/tae-h-yang/adaptive-depth-estimation.git
cd adaptive-depth-estimation
```

Be sure to source the setup script before running anything:

```bash
source setup.sh
```

This installs the required dependencies and sets up environment variables.
Download the NYU Depth V2 dataset using these instructions, and place the extracted files into the datasets/ directory:
Place the pretrained Depth Anything V2 checkpoint in the checkpoints/ directory:
This project introduces a two-stage solution to improve monocular metric depth prediction:
- Offline Optimization: Compute the optimal per-image scale by minimizing the Wasserstein distance between predicted and ground-truth depth distributions.
- Online Correction: Train a lightweight CNN to predict a log-scale correction factor directly from an RGB image.
The predicted scaling factor is applied to the base model's output at inference time, yielding improved metric accuracy in diverse scenes.
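The two stages can be sketched as follows. This is a minimal illustration, not the repository's actual code: the function names (`optimal_scale`, `apply_correction`) and the log-space grid search are assumptions; the repository may solve the offline problem differently.

```python
import numpy as np
from scipy.stats import wasserstein_distance

def optimal_scale(pred_depth, gt_depth, num_candidates=201):
    """Offline stage: grid-search the per-image scale that minimizes the
    Wasserstein distance between scaled predictions and ground truth.
    `pred_depth` and `gt_depth` are flat arrays of valid depth values."""
    # Initialize from the median ratio, then search in log-space around it.
    init = np.median(gt_depth) / np.median(pred_depth)
    log_scales = np.log(init) + np.linspace(-1.0, 1.0, num_candidates)
    dists = [wasserstein_distance(np.exp(s) * pred_depth, gt_depth)
             for s in log_scales]
    return float(np.exp(log_scales[int(np.argmin(dists))]))

def apply_correction(pred_depth, log_scale):
    """Online stage: apply the CNN-predicted log-scale correction factor
    to the base model's relative depth output."""
    return np.exp(log_scale) * pred_depth
```

In the full pipeline, the offline scales serve as regression targets for the lightweight CNN; at inference only `apply_correction` runs, so the overhead over the base model is a single forward pass of the small network.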
```
adaptive_depth/      # Core implementation of the adaptive scaling model
Depth-Anything-V2/   # Submodule for Depth Anything V2
datasets/            # NYU Depth V2 dataset (user-supplied)
checkpoints/         # Pretrained model weights
figures/             # Visualizations and plots
setup.sh             # Setup script for environment
```
This repository is open-sourced under the MIT License. See LICENSE for details.