Stairway to Success: An Online Floor-Aware Zero-Shot Object-Goal Navigation Framework via LLM-Driven Coarse-to-Fine Exploration
Zeying Gong1,
Rong Li1,
Tianshuai Hu2,
Ronghe Qiu1,
Lingdong Kong3,
Lingfeng Zhang4,
Guoyang Zhao1,
Yiyi Ding1,
Junwei Liang1,2,✉
1 The Hong Kong University of Science and Technology (Guangzhou).
2 The Hong Kong University of Science and Technology
3 National University of Singapore
4 Tsinghua University
- ✅ Complete Installation and Usage documentation
- ✅ Add datasets download documentation
- ✅ Release the main algorithm of ASCENT
- ❌ Release the code of real-world deployment
Assuming you have conda installed, let's prepare a conda env:
conda_env_name=ascent_nav
conda create -n $conda_env_name python=3.9 cmake=3.14.0
conda activate $conda_env_name
Install proper version of torch:
pip install torch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu118
conda install habitat-sim=0.3.1 withbullet headless -c conda-forge -c aihabitat
If you encounter network problems, you can manually download the conda package from this link, and install it locally via:
conda install --use-local /path/to/xxx.tar.bz2
In theory, any version >= 0.2.4 is compatible, but it is best to keep habitat-lab and habitat-sim at the same version. Here we use version 0.3.1.
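The same-version requirement above can be checked with a tiny helper before launching anything; a minimal sketch (the helper name is ours, not part of the codebase):

```python
def versions_match(sim_version: str, lab_version: str) -> bool:
    """Return True when habitat-sim and habitat-lab report the same release,
    e.g. '0.3.1' == '0.3.1'. Local build suffixes after '+' are ignored."""
    return sim_version.strip().split("+")[0] == lab_version.strip().split("+")[0]

# The versions used in this repo:
print(versions_match("0.3.1", "0.3.1"))  # True
```

You could feed it `habitat_sim.__version__` and the habitat-lab version after installation.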
git clone --recurse-submodules https://github.com/Zeying-Gong/ascent.git
cd ascent/third_party/habitat-lab
git checkout v0.3.1
pip install -e habitat-lab
pip install -e habitat-baselines
cd ../..
Following GroundingDINO's instructions:
export CUDA_HOME=/path/to/cuda-11.8 # replace with actual path
cd third_party/GroundingDINO
pip install -e . --no-build-isolation --no-dependencies
cd ../..
Following MobileSAM's instructions:
cd third_party/MobileSAM
pip install -e .
cd ../..
pip install -r requirements.txt
Pin the transformers version:
pip install transformers==4.37.0

Download the required model weights and save them to the pretrained_weights/ directory:
| Model | Filename | Download Link |
|---|---|---|
| Places365 | resnet50_places365.pth.tar | Download |
| MobileSAM | mobile_sam.pt | GitHub |
| GroundingDINO | groundingdino_swint_ogc.pth | GitHub |
| D-FINE | dfine_x_obj2coco.pth | GitHub |
| RedNet | rednet_semmap_mp3d_40.pth | Google Drive |
| RAM++ | ram_plus_swin_large_14m.pth | HuggingFace |
Download the checkpoints from HuggingFace or ModelScope, and put them in pretrained_weights/.
The PointNav weight is directly from VLFM, located in third_party/vlfm/data/pointnav_weights.pth.
- Locate Weights: The pretrained_weights/ directory should look like this:
pretrained_weights
├── mobile_sam.pt
├── groundingdino_swint_ogc.pth
├── dfine_x_obj2coco.pth
├── ram_plus_swin_large_14m.pth
├── rednet_semmap_mp3d_40.pth
├── resnet50_places365.pth.tar
└── Qwen2.5-7b
├── model-00001-of-00005.safetensors
└── ...
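To catch a missing download before a long evaluation run, you can verify this layout with a short script. This is a convenience sketch, not part of the codebase; the filename list mirrors the table above:

```python
from pathlib import Path

# Expected checkpoint files from the table above.
EXPECTED = [
    "mobile_sam.pt",
    "groundingdino_swint_ogc.pth",
    "dfine_x_obj2coco.pth",
    "ram_plus_swin_large_14m.pth",
    "rednet_semmap_mp3d_40.pth",
    "resnet50_places365.pth.tar",
]

def missing_weights(root: str = "pretrained_weights") -> list:
    """Return the expected checkpoint files not found under `root`."""
    base = Path(root)
    return [name for name in EXPECTED if not (base / name).exists()]

if __name__ == "__main__":
    gone = missing_weights()
    print("all weights present" if not gone else f"missing: {gone}")
```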
- Download Scene & Episode Datasets: Follow the instructions for HM3D and MP3D in Habitat-lab's Datasets.md.
- Locate Datasets: The file structure should look like this:
data
└── datasets
├── objectnav
│ ├── hm3d
│ │ └── v1
│ │ └── val
│ │ ├── content
│ │ └── val.json.gz
│ └── mp3d
│ └── v1
│ └── val
│ ├── content
│ └── val.json.gz
└── scene_datasets
├── hm3d
│ └── ...
└── mp3d
└── ...
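As a quick sanity check on a downloaded split, you can count the episodes it contains. This assumes the standard Habitat episode format, where each `.json.gz` file holds a JSON object with an `episodes` list (episodes may be split across the `content/` directory):

```python
import gzip
import json
from pathlib import Path

def count_episodes(dataset_dir: str) -> int:
    """Count episodes across every .json.gz file under an ObjectNav split,
    assuming each file stores a JSON object with an 'episodes' list."""
    total = 0
    for f in Path(dataset_dir).rglob("*.json.gz"):
        with gzip.open(f, "rt") as fh:
            total += len(json.load(fh).get("episodes", []))
    return total
```

For example, `count_episodes("data/datasets/objectnav/hm3d/v1/val")` should return a non-zero number if the download succeeded.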
Run VLM servers
./scripts/launch_vlm_servers_ascent.sh
It will open a tmux session in a separate terminal.
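Because the servers start in the background, they may take a moment to come up. A small polling helper can gate the evaluation launch; the host and port below are placeholders (check the launch script for the actual ports):

```python
import socket
import time

def wait_for_port(host: str, port: int, timeout: float = 30.0) -> bool:
    """Poll until a TCP port accepts connections, e.g. a VLM server.

    Returns True once a connection succeeds, False if `timeout` elapses.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with socket.create_connection((host, port), timeout=1.0):
                return True
        except OSError:
            time.sleep(0.5)
    return False
```

Usage: `wait_for_port("127.0.0.1", 12345)` before starting the evaluation, with the port taken from the script.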
Open another terminal and run the evaluation on the HM3D dataset:
python -u -m ascent.run --config-name=eval_ascent_hm3d.yaml
Or run the evaluation on the MP3D dataset:
python -u -m ascent.run --config-name=eval_ascent_mp3d.yaml
- This is a refactored version of the original codebase with improved code organization and structure.
- Due to the inherent randomness in object detection (GroundingDINO, D-FINE) and LLM inference (Qwen2.5), evaluation results may vary slightly from the paper's reported metrics.
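To make runs as repeatable as possible, you can seed the common RNG sources before evaluation. This is a generic sketch, not a hook the codebase provides, and it cannot remove all variance (detector and LLM sampling, plus GPU nondeterminism, may still differ between runs):

```python
import os
import random

def seed_everything(seed: int = 42) -> None:
    """Seed Python, NumPy, and (if installed) torch RNGs.

    Note: this only reduces variance; sampling inside the detectors and
    the LLM can still make results differ slightly between runs.
    """
    random.seed(seed)
    os.environ["PYTHONHASHSEED"] = str(seed)
    try:
        import numpy as np
        np.random.seed(seed)
    except ImportError:
        pass
    try:
        import torch
        torch.manual_seed(seed)
        torch.cuda.manual_seed_all(seed)
    except ImportError:
        pass
```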
If you use ASCENT in your research, please use the following BibTeX entry.
@article{gong2025stairway,
title={Stairway to Success: Zero-Shot Floor-Aware Object-Goal Navigation via LLM-Driven Coarse-to-Fine Exploration},
author={Gong, Zeying and Li, Rong and Hu, Tianshuai and Qiu, Ronghe and Kong, Lingdong and Zhang, Lingfeng and Ding, Yiyi and Zhang, Leying and Liang, Junwei},
journal={arXiv preprint arXiv:2505.23019},
year={2025}
}
We would like to thank the following repositories for their contributions:
