Learning Bird's Eye View Scene Graph and Knowledge-Inspired Policy for Embodied Visual Navigation

Paper | Project Page | Video

We propose the BevNav framework, which consists of three parts. (i) We introduce a novel Bird's Eye View (BEV) scene graph (BevSG) that transforms multi-view 2D information into 3D under the supervision of 3D detection to encode scene layouts and geometric clues. The graph can distinguish semantically similar objects seen from different views and supports planning directly on it. (ii) We propose BEV-BLIP contrastive learning, which aligns BEV and language grounding inputs and transfers commonsense knowledge from pre-trained models without additional training in the environments. (iii) We design a BEV-based view search navigation policy, which encourages representations that encode the semantics, relationships, and positional information of objects.
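To make the idea of contrastive alignment in part (ii) concrete, here is a minimal sketch of a generic symmetric InfoNCE (CLIP-style) loss between paired BEV and text embeddings. It is not taken from this repository: the function names, the embedding shapes, and the temperature of 0.07 are assumptions chosen for illustration, and the actual BEV-BLIP objective may differ.

```python
import numpy as np


def l2_normalize(x, eps=1e-8):
    """Scale each row to unit length so dot products become cosine similarities."""
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)


def log_softmax(logits, axis):
    """Numerically stable log-softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def symmetric_contrastive_loss(bev_emb, text_emb, temperature=0.07):
    """Symmetric InfoNCE loss over a batch of paired (BEV, text) embeddings.

    Assumption: row i of bev_emb and row i of text_emb describe the same
    scene; every other row in the batch acts as a negative.
    """
    bev = l2_normalize(bev_emb)
    text = l2_normalize(text_emb)
    logits = bev @ text.T / temperature  # (batch, batch) similarity matrix
    idx = np.arange(logits.shape[0])
    # Matching pairs sit on the diagonal in both directions.
    loss_bev_to_text = -log_softmax(logits, axis=1)[idx, idx].mean()
    loss_text_to_bev = -log_softmax(logits, axis=0)[idx, idx].mean()
    return 0.5 * (loss_bev_to_text + loss_text_to_bev)


# Hypothetical toy data: 4 paired embeddings of dimension 16.
rng = np.random.default_rng(0)
text_emb = rng.normal(size=(4, 16))
aligned_bev = text_emb + 0.01 * rng.normal(size=(4, 16))
shuffled_bev = aligned_bev[[1, 2, 3, 0]]  # break the pairing

aligned_loss = symmetric_contrastive_loss(aligned_bev, text_emb)
shuffled_loss = symmetric_contrastive_loss(shuffled_bev, text_emb)
print(aligned_loss < shuffled_loss)
```

In this toy setup the loss is lower when each BEV embedding lies close to its paired text embedding than when the pairing is shuffled, which is the behavior a contrastive alignment objective is meant to reward.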

Folders and files

BevSG_Nav.py
BevSG_Nav.yml
LICENSE
README.md
agent.py
arial.ttf
co_occur.npy
deberta_predict.npy
deberta_predict_room.npy
matterport_category_mappings.tsv
obj_room.npy
rand_obj.npy
rand_room.npy
scenegraph.py
start.py
start_multiprocess.py
utils_glip.py

zhoukang123/BevSG

Folders and files

Latest commit

History

Repository files navigation

Learning Bird's Eye View Scene Graph and Knowledge-Inspired Policy for Embodied Visual Navigation

Paper | Project Page | Video

Getting Started

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages