Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision-Language Models

intcomp/camouflaged-vlm


📌 Overview

[Figure: Framework overview]

This repository provides the official implementation of our paper:
"Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision-Language Models",
which introduces a cascaded two-stage framework for segmenting and recognizing camouflaged objects in open-vocabulary settings.


📁 Setup

🔹 Dataset

  1. Download the OVCamo dataset from the official repository.
  2. Update the dataset path in the following config file:

./datasets/ovcamo_info/splitted_ovcamo.yaml

🔹 Pretrained Model

Download the SAM ViT-H backbone checkpoint (sam_vit_h_4b8939.pth) from Meta AI and place it in the ./pretrained directory:

./pretrained/sam_vit_h_4b8939.pth
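
Before moving on, it can help to confirm that the files from the setup steps above are where the code expects them. The following is a minimal sketch, not part of the repository: the helper name `find_missing` and the constant `EXPECTED_FILES` are hypothetical, and the script assumes it is run from the repository root. It only checks that the files exist; it does not validate the dataset path written inside the YAML file.

```python
from pathlib import Path

# Paths taken from the setup steps above, relative to the repository root.
EXPECTED_FILES = (
    "datasets/ovcamo_info/splitted_ovcamo.yaml",
    "pretrained/sam_vit_h_4b8939.pth",
)


def find_missing(repo_root, expected=EXPECTED_FILES):
    """Return the expected paths that do not exist as files under repo_root.

    Hypothetical helper for illustration; it is not shipped with the repository.
    """
    root = Path(repo_root)
    return [rel for rel in expected if not (root / rel).is_file()]


if __name__ == "__main__":
    missing = find_missing(".")
    if missing:
        print("Missing files:")
        for rel in missing:
            print("  " + rel)
    else:
        print("Setup files found.")
```

An empty result means both files are present; any path that is printed still needs to be downloaded or moved into place.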


🚀 Demo

[Figure: Demo visualization]

Download our best pre-trained model and save it to:

./best_model_pth/model_epoch_best.pth

Run the demo with:

python demo.py \
  --img-path ./demo_img/scorpionfish.jpg \
  --output-dir ./demo_img \
  --config ./configs/demo.yaml \
  --model ./best_model_pth/model_epoch_best.pth
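
To run the demo on several images, the command above can be wrapped in a small script. This is a sketch under stated assumptions: it assumes `demo.py` processes one image per invocation (as the `--img-path` flag suggests) and is launched from the repository root. The helpers `build_demo_command` and `run_demo_on_folder` are hypothetical names, not part of the repository; only the four flags shown above are used.

```python
import subprocess
import sys
from pathlib import Path


def build_demo_command(img_path,
                       output_dir="./demo_img",
                       config="./configs/demo.yaml",
                       model="./best_model_pth/model_epoch_best.pth"):
    """Build the demo.py argument list using the flags from the README command."""
    return [
        sys.executable, "demo.py",
        "--img-path", str(img_path),
        "--output-dir", str(output_dir),
        "--config", str(config),
        "--model", str(model),
    ]


def run_demo_on_folder(image_dir, output_dir, pattern="*.jpg", dry_run=False):
    """Run the demo once per matching image (hypothetical convenience wrapper).

    The image list is collected before anything runs, so files written
    during this run are not picked up as new inputs.
    """
    images = sorted(Path(image_dir).glob(pattern))
    commands = [build_demo_command(p, output_dir=output_dir) for p in images]
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))  # show the command without executing it
        else:
            subprocess.run(cmd, check=True)  # raise if demo.py exits non-zero
    return commands


if __name__ == "__main__":
    # Dry run: prints one command per .jpg in ./demo_img without executing it.
    run_demo_on_folder("./demo_img", "./demo_out", dry_run=True)
```

Writing results to a separate directory (here the assumed `./demo_out`) keeps a later run from treating earlier outputs as inputs.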
