POSIDA: Prompt-Driven Synthesis Image for Zero-Shot Domain Adaptation

Model Architecture

[Figure: POSIDA architecture]

Key ideas

Prompt-Driven Synthesis → CLIP space: Images synthesized from text prompts are projected into CLIP’s shared image-text embedding space, and the PIN process is then applied to tighten the alignment between text and image features.
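
The two building blocks behind this idea can be sketched in a few lines. This is a minimal illustration only, not the repository's implementation. It assumes that PIN is an AdaIN-style re-normalization of feature statistics, which this README does not spell out. The names `cosine_similarity` and `adain_restyle` are hypothetical, and random arrays stand in for real CLIP features. The optimization loop that would tune the statistics against a text embedding is omitted.

```python
import numpy as np


def cosine_similarity(a, b):
    """Cosine similarity between two 1-D embedding vectors."""
    a = a / np.linalg.norm(a)
    b = b / np.linalg.norm(b)
    return float(a @ b)


def adain_restyle(features, mu, sigma, eps=1e-5):
    """Re-normalize a (C, H, W) feature map to per-channel mean `mu` and std `sigma`.

    Assumption: this AdaIN-style step stands in for the PIN process; in a full
    pipeline `mu` and `sigma` would be optimized, not fixed by hand.
    """
    mean = features.mean(axis=(1, 2), keepdims=True)
    std = features.std(axis=(1, 2), keepdims=True)
    normalized = (features - mean) / (std + eps)
    return sigma[:, None, None] * normalized + mu[:, None, None]


# Placeholder data: random features instead of real CLIP activations.
rng = np.random.default_rng(0)
features = rng.normal(size=(4, 8, 8))
target_mu = np.array([0.5, -1.0, 2.0, 0.0])
target_sigma = np.array([1.0, 0.5, 2.0, 1.5])

restyled = adain_restyle(features, target_mu, target_sigma)
```

In a full pipeline, the restyled features would be passed through the rest of the image encoder, and `cosine_similarity` against the text embedding would serve as the alignment score to maximize.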

Knowledge-based prompts: We incorporate knowledge-guided prompts so the text descriptions are steered toward domain-appropriate semantics, improving adaptation to the target domain.
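
As a rough illustration of how knowledge-guided prompts might be assembled, the sketch below expands a plain domain prompt with domain-specific details. Everything in it is an assumption made for the example: the helper `build_prompts`, the prompt template, and the entries in the `knowledge` dictionary are hypothetical and are not taken from the repository.

```python
def build_prompts(domain, knowledge, base="a photo taken in {domain}"):
    """Return a plain prompt plus one enriched prompt per known domain detail."""
    plain = base.format(domain=domain)
    details = knowledge.get(domain, [])
    return [plain] + [f"{plain}, with {detail}" for detail in details]


# Hypothetical domain knowledge; real entries would come from the authors' source.
knowledge = {
    "snow": ["white ground cover", "reduced visibility"],
    "rain": ["wet surfaces"],
}

prompts = build_prompts("snow", knowledge)
for prompt in prompts:
    print(prompt)
```

A domain with no entry in the dictionary falls back to the plain prompt alone, so the helper still works for unseen targets.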

Results

POSIDA surpasses the baseline on both the Snow and Rain target domains.