TRAP: Targeted Redirecting of Agentic Preferences

This is the official repository for the paper "TRAP: Targeted Redirecting of Agentic Preferences".

Authors: Hangoo Kang*, Jehyeok Yeon*, Gagandeep Singh

Overview

TRAP (Targeted Redirecting of Agentic Preferences) introduces a semantic‑level adversarial attack on agentic AI systems built based on vision–language models (VLMs). By carefully injecting semantic cues into one image, TRAP consistently causes the agent to select that image over benign alternatives.
Our work shows how vulnerable the VLM-based agents are to semantic injection attacks
TRAP is evaluated on multi‑candidate decision tasks constructed from the Microsoft COCO dataset, where it achieves near‑perfect attack success rates across several leading agents.

Quick overview of TRAP's 4-step framework

Environment Setups

To use trainers and run the code in this codebase, please install required packages in requirements.txt file.

pip install -r requirements.txt

Launch TRAP framework

To run the TRAP framework, you can simply use the command below

python trap_framework.py

Citation

If you find our project helpful, please consider citing our paper:

@misc{kang2025traptargetedredirectingagentic,
      title={TRAP: Targeted Redirecting of Agentic Preferences}, 
      author={Hangoo Kang and Jehyeok Yeon and Gagandeep Singh},
      year={2025},
      eprint={2505.23518},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2505.23518}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
LICENSE		LICENSE
README.md		README.md
diagram.png		diagram.png
requirements.txt		requirements.txt
trap_framework.py		trap_framework.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TRAP: Targeted Redirecting of Agentic Preferences

Overview

Environment Setups

Launch TRAP framework

Citation

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

hgKang02/TRAP

Folders and files

Latest commit

History

Repository files navigation

TRAP: Targeted Redirecting of Agentic Preferences

Overview

Environment Setups

Launch TRAP framework

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages