SaFoLab : Security and Safe Foundation Model Systems
Pinned Loading
Repositories
- DoxBench Public
[ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"
SaFo-Lab/DoxBench’s past year of commit activity - dVLM-AD Public
Official Repo for “dVLM-AD: Enhance Diffusion Vision-Language-Model for Driving via Controllable Reasoning”
SaFo-Lab/dVLM-AD’s past year of commit activity - AutoDAN-Turbo Public
[ICLR 2025 Spotlight] The official implementation of our ICLR2025 paper "AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs".
SaFo-Lab/AutoDAN-Turbo’s past year of commit activity - DRIFT Public
[NeurIPS 2025] The official implementation of the paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents".
SaFo-Lab/DRIFT’s past year of commit activity - PRISM Public
PRISM: Robust VLM Alignment with Principled Reasoning for Integrated Safety in Multimodality
SaFo-Lab/PRISM’s past year of commit activity - AGrail4Agent Public
[ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".
SaFo-Lab/AGrail4Agent’s past year of commit activity - llm-armor Public
SaFo-Lab/llm-armor’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…