Awesome AI Security

Curated resources for securing AI/ML systems across threat modeling, adversarial ML, LLM security, governance, MLSecOps, and benchmarks.

Read this in other languages: 中文

Contributions welcome! See CONTRIBUTING for details.

ASB Open Source Ecosystem
Quick Start
1. Threat Modeling & Frameworks
2. Adversarial Machine Learning
3. LLM & GenAI Security
4. Privacy, Safety & Governance
5. MLSecOps, MLOps & Supply Chain Security
6. Datasets & Benchmarks
7. Learning Resources
9. Related Awesome Lists
Contributing
Project Status & Roadmap
License

?? ASB Open Source Ecosystem

ASB Security Schema - Unified JSON schema for capturing AI security telemetry and sharing alerts across systems.
asb-secure-gateway - Reference gateway enforcing ASB schema with OPA policies for AI-native traffic.

Quick Start

Map the threat landscape: skim Section 1 before picking tools or controls.
Experiment with adversarial tooling: start with the libraries in Section 2 to understand attacker capability.
Secure LLM applications: apply the guidance and scanners in Section 3 as you build guardrails.
Embed governance early: use the risk and privacy references in Section 4 to keep regulators happy.
Operationalize safeguards: treat Section 5 as your MLSecOps checklist.

1. Threat Modeling & Frameworks

1.1 General AI/ML Threat Modeling

MITRE ATLAS - Tactics, techniques, and case studies for attacks on AI/ML systems.
MITRE Adversarial ML Threat Matrix - ATT&CK-style matrix translating ML pipeline attacks into concrete techniques.
ENISA Artificial Intelligence Threat Landscape - Comprehensive overview of AI attack surfaces, assets, and mitigations.
CISA/NSA Guidelines for Secure AI System Development - Joint principles for designing, deploying, and monitoring AI securely.
NCSC Patterns for Secure AI System Development - Reusable architectural patterns for securing data, models, and tooling.

1.2 Risk Management, Governance & Standards

NIST AI Risk Management Framework (AI RMF 1.0) - Voluntary framework covering governance, mapping, measuring, and managing AI risk.
NIST AI RMF Playbook - Practical implementation guidance, artifacts, and crosswalks for AI RMF adoption.
NIST AI RMF Profile: Generative AI - Draft profile translating AI RMF tasks to GenAI-specific safeguards.

2. Adversarial Machine Learning

2.1 Toolkits & Libraries

Adversarial Robustness Toolbox (ART) - Python library for evasion, poisoning, extraction, and inference attacks plus defenses.
CleverHans - Classic adversarial example framework for benchmarking robustness.
Foolbox - Unified interface for fast gradient-based and decision-based attacks across DL frameworks.
AdvBox - Attack generation across CV, NLP, and speech models with multi-framework support.
TextAttack - NLP-focused adversarial attack, augmentation, and training library.
AutoAttack - Parameter-free ensemble of strong white-box attacks for reliable robustness evaluation.

2.2 Research & Surveys

Adversarial Attacks and Defences: A Survey - Deep dive on threat models, attack classes, and countermeasures for DL systems.
Security Matters: A Survey on Adversarial Machine Learning - Taxonomy linking attacker goals with defender controls across the ML lifecycle.
SoK: Security and Privacy in Machine Learning - Foundational SoK covering threat models, privacy risks, and defense trade-offs.

3. LLM & GenAI Security

3.1 Guidance & Taxonomies

OWASP Top 10 for Large Language Model Applications - Canonical list of LLM-specific risks and mitigations.
OWASP LLM Top 10 Unofficial Japanese Translation - Community translation of the OWASP LLM Top 10 for Japanese teams.
open-source-llm-scanners - Catalog of scanners and fuzzers targeting LLM misuse cases.

3.2 Tools & Frameworks

garak - LLM vulnerability scanner probing for jailbreaks, leakage, and safety failures.
LLM Guard - Input/output filtering toolkit with regex, classifiers, and secret detectors for LLM apps.
DeepTeam - Red teaming orchestration framework for multi-agent LLM penetration testing.
Giskard - Evaluation suite catching bias, robustness, and security issues in ML/LLM pipelines.
cyber-security-llm-agents - AutoGen-based agents for offensive and defensive AI security tasks.

Need LLM jailbreak benchmarks? Jump to Section 6.2.

4. Privacy, Safety & Governance

SoK: Data Reconstruction Attacks Against Machine Learning Models - Taxonomy and benchmarks for reconstruction attacks plus measurement guidance.
SoK: Security and Privacy Risks of Healthcare AI - Sector-specific review of threats to clinical AI deployments.
SoK: Data Minimization in Machine Learning - Framework for applying data-minimization principles throughout ML pipelines.

5. MLSecOps, MLOps & Supply Chain Security

MLSecOps - Opinionated repo of processes, tooling, and templates for secure ML operations.
Automating ML Security Checks using CI/CD - Guide for wiring poisoning, bias, and drift tests into pipelines.
Analyzing the Security of Machine Learning Research Code - NVIDIA AI Red Team playbook for auditing ML repos and dependencies.
ModelScan - Static and dynamic scanner for catching malicious or vulnerable model artifacts before deployment.

6. Datasets & Benchmarks

6.1 Robustness to Corruptions & Perturbations

ImageNet-C - Standard corruption benchmark to evaluate ML robustness to common noise patterns.
CIFAR-10-C / CIFAR-100-C - Corruption suites for CIFAR datasets spanning 19 perturbations at five severities.
RobustBench - Leaderboard and library for adversarially robust models plus evaluation scripts.

6.2 LLM Jailbreak & Safety Benchmarks

JailbreakBench - Open benchmark and evaluation harness for jailbreak robustness.
JBB-Behaviors dataset - 100 misuse behaviors for red teaming LLM outputs.
Heuristic Red Teaming - Prompt dataset and harness for stress-testing safety policies.

7. Learning Resources

ML Security Cheat Sheet - High-level primer on attack surfaces, threat models, and mitigation patterns.
Five Essential Machine Learning Security Papers - NCC Group commentary on must-read academic work.
Machine Learning Security Principles (Packt) - Book covering foundational concepts and defensive controls.
Responsible AI: Adversarial Attacks on LLMs (YouTube) - Conference talk demonstrating jailbreak techniques and mitigations.

9. Related Awesome Lists

Contributing

We welcome high-signal resources that directly improve the security of AI systems.

What we accept

Threat modeling frameworks, governance standards, and incident handling references.
Offensive and defensive research (adversarial ML, jailbreaks, poisoning, extraction, inference, safety testing).
Production-ready tooling, datasets, benchmarks, and red teaming harnesses.
Tutorials, talks, and books that teach practitioners how to secure AI systems.

Formatting rules

Use unordered list items (-) and keep one resource per line.
Follow the format below and keep descriptions concise and plain English.

Tools / libraries

- [Project Name](https://example.com) - One-line description of what it does and why its useful.

Papers / posts / datasets

- *Paper or Post Title* - Short summary plus venue or publisher if relevant.

Add new entries near related content sections to keep the list curated and deduplicated. Please also double-check that added links are live and publicly accessible.

Project Status & Roadmap

This list is early-stage and intentionally scoped to core security primitives. Near-term goals:

Expand domains beyond generic ML/LLM (healthcare, industrial, safety-critical control).
Track emerging GenAI-specific benchmarks and red teaming playbooks.
Highlight production case studies once vetted.

Issues and PRs are welcome for suggestions.

License

This project is released under CC0 1.0. You can copy, modify, and reuse the list without asking permission.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github		.github
docs/patterns		docs/patterns
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Awesome AI Security

Contents

?? ASB Open Source Ecosystem

Quick Start

1. Threat Modeling & Frameworks

1.1 General AI/ML Threat Modeling

1.2 Risk Management, Governance & Standards

2. Adversarial Machine Learning

2.1 Toolkits & Libraries

2.2 Research & Surveys

3. LLM & GenAI Security

3.1 Guidance & Taxonomies

3.2 Tools & Frameworks

4. Privacy, Safety & Governance

5. MLSecOps, MLOps & Supply Chain Security

6. Datasets & Benchmarks

6.1 Robustness to Corruptions & Perturbations

6.2 LLM Jailbreak & Safety Benchmarks

7. Learning Resources

9. Related Awesome Lists

Contributing

What we accept

Formatting rules

Project Status & Roadmap

License

About

Uh oh!

Releases

Packages

License

SecureAI-Team/awesome-aisecurity

Folders and files

Latest commit

History

Repository files navigation

Awesome AI Security

Contents

?? ASB Open Source Ecosystem

Quick Start

1. Threat Modeling & Frameworks

1.1 General AI/ML Threat Modeling

1.2 Risk Management, Governance & Standards

2. Adversarial Machine Learning

2.1 Toolkits & Libraries

2.2 Research & Surveys

3. LLM & GenAI Security

3.1 Guidance & Taxonomies

3.2 Tools & Frameworks

4. Privacy, Safety & Governance

5. MLSecOps, MLOps & Supply Chain Security

6. Datasets & Benchmarks

6.1 Robustness to Corruptions & Perturbations

6.2 LLM Jailbreak & Safety Benchmarks

7. Learning Resources

9. Related Awesome Lists

Contributing

What we accept

Formatting rules

Project Status & Roadmap

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages