Prompt Injection Taxonomy Across Agent Frameworks

65-78% injection success across all 4 frameworks tested (LangChain, CrewAI, AutoGen, direct API). Framework choice creates 13pp variation — and indirect injection via tool output is 2x more effective than direct injection on CrewAI. Multi-agent CrewAI (55%) is less vulnerable than single-agent (70%), opposite of prediction.

Blog post: Which Agent Frameworks Are Most Vulnerable to Prompt Injection?

Key Results

Framework	Injection Success Rate	vs Direct API
LangChain	78%	+3pp (most vulnerable)
Direct API	75%	baseline
CrewAI	70%	-5pp
AutoGen	65%	-10pp (most resistant)
CrewAI multi-agent	55%	-20pp (opposite of prediction)

Quick Start

git clone https://github.com/rexcoleman/framework-injection-taxonomy
cd framework-injection-taxonomy
pip install -e .
bash reproduce.sh

Project Structure

FINDINGS.md # Research findings with pre-registered hypotheses and full results
EXPERIMENTAL_DESIGN.md # Pre-registered experimental design and methodology
HYPOTHESIS_REGISTRY.md # Hypothesis predictions, results, and verdicts
reproduce.sh # One-command reproduction of all experiments
governance.yaml # govML governance configuration
CITATION.cff # Citation metadata
LICENSE # MIT License
pyproject.toml # Python project configuration
scripts/ # Experiment and analysis scripts
src/ # Source code
tests/ # Test suite
outputs/ # Experiment outputs and results
docs/ # Documentation and decision records

Methodology

See FINDINGS.md and EXPERIMENTAL_DESIGN.md for detailed methodology, pre-registered hypotheses, and full experimental results with multi-seed validation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Prompt Injection Taxonomy Across Agent Frameworks

Key Results

Quick Start

Project Structure

Methodology

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
blog		blog
docs		docs
outputs		outputs
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
CITATION.cff		CITATION.cff
DECISION_LOG.md		DECISION_LOG.md
EXPERIMENTAL_DESIGN.md		EXPERIMENTAL_DESIGN.md
FINDINGS.md		FINDINGS.md
HYPOTHESIS_REGISTRY.md		HYPOTHESIS_REGISTRY.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
reproduce.sh		reproduce.sh

Folders and files

Latest commit

History

Repository files navigation

Prompt Injection Taxonomy Across Agent Frameworks

Key Results

Quick Start

Project Structure

Methodology

Citation

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages