Adversarial ML on Network Intrusion Detection

⚠️ ARCHIVED — This project is archived. The research is complete and the findings remain valid, but no further development is planned. The adversarial control analysis methodology developed here continues in active projects — see rexcoleman.dev for current work.

Can ML-based intrusion detection systems survive adversarial evasion by a realistic attacker?

Most adversarial ML research on IDS perturbs all features equally. Real attackers can't forge TCP flags or control destination ports. This project quantifies how much feature-controllability constraints change measured adversarial robustness.

Key Results

All adversarial attacks use random noise perturbation, not gradient-based methods. Gradient-based attacks (FGSM, PGD, C&W) were not tested because the tree ensembles evaluated here (XGBoost, Random Forest) expose no input gradients. Results reflect robustness against noise-based evasion only.

Metric	XGBoost	Random Forest
Clean Macro-F1 (4-seed mean)	0.895 ± 0.013	0.853 ± 0.005
Unconstrained Attack F1 (ε = 0.3)	0.086 (-74pp)	0.153 (-63pp)
Constrained Attack F1 (ε = 0.3)	0.213 (-61pp)	0.217 (-56pp)
Attack success rate (ASR) reduction against noise (constrained vs unconstrained)	35%	5%
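
The two attack variants compared above differ only in which columns receive noise. The following is a minimal sketch of that idea, assuming features are already numerically scaled and that the attacker-controllable columns are given by index. The function name `noise_attack`, its parameters, and the toy data are illustrative assumptions and are not taken from `src/adversarial_attacks.py`.

```python
import numpy as np


def noise_attack(X, epsilon, controllable_idx=None, seed=0):
    """Add uniform noise drawn from [-epsilon, epsilon) to X.

    With controllable_idx=None every column is perturbed (unconstrained).
    Otherwise only the listed columns are perturbed (constrained).
    Hypothetical helper for illustration, not the project's implementation.
    """
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    noise = rng.uniform(-epsilon, epsilon, size=X.shape)
    if controllable_idx is not None:
        mask = np.zeros(X.shape[1], dtype=bool)
        mask[controllable_idx] = True
        noise[:, ~mask] = 0.0  # leave non-controllable columns untouched
    return X + noise


# Toy data: 5 flows, 4 features; columns 0 and 1 are assumed controllable.
X = np.zeros((5, 4))
X_unconstrained = noise_attack(X, epsilon=0.3)
X_constrained = noise_attack(X, epsilon=0.3, controllable_idx=[0, 1])
```

In the constrained case the last two columns are returned unchanged, which is the property the constrained rows of the table are meant to capture.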

Defense Effectiveness

Defense	XGBoost Recovery	RF Recovery
Adversarial Training	61%	37%
Feature Squeezing	0%	1%
Constraint-Aware Detection	100%	100%

Core insight: The most effective defense is architectural (monitoring the 14 defender-observable features for impossible changes), not learned (adversarial training). Caveat: constraint-aware detection achieves 100% against unconstrained noise (which perturbs all features including defender-observable ones) but would be bypassed by a constrained adversary who avoids perturbing those features.
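
The caveat above can be illustrated with a small sketch. It assumes the defender holds its own trusted reading of the defender-observable features and compares it with the values presented to the classifier. The function name `detect_impossible_changes`, the tolerance, and the column indices are hypothetical and do not come from `src/defenses.py`.

```python
import numpy as np


def detect_impossible_changes(X_reference, X_observed, observable_idx, tol=1e-9):
    """Return a boolean flag per row: True if any defender-observable
    feature differs from the defender's reference value by more than tol.
    Hypothetical helper for illustration only.
    """
    ref = np.asarray(X_reference, dtype=float)[:, observable_idx]
    obs = np.asarray(X_observed, dtype=float)[:, observable_idx]
    return np.any(np.abs(ref - obs) > tol, axis=1)


rng = np.random.default_rng(0)
X = rng.normal(size=(6, 5))
controllable = [0, 1, 2]  # assumed attacker-controllable columns
observable = [3, 4]       # assumed defender-observable columns

# Unconstrained noise touches every column, including the observable ones.
X_unconstrained = X + rng.uniform(-0.3, 0.3, size=X.shape)

# A constrained adversary perturbs only the controllable columns.
X_constrained = X.copy()
X_constrained[:, controllable] += rng.uniform(-0.3, 0.3, size=(6, 3))

flags_unconstrained = detect_impossible_changes(X, X_unconstrained, observable)
flags_constrained = detect_impossible_changes(X, X_constrained, observable)
```

On this toy data the unconstrained rows are all flagged and the constrained rows are not, which mirrors why the check would be bypassed by an adversary who leaves the observable features alone.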

Architecture

                    CICIDS2017 (2.83M flows, 78 features)
                                |
                    Preprocessing + Feature Split
                       /                    \
            57 Attacker-                14 Defender-
            Controllable                Observable Only
           (packet timing,             (TCP flags,
            payload size)               Dest Port)
                |                           |
        Constrained                 Constraint-Aware
        Attacks (ε-ball)            Detection (monitor
                |                    for impossible
        Baseline Models              changes)
        (RF, XGBoost, MLP)              |
                |                   100% detection
        Adversarial Training         on noise-perturbed
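        (61% F1 recovery)           observable features*

* Holds for unconstrained noise only; a constrained adversary who leaves defender-observable features untouched would bypass this check (see Limitations).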

Quick Start

# 1. Environment setup
conda env create -f environment.yml
conda activate adversarial-ids
bash scripts/verify_env.sh

# 2. Data (manual download required — registration at unb.ca/cic/datasets/ids-2017.html)
# Extract CSVs to data/raw/MachineLearningCVE/
python scripts/check_data_ready.py

# 3. Run full pipeline
python src/eda.py                                          # EDA + feature analysis
python src/train_baselines.py --seeds 123 456 789 1024     # Baseline classifiers
python src/adversarial_attacks.py --attacks noise zoo       # Adversarial attacks
python src/defenses.py                                      # Defense evaluation

Project Structure

adversarial-ids-ml/
+-- src/
|   +-- preprocessing.py       # Data pipeline + feature controllability split
|   +-- eda.py                 # Exploratory data analysis
|   +-- train_baselines.py     # RF, XGBoost, MLP training
|   +-- adversarial_attacks.py # Unconstrained + constrained attacks
|   +-- defenses.py            # Adversarial training, squeezing, constraint-aware
+-- data/
|   +-- raw/MachineLearningCVE/  # CICIDS2017 CSVs (not committed)
|   +-- splits/                  # Split metadata
|   +-- checksums.sha256         # Data integrity verification
+-- models/                    # Trained model artifacts (.pkl)
+-- outputs/
|   +-- eda/                   # Feature correlations, class distribution, summary
|   +-- baselines/             # Per-seed training results + confusion matrices
|   +-- adversarial/           # Budget curves, unconstrained/constrained results
|   +-- defense/               # Defense comparison + recovery plots
|   +-- provenance/            # versions.txt, git SHA, run log
+-- docs/                      # govML governance templates (16 active)
+-- blog/                      # Publication drafts
+-- scripts/                   # Utility scripts (verify_env, check_data_ready)
+-- FINDINGS.md                # Publication-ready results summary
+-- TRADEOFF_LOG.md            # Architectural decisions

Hypotheses

All hypotheses pre-registered before experiments (see docs/HYPOTHESIS_CONTRACT.md):

ID	Prediction	Verdict
H-1	Unconstrained attacks degrade F1 >= 30pp	Confirmed (74pp XGB, 63pp RF)
H-2	Constraints reduce ASR >= 40%	Partially Confirmed (35% XGB, 5% RF)
H-3	Adversarial training > preprocessing defense	Confirmed (61% vs 0% recovery)
H-4	Architectural > learned defense	Partially Confirmed (100% detection against unconstrained noise; would fail against constrained adversary -- see limitations)

Limitations

Random noise only -- All attacks use random uniform noise, not gradient-based methods (FGSM/PGD/C&W). The tree ensembles evaluated here (Random Forest, XGBoost) expose no input gradients, so gradient attacks don't apply directly. Black-box attacks (ZOO/HopSkipJump) would be stronger but were not tested.
Single seed for attacks/defenses -- Attack and defense experiments used seed 42 only. Baselines were confirmed stable across 4 seeds (123, 456, 789, 1024).
10% sample -- Trained on 283K rows (10% of 2.83M) for speed.
Constraint-aware detection is limited -- Achieves 100% detection against unconstrained noise (which naively perturbs defender-observable features), but a constrained adversary who only perturbs attacker-controllable features would bypass it entirely.
No adaptive attacker tested -- The true test of the architectural defense (a constrained attacker aware of the detection mechanism) was not conducted.
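
One way to address the single-seed limitation would be to repeat each attack evaluation over the baseline seeds and report mean and spread. The sketch below is an assumption about how that could look, not something the project ran. `sweep_seeds` and `toy_flip_rate` are hypothetical names, and the scorer is a toy stand-in (a sign-threshold classifier under uniform noise) rather than the project's models or data.

```python
import random
import statistics


def sweep_seeds(evaluate, seeds):
    """Run evaluate(seed) for each seed; return (mean, sample std)."""
    scores = [evaluate(seed) for seed in seeds]
    return statistics.mean(scores), statistics.stdev(scores)


def toy_flip_rate(seed, epsilon=0.3, n=1000):
    """Toy stand-in scorer: fraction of points whose label under a
    sign-threshold classifier flips after uniform noise is added."""
    rng = random.Random(seed)
    flips = 0
    for _ in range(n):
        x = rng.uniform(-1.0, 1.0)
        x_adv = x + rng.uniform(-epsilon, epsilon)
        flips += (x > 0) != (x_adv > 0)
    return flips / n


# Seeds reused from the baseline runs listed above.
mean, std = sweep_seeds(toy_flip_rate, seeds=[123, 456, 789, 1024])
print(f"flip rate: {mean:.3f} ± {std:.3f}")
```

Swapping `toy_flip_rate` for a function that runs a real attack and returns macro-F1 would give attack results a variance estimate comparable to the one reported for the clean baselines.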

Governance

Built with govML -- 16 active templates covering data contracts, experiment protocols, hypothesis pre-registration, adversarial evaluation, and reproducibility specs.

Publication

Blog post: Your IDS Adversarial Defense is Probably Testing the Wrong Threat Model
Talk target: BSides / DEF CON AI Village
Full findings: FINDINGS.md

License

MIT

Author

Rex Coleman
