Test Coverage Matrix

This document summarizes the test coverage for SecAI_OS across all languages and test categories.

Last updated: 2026-03-10

Summary

Language	Test Count	Runner
Go	26	`go test ./...`
Python	677+	`pytest`
Shell	All .sh files	`shellcheck`

Go Tests (26 total)

Service	Location	Tests	Description
Registry	services/registry/	6	Trusted model registry, hash pinning, cosign verification
Tool Firewall	services/tool-firewall/	10	Default-deny egress policy, rule evaluation
Airlock	services/airlock/	10	Online airlock, request sanitization, policy enforcement

Python Tests (677+ total)

Test File	Location	Approx. Tests	Description
test_pipeline.py	tests/	~96	Quarantine pipeline stages, scanning, pass/fail logic
test_search.py	tests/	~27	Search mediator, PII stripping, injection detection
test_ui.py	tests/	~11	Flask web UI routes, rendering, input handling
test_vault_watchdog.py	tests/	~18	Vault auto-lock, idle detection, timer controls
test_memory_protection.py	tests/	~37	Swap encryption, zswap, core dumps, mlock, TEE detection
test_traffic_analysis.py	tests/	~41	Padding, timing jitter, dummy traffic generation
test_differential_privacy.py	tests/	~37	Privacy-preserving query obfuscation: decoy queries, k-anonymity, timing randomization
test_clipboard_isolation.py	tests/	~30	Clipboard access controls, content sanitization
test_canary_tripwire.py	tests/	~49	Canary token placement, tripwire monitoring, alerts
test_emergency_wipe.py	tests/	~65	3-level panic wipe, secure deletion, escalation
test_update_rollback.py	tests/	~74	Signed update verification, rollback triggers, recovery
test_agent.py	tests/	93	Agent policy engine, capability tokens, storage gateway, budgets, planner, executor, API, workspace validation, security invariants

Agent test breakdown (test_agent.py)

Class	Tests	Category	Description
TestClassifyRisk	3	Unit	Risk-level classification for agent actions
TestPolicyEngine	15	Unit / Security	Deny-by-default evaluation, always-deny invariants, hard-approval gates
TestCapabilityTokens	8	Unit	Token creation, workspace scoping, mode-specific capabilities
TestBudgets	7	Unit	Budget enforcement, limit checking, sensitive-mode tighter limits
TestStorageGateway	14	Unit / Security	Path scope validation, sensitive file blocking, sensitivity ceiling, file size limits
TestPlannerHeuristic	8	Unit	Heuristic plan decomposition, keyword-to-action mapping
TestPlannerLLMParsing	4	Unit	LLM response parsing, malformed plan rejection
TestExecutor	6	Integration	Step execution dispatch, tool firewall calls, budget tracking
TestAgentAPI	17	Integration	HTTP endpoint contracts, input validation, task CRUD lifecycle, workspace ID resolution
TestSecurityInvariants	7	Security	Fail-closed behavior, airlock/firewall bypass prevention, service-down handling
TestDataModels	4	Unit	Task/step serialisation, status enum coverage
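
As an illustration of what the deny-by-default and always-deny checks listed for TestPolicyEngine might look like, here is a minimal pytest-style sketch. The `PolicyEngine` class, the `ALWAYS_DENY` set, and the action names are hypothetical stand-ins written for this example; they are not the actual SecAI_OS agent API, which this document does not show.

```python
from dataclasses import dataclass, field

# Hypothetical always-deny actions; the real rule set is not shown in this document.
ALWAYS_DENY = frozenset({"disable_firewall", "export_secrets"})


@dataclass
class PolicyEngine:
    """Illustrative stand-in for the agent policy engine."""

    allowed_actions: set = field(default_factory=set)

    def evaluate(self, action: str) -> str:
        # Always-deny actions win even if a rule tries to allow them.
        if action in ALWAYS_DENY:
            return "deny"
        # Deny by default: only explicitly allowed actions pass.
        return "allow" if action in self.allowed_actions else "deny"


class TestPolicyEngineSketch:
    def test_unknown_action_is_denied(self):
        assert PolicyEngine().evaluate("read_file") == "deny"

    def test_explicit_allow(self):
        engine = PolicyEngine(allowed_actions={"read_file"})
        assert engine.evaluate("read_file") == "allow"

    def test_always_deny_cannot_be_overridden(self):
        engine = PolicyEngine(allowed_actions={"export_secrets"})
        assert engine.evaluate("export_secrets") == "deny"
```

pytest collects the class because its name starts with `Test`. The third case is the security invariant: an allow rule must never unlock an always-deny action.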

Shell Checks

All shell scripts under files/system/ are validated with shellcheck. This is enforced in CI.

CI Pipeline

CI is defined in .github/workflows/ci.yml and runs on every push and pull request.

Steps:

1. Lint shell scripts with shellcheck
2. Run Go tests (go test ./...)
3. Syntax-check Python (py_compile for all service modules, including agent)
4. Run Python tests (pytest tests/), including the agent tests
5. Validate YAML configs (policy, agent, recipes)
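
The py_compile step above can be approximated locally with a short script. This is a sketch only: the `syntax_check` helper and the `services` root directory used in the comment are assumptions for illustration, and the actual commands in .github/workflows/ci.yml may differ.

```python
import py_compile
from pathlib import Path


def syntax_check(root: str) -> list[str]:
    """Return one error message per .py file under root that fails to compile."""
    errors = []
    for path in sorted(Path(root).rglob("*.py")):
        try:
            # doraise=True turns syntax errors into PyCompileError
            # instead of printing them to stderr.
            py_compile.compile(str(path), doraise=True)
        except py_compile.PyCompileError as exc:
            errors.append(f"{path}: {exc.msg}")
    return errors


# Example use (the directory name is an assumption):
#   problems = syntax_check("services")
#   an empty list means every module compiled cleanly
```

Note that py_compile only catches syntax errors; it does not check style or run the code, so it complements the pytest step rather than replacing it.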

Test Categories

Category	Description	Examples
Unit	Isolated function/method tests	Hash verification, policy rule parsing
Integration	Multi-component interaction tests	Pipeline stage sequencing, service auth flow
Security	Validates security invariants hold	Injection detection, PII stripping, fail-closed behavior

Running Tests Locally

Go tests

(cd services/registry && go test ./...)
(cd services/tool-firewall && go test ./...)
(cd services/airlock && go test ./...)

Python tests

pip install pytest flask requests pyyaml
pytest tests/

To run a specific test file:

pytest tests/test_pipeline.py
pytest tests/test_search.py
pytest tests/test_agent.py

Shell checks

shellcheck files/system/usr/libexec/secure-ai/*.sh
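
The glob above covers a single directory. To check every script under files/system/ in one pass, a small wrapper along these lines could be used. The helper names are hypothetical, and the sketch assumes the `shellcheck` binary is on PATH.

```python
import subprocess
from pathlib import Path


def find_shell_scripts(root: str) -> list[str]:
    """Collect every *.sh file under root, recursively, in sorted order."""
    return sorted(str(p) for p in Path(root).rglob("*.sh") if p.is_file())


def run_shellcheck(root: str = "files/system") -> int:
    """Run shellcheck on all scripts under root and return its exit code."""
    scripts = find_shell_scripts(root)
    if not scripts:
        return 0  # nothing to check
    # shellcheck accepts several files in one invocation.
    return subprocess.run(["shellcheck", *scripts]).returncode
```

A non-zero return value from `run_shellcheck()` means shellcheck reported at least one finding.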
