Agentic Resilience & Evaluation System for Essential-Infrastructure (ARES-E) DOE Genesis Mission — Phase I Prototype Version 0.2.0 | April 2026
ARES-E is a deterministic, physics-constrained evaluation harness purpose-built for the American Science Cloud (AmSC). It provides a unified platform for scoring, validating, and auditing scientific AI agent actions across three DOE Genesis Mission topic areas:
| Topic | Codename | Capability |
|---|---|---|
| 16 — Scaling the Grid | EWIS | Physics-Informed Neural Networks for grid dispatch under Dynamic Line Rating |
| 21 — Fluid Flow for Energy | WOIK | Closed-loop thermal-hydraulic conservation via directed-graph topology |
| 20 — Cybersecurity for AI | PHIAK | ε-Differential privacy, adversarial/injection detection, data-provenance fingerprinting |
The platform enforces Verification, Validation, and Uncertainty Quantification (VVUQ) at every evaluation step, produces a cryptographically-chained (SHA-256) audit trail exportable as STIX/TAXII 2.1 bundles, and delivers a plug-and-play interoperability registry enabling zero-downtime integration of additional agency engines (NNSA, NIST, NRC, Navy, ISO).
This system directly advances the Genesis Mission's core objectives as stated in the FOA:
- Accelerate breakthroughs in energy dominance — The EWIS engine validates AI-proposed grid dispatch strategies against Kirchhoff-law physics constraints, ensuring agent recommendations improve upon classical baselines before deployment.
- Discovery science — The WOIK engine enforces first-law thermodynamic conservation (Q = ṁ · cₚ · ΔT) on closed-loop cooling systems, enabling safe exploration of novel thermal configurations for HPC and reactor cooling.
- National security capabilities — The PHIAK engine provides ε-differential privacy guarantees on infrastructure telemetry and detects adversarial injection, poisoning, and prompt-manipulation attacks targeting scientific AI agents.
ARES-E is designed for seamless deployment on the AmSC:
- Air-gapped operation — Zero external API calls; all computation is local.
- FAIR data compliance — Every payload requires structured metadata (Source, Modality, Timestamp, Uncertainty_Margin) per DOE FAIR principles.
- STIX/TAXII 2.1 audit export — Immutable ledger exportable for DOE threat-intelligence sharing and DFIR forensics.
- Containerized deployment — Docker Compose manifest provides Uvicorn + Ray head for immediate AmSC cluster integration.
| OT Section | ARES-E Compliance Mechanism |
|---|---|
| Section I — Scope of Work | Three-topic coverage (16, 21, 20) with deterministic VVUQ scoring |
| Section III — Intellectual Property | All code MIT-licensed; no third-party IP encumbrances; IP Certification-ready |
| Section IV — IP Warranty | Open-source dependencies only (PyTorch, NetworkX, NumPy, FastAPI) |
| Section V — RTES | No foreign entity dependencies; 100% U.S. development; no FCOC components |
| Section VI — Reporting | Milestone-aligned deliverables with deterministic acceptance criteria |
| Appendix II — Patent Rights | No Subject Inventions claimed; Government retains non-exclusive license |
| Appendix III — Data Rights | Open data rights; Government has Unlimited Rights in all delivered Data |
┌─────────────────────────────────────────────────────────┐
│ FastAPI Application │
│ app/main.py │
├─────────────────────────────────────────────────────────┤
│ API Layer (endpoints.py) │
│ POST /submit_workflow GET /status GET /ledger │
│ GET /domains GET /health │
├─────────────────────────────────────────────────────────┤
│ GenesisHarness (amsc_harness.py) │
│ ┌──────────────────────────────────────────────────┐ │
│ │ InteroperabilityManager (registry) │ │
│ │ register_engine / unregister_engine / execute │ │
│ └──────────────────────────────────────────────────┘ │
├────────────┬─────────────┬──────────────────────────────┤
│ EWIS │ WOIK │ PHIAK │
│ GridPINN │ FluidEngine│ DPMechanism │
│ Dispatch │ NetworkX │ AdversarialDetector │
│ Baseline │ 1st Law │ SHA-256 Fingerprint │
│ AI-Adv. │ ΔT Check │ ε-Laplace │
├────────────┴─────────────┴──────────────────────────────┤
│ ZeroTrustLedger (cyber_ledger.py) │
│ SHA-256 Hash Chain → STIX/TAXII 2.1 Bundle Export │
└─────────────────────────────────────────────────────────┘
- Ingestion — AI agent submits a JSON payload via
POST /api/v1/genesis/submit_workflowconforming to theGenesisWorkflowPayloadPydantic V2 schema. - Validation — Pydantic strict-mode validators enforce FAIR metadata presence, domain-literal typing (
EWIS | WOIK | PHIAK), and parameter-schema conformance. - Dispatch —
InteroperabilityManager.execute()routes the payload to the registered domain engine. - Evaluation — The domain engine produces a result dict containing
physics_violations(int) andvvuq_score(float in [0, 1]). - Audit —
ZeroTrustLedger.add_record()appends a SHA-256-chained block referencing the evaluation result. - Response — The harness stores the workflow status and returns the
job_idfor subsequent status queries.
| Layer | Technology | Version | Purpose |
|---|---|---|---|
| API Framework | FastAPI | 0.111+ | Async REST API with OpenAPI 3.1 auto-docs |
| Schema Validation | Pydantic V2 | 2.7+ | Strict-mode FAIR-compliant data validation |
| Physics ML | PyTorch | 2.1+ | GridPINN training with composite physics loss |
| Graph Modeling | NetworkX | 3.4+ | Directed-graph thermal-loop topology |
| Numerical | NumPy | 2.2+ | Laplace mechanism for differential privacy |
| Cryptographic | hashlib (stdlib) | — | SHA-256 hash chaining for zero-trust audit |
| Orchestration | Ray | 2.5+ | Distributed compute for AmSC HPC clusters |
| Containerization | Docker Compose 3.9 | — | Uvicorn + Ray head for local simulation |
Three-layer fully-connected neural network with sigmoid output activation:
Input(3) → Linear(64) → ReLU → Linear(64) → ReLU → Linear(1) → Sigmoid
Composite Loss Function:
Where
The deterministic dispatch layer computes effective capacity under weather-driven DLR:
Where
This metric quantifies the PINN's improvement over classical static dispatch, penalizing any physics violation to zero.
A NetworkX directed graph models the closed cooling loop:
Chiller → Pump → Heat_Load → Return → Chiller
First-law energy conservation:
Where:
-
$Q$ = heat load (kW) -
$\dot{m} = \rho \cdot V_{\text{flow}}$ = mass flow rate (kg/s) -
$c_p = 4.186$ kJ/(kg·K) for liquid water at ~25°C -
$\Delta T_{\text{critical}} = 15$ K — violation threshold
The engine rejects flow allocations where the maximum deviation from mean flow exceeds 20%, preventing physically unrealizable configurations from reaching the conservation check.
Laplace mechanism with configurable privacy budget:
Where
Eight compiled regex patterns detect injection attacks:
- SQL injection (
DROP TABLE,UNION SELECT,; --) - Prompt injection (
ignore previous instructions) - XSS (
<script>) - Command injection (
EXEC,SHUTDOWN,DELETE)
Seven keyword detectors identify data-poisoning vocabulary: malicious, backdoor, trigger, poison, adversarial, trojan, evasion.
Content-addressable fingerprinting enables data provenance tracking across the AmSC:
Each domain engine returns a vvuq_score in [0, 1]:
| Domain | VVUQ Computation |
|---|---|
| EWIS | $1 - |
| WOIK | 1.0 if energy is conserved AND |
| PHIAK |
The physics validation layer operates independently of the AI prediction layer:
- Deterministic dispatch computes ground-truth metrics from first principles.
- PINN prediction is compared against the deterministic result.
- VVUQ score quantifies agreement; violations are absolute disqualifiers.
This separation ensures that no AI prediction can bypass physics constraints.
Each evaluation produces a block:
{
"index": N,
"timestamp": "ISO-8601",
"data": { ... evaluation payload ... },
"stix_type": "indicator",
"taxii_collection": "DOE-ARES-E",
"previous_hash": "SHA-256 of block N-1",
"hash": "SHA-256 of this block (excluding hash field)"
}verify_chain() walks the full chain and confirms:
- Each block's
previous_hashmatches the prior block'shash. - Each block's
hashis consistent with its content.
The ledger exports as a STIX bundle with spec_version: "2.1", where each block becomes an indicator object with labels ["DOE", "ARES-E", "STIX", "zero-trust"] and confidence 100.
The InteroperabilityManager provides a plug-and-play engine registry:
harness.interop.register_engine("NNSA", nnsa_engine_callable)
harness.interop.register_engine("NIST", nist_engine_callable)
harness.interop.get_registered_domains() # ['EWIS', 'NNSA', 'NIST', 'PHIAK', 'WOIK']Any callable conforming to the following signature can be registered:
def engine(payload: Dict[str, Any]) -> Dict[str, Any]:
"""Must return dict with 'physics_violations' (int) and 'vvuq_score' (float)."""The harness routes payloads by the domain field in the workflow payload, enabling seamless integration of DOE Lab, NNSA, Navy, NIST, NRC, and ISO evaluation engines without modifying the orchestration core.
| Module | Tests | Coverage |
|---|---|---|
test_physics_engines.py |
25 | EWIS dispatch, PINN, baseline, AI-advantage; WOIK conservation, violations, flow balance; PHIAK privacy, adversarial detection, fingerprint |
test_amsc_harness.py |
11 | All 3 domains via harness, failures, ledger growth, chain verification, interoperability register/unregister |
test_api_endpoints.py |
9 | All 5 REST endpoints, Pydantic validation errors, 404 handling |
| Total | 45 | Zero warnings, zero deprecations |
git clone https://github.com/DaScient/Genesis.git && cd Genesis
pip install -e .
PYTHONPATH=. pytest -v
uvicorn app.main:app --reload --host 0.0.0.0 --port 8000docker-compose up --build- Deploy behind Science DMZ network perimeter.
- Set
CORS_ORIGINSenvironment variable to restrict allowed origins. - Enable
torch.use_deterministic_algorithms(True)for reproducibility on LCF GPU nodes. - Mount Ray cluster head for distributed PINN training across HPC nodes.
| Version | Date | Author | Description |
|---|---|---|---|
| 0.1.0 | 2026-04-02 | ARES-E Team | Initial scaffold and engine implementations |
| 0.2.0 | 2026-04-02 | ARES-E Team | Production-grade rewrite: Pydantic V2, 45 tests, interoperability, enhanced notebooks |