🔒 EphemeralML

Confidential AI inference with hardware-backed attestation

Run AI models where prompts and weights stay encrypted — even if the host is compromised.


Why EphemeralML?

Problem                          Solution
Cloud hosts can see your data    TEE isolation — data decrypted only inside the enclave
"Trust me" isn't enough          Cryptographic attestation — verify code before sending secrets
No audit trail                   Execution receipts — proof of what code processed your data

Built for: Defense, GovCloud, Finance, Healthcare — anywhere "good enough" security isn't.


Architecture

┌─────────┐      HPKE      ┌──────────────┐     VSock      ┌─────────────┐
│  Client │◄──────────────►│  Host (blind │◄──────────────►│   Enclave   │
└─────────┘   encrypted    │    relay)    │   encrypted    └──────┬──────┘
                           └──────┬───────┘                       │
                                  │                               │ NSM
                                  │ S3                            ▼
                           ┌──────┴──────┐                ┌───────────────┐
                           │  Encrypted  │                │    AWS KMS    │
                           │    Models   │                │ (key release) │
                           └─────────────┘                └───────────────┘

Key insight: Host never has keys. It just forwards ciphertext.
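
The sketch below shows what "blind relay" means in practice: the host copies opaque, length-prefixed frames between the client connection and the enclave without ever parsing them. It uses only the Rust standard library; the 4-byte big-endian length prefix and the function names are illustrative assumptions, not the project's actual wire format.

use std::io::{self, Read, Write};

/// Forward one length-prefixed frame from `src` to `dst` without inspecting it.
/// The host only ever sees ciphertext bytes; it has no key material to decrypt them.
/// Frame format (4-byte big-endian length prefix) is an illustrative assumption.
fn relay_frame<R: Read, W: Write>(src: &mut R, dst: &mut W) -> io::Result<()> {
    let mut len_buf = [0u8; 4];
    src.read_exact(&mut len_buf)?;
    let len = u32::from_be_bytes(len_buf) as usize;

    let mut frame = vec![0u8; len];
    src.read_exact(&mut frame)?;

    dst.write_all(&len_buf)?;
    dst.write_all(&frame)?; // opaque HPKE ciphertext, forwarded verbatim
    dst.flush()
}

/// Blind relay: shuttle frames client -> enclave until the client disconnects.
fn relay_loop<R: Read, W: Write>(client: &mut R, enclave: &mut W) -> io::Result<()> {
    loop {
        if let Err(e) = relay_frame(client, enclave) {
            // UnexpectedEof just means the peer closed the connection.
            return if e.kind() == io::ErrorKind::UnexpectedEof { Ok(()) } else { Err(e) };
        }
    }
}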


Security Model

What's Protected

  • Model weights (IP protection)
  • Prompts & outputs (PII / classified data)
  • Execution integrity (verified code)

How

  1. Attestation-gated key release — KMS releases DEK only if enclave PCRs match policy
  2. HPKE encrypted sessions — end-to-end encryption, host sees only ciphertext
  3. Ed25519 signed receipts — cryptographic proof of execution (see the verification sketch below)
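
For illustration, a minimal client-side verification of step 3, assuming the ed25519-dalek 2.x crate. The receipt layout and the idea that the enclave's verifying key is bound into the attestation document are assumptions made for the sketch, not the project's actual wire format.

use ed25519_dalek::{Signature, Verifier, VerifyingKey};

/// Illustrative receipt shape; the real AER layout may differ.
struct Receipt {
    payload: Vec<u8>,    // canonical bytes covering model/input/output hashes, PCRs, timestamp
    signature: [u8; 64], // Ed25519 signature produced inside the enclave
}

/// Verify that a receipt was signed by the enclave's attested signing key
/// (assumed to be delivered in, or bound to, the attestation document).
fn verify_receipt(enclave_pubkey: &[u8; 32], receipt: &Receipt) -> bool {
    let Ok(vk) = VerifyingKey::from_bytes(enclave_pubkey) else {
        return false;
    };
    let sig = Signature::from_bytes(&receipt.signature);
    vk.verify(&receipt.payload, &sig).is_ok()
}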

Threat Model

  • ✓ Compromised host OS → Protected (enclave isolation)
  • ✓ Malicious cloud admin → Protected (can't decrypt)
  • ✓ Supply chain attack → Detected (PCR verification; see the sketch after this list)
  • ✓ Model swap attack → Prevented (signed manifests)
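
The PCR check behind the supply-chain row can be as simple as pinning expected measurements and comparing them against the attested values. The sketch below uses only the standard library; the pinned values are placeholders, and the same PCRs are also enforced server-side by the KMS key policy (per the Security Model above).

use std::collections::HashMap;

/// Expected enclave measurements, e.g. pinned at client build time.
/// Indices follow the Nitro convention (PCR0: enclave image, PCR1: kernel + bootstrap,
/// PCR2: application); the values here are placeholders, not real measurements.
fn expected_pcrs() -> HashMap<u8, String> {
    HashMap::from([
        (0, "aaaa...".to_string()),
        (1, "bbbb...".to_string()),
        (2, "cccc...".to_string()),
    ])
}

/// Reject the enclave unless every pinned PCR matches the attested value exactly.
fn pcrs_match(attested: &HashMap<u8, String>, expected: &HashMap<u8, String>) -> bool {
    expected
        .iter()
        .all(|(idx, want)| attested.get(idx).map(|got| got == want).unwrap_or(false))
}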

Features

Core (Production Ready)

  • Nitro Enclave integration with real NSM attestation (see the attestation sketch after this list)
  • AWS KMS key release via RSA-2048 SPKI handshake
  • VSock protocol for host↔enclave communication
  • S3 model storage with client-side encryption
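
As a sketch of the NSM attestation mentioned in the first item, here is roughly what the enclave-side call looks like, assuming the aws-nitro-enclaves-nsm-api and serde_bytes crates. What gets bound into the document (e.g. the RSA SPKI public key used for the KMS handshake) is an assumption for illustration, and error handling is kept minimal.

use aws_nitro_enclaves_nsm_api::api::{Request, Response};
use aws_nitro_enclaves_nsm_api::driver::{nsm_exit, nsm_init, nsm_process_request};
use serde_bytes::ByteBuf;

/// Ask the Nitro Security Module for a signed attestation document that binds
/// `public_key` (and an optional caller nonce) to this enclave's PCRs.
fn request_attestation(public_key: Vec<u8>, nonce: Option<Vec<u8>>) -> Result<Vec<u8>, String> {
    let fd = nsm_init();
    if fd < 0 {
        return Err("failed to open NSM device".into());
    }

    let response = nsm_process_request(
        fd,
        Request::Attestation {
            user_data: None,
            nonce: nonce.map(ByteBuf::from),
            public_key: Some(ByteBuf::from(public_key)),
        },
    );
    nsm_exit(fd);

    match response {
        // `document` is the COSE-signed CBOR attestation document from the NSM.
        Response::Attestation { document } => Ok(document),
        _ => Err("unexpected NSM response".into()),
    }
}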

Inference Engine

  • Candle-based transformer inference (MiniLM, BERT, Llama)
  • GGUF support for quantized models (int4, int8)
  • BF16/safetensors format enforcement (see the sketch after this list)
  • Memory-optimized for TEE constraints
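
A sketch of the BF16/safetensors enforcement noted above, assuming the safetensors crate. The strict "every tensor must be BF16" rule is an illustration of the idea rather than the engine's exact policy; quantized GGUF models would be handled by a separate loader.

use safetensors::tensor::{Dtype, SafeTensors};

/// Refuse to load a checkpoint unless every tensor is stored as BF16.
/// (This applies to safetensors checkpoints; GGUF is a different format with its own loader.)
fn enforce_bf16(raw_safetensors: &[u8]) -> Result<(), String> {
    let parsed = SafeTensors::deserialize(raw_safetensors).map_err(|e| e.to_string())?;
    for (name, view) in parsed.tensors() {
        if view.dtype() != Dtype::BF16 {
            return Err(format!("tensor `{name}` has dtype {:?}, expected BF16", view.dtype()));
        }
    }
    Ok(())
}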

Compliance

  • Attested Execution Receipts (AER) for audit
  • 120+ unit tests passing
  • Deterministic builds for reproducibility

Quick Start

Prerequisites

  • AWS account with Nitro Enclave support
  • Rust 1.75+ (for local development)
  • Terraform (for infrastructure)

Deploy

# 1. Provision infrastructure
cd infra/hello-enclave
terraform init && terraform apply

# 2. Build enclave image
./scripts/build_enclave.sh

# 3. Run
nitro-cli run-enclave --eif-path enclave.eif --cpu-count 2 --memory 4096

See QUICKSTART.md for detailed instructions.


Project Status

Component          Status
NSM Attestation    ✅ Production
KMS Integration    ✅ Production
VSock Protocol     ✅ Production
HPKE Sessions      ✅ Production
Inference Engine   ✅ Production
Receipt Signing    ✅ Production
Compliance Tools   🚧 In Progress
Policy Updates     📋 Planned

Overall: 97% complete — E2E path verified on AWS Nitro.


Documentation


License

Apache 2.0 — see LICENSE


Run inference like the host is already hacked.
