Qora-FL

Quorum-Oriented Robust Aggregation for Federated Learning

Problem

Federated learning systems are fragile under adversarial or faulty clients. Standard aggregation methods silently fail under model poisoning, gradient manipulation, or non-IID drift.

Qora-FL provides quorum-oriented, Byzantine-tolerant aggregation primitives designed for predictable behavior under adversarial conditions, with optional determinism and reputation-aware filtering. Replace FedAvg with a single line change and tolerate up to 30% malicious clients.

How Qora-FL Differs

Most "robust FL" implementations are paper artifacts or tightly coupled experiments. Qora-FL is designed as infrastructure, emphasizing:

Explicit aggregation semantics -- each method has documented tolerance bounds, not just "robust"
Measurable deviation signals -- reputation is derived from observable behavior, not assumed trust
Deterministic execution paths -- BFP-16 block floating-point for deterministic Krum/Multi-Krum distance computation
Ecosystem integration -- drop-in Flower strategy, not a standalone experiment
Benchmarked overhead -- sub-10ms aggregation for 100K parameters, not just accuracy claims

Feature	FedAvg	Typical Robust FL	Qora-FL
Byzantine tolerance	--	Paper-only	Algorithms informed by 181-day QRES deployment
Deterministic option	--	--	BFP-16 block floating-point
Reputation tracking	--	--	Deviation-derived, persistent
Flower integration	Native	--	Drop-in `QoraStrategy`
Benchmarked overhead	--	--	<10ms / 100K params
Production API	Partial	--	Rust core + Python bindings

Architecture

Clients produce model updates
              │
              ▼
┌────────────────────────────────┐
│   Qora Robust Aggregator       │
│                                │
│  ┌──────────────────────────┐  │
│  │ Trimmed Mean    (~30%)   │  │
│  │ Median          (~50%)   │  │
│  │ Krum       (n ≥ 2f+3)    │  │
│  │ Multi-Krum  (top-m avg)  │  │
│  │ FedAvg      (baseline)   │  │
│  └──────────────────────────┘  │
└─────────────┬──────────────────┘
              │
   Deviation signal (not trust)
              │
              ▼
     ┌─────────────────┐
     │    Reputation   │
     │     Manager     │
     │                 │
     │  Scores / Gates │
     │  / Weighting    │
     └────────┬────────┘
              │
              ▼
     Verified Global Update

Determinism & Reproducibility

Floating-point aggregation is not reproducible across platforms or runtimes. Qora-FL supports deterministic aggregation paths via BFP-16 (block floating-point with 16-bit mantissas and per-vector shared exponent), enabling:

Reproducible experiments -- identical Krum/Multi-Krum rankings regardless of hardware, compiler, or optimization level
Auditability -- any party can verify the exact aggregation result from the same inputs
Consensus-style FL -- bit-perfect agreement required for regulated industries (healthcare, finance)

BFP-16 extends the original Q16.16 fixed-point approach to cover the full f32 range. Distance computations use purely integer arithmetic (i32 shifts, i64 accumulation, saturating operations). The deterministic Krum implementation was validated across ARM Cortex-M, Xtensa (ESP32), and x86_64 during the 181-day QRES deployment.

Reputation as a First-Class Primitive

Reputation in Qora-FL is not trust. It is a deviation-derived signal measuring how closely a client's update aligns with the emergent robust consensus.

Reputation may be used to:

Weight aggregation contributions (cubic influence: min(rep^3, 0.8))
Gate participation (clients below threshold are excluded)
Detect persistent outliers across rounds (not just per-round anomalies)
Persist across server restarts via JSON serialization

The 0.8 influence cap prevents any single node from dominating consensus, even at maximum reputation. This mitigates the "Slander-Amplification" vulnerability identified during the QRES deployment.

Quick Start (Python + Flower)

pip install qora-fl[flower]

import flwr as fl
from qora import QoraStrategy

strategy = QoraStrategy(
    aggregation_method="trimmed_mean",
    trim_fraction=0.2,
    min_fit_clients=5,
)

fl.server.start_server(
    server_address="0.0.0.0:8080",
    config=fl.server.ServerConfig(num_rounds=10),
    strategy=strategy,
)

QoraStrategy inherits from FedAvg -- all standard Flower parameters (fraction_fit, min_fit_clients, initial_parameters, etc.) work as expected.

Python (Standalone)

pip install qora-fl

import numpy as np
from qora import ByzantineAggregator

agg = ByzantineAggregator("trimmed_mean", 0.3)

# 7 honest clients + 3 Byzantine attackers
updates = [np.array([[1.0, 2.0]], dtype=np.float32) for _ in range(7)]
updates += [np.array([[100.0, 200.0]], dtype=np.float32) for _ in range(3)]

result = agg.aggregate(updates)
# Result is [1.0, 2.0] -- attackers rejected

Reputation Tracking

from qora import ReputationManager

rep = ReputationManager(ban_threshold=0.2)
rep.reward("hospital_A", 0.02)

# Penalize repeatedly -- simulates multiple rounds of bad behavior
for _ in range(5):
    rep.penalize("hospital_bad", 0.08)

print(rep.is_banned("hospital_bad"))  # True (score dropped below 0.2)
print(rep.active_clients())           # Only non-banned clients

# Persist between server restarts
json_state = rep.to_json()
rep2 = ReputationManager.from_json(json_state, ban_threshold=0.2)

Rust

cargo add qora-fl

use qora_fl::{ByzantineAggregator, AggregationMethod};
use ndarray::array;

let mut agg = ByzantineAggregator::new(AggregationMethod::TrimmedMean, 0.3);

let updates = vec![
    array![[1.0, 2.0]],   // Honest
    array![[1.1, 2.1]],   // Honest
    array![[0.9, 1.9]],   // Honest
    array![[100.0, 200.0]], // Byzantine attacker
];

let result = agg.aggregate(&updates, None).unwrap();
// Result is close to [1.0, 2.0], attacker ignored

Individual Functions

use qora_fl::{trimmed_mean, median, fedavg};
use ndarray::array;

let updates = vec![array![[1.0]], array![[2.0]], array![[3.0]]];

let tm = trimmed_mean(&updates, 0.2).unwrap();
let med = median(&updates).unwrap();
let avg = fedavg(&updates, None).unwrap();

Aggregation Methods

Method	Byzantine Tolerance	Use Case
`TrimmedMean`	~30% of clients	Default choice for most FL deployments
`Median`	~50% of clients	When stronger robustness is needed
`Krum`	n >= 2f+3	Deterministic single-vector selection
`MultiKrum`	n >= 2f+3	Top-m averaging for smoother convergence
`FedAvg`	None	Baseline comparison only

Benchmarks

Trimmed mean remains stable under ~30% label-flipping and gradient-scaling attacks, converging faster and to higher accuracy than undefended FedAvg. Aggregation overhead stays under 10ms for typical FL model sizes. At larger model sizes, aggregation cost becomes memory-bound, consistent with the behavior of coordinate-wise robust aggregation methods.

Byzantine Robustness

The following benchmark evaluates all aggregation methods against five common FL attacks (Label Flip, Gradient Scaling, Sign Flip, Additive Noise, ALIE) with 10–30% Byzantine clients:

Key finding: FedAvg drops to ~10% accuracy under gradient-based attacks. Robust aggregators maintain >90% accuracy across all attack types and Byzantine ratios.

Convergence Under Attack

Under a sustained 30% label-flipping attack, robust methods converge faster and reach higher final accuracy:

Key finding: Robust aggregation methods not only tolerate attacks—they accelerate convergence by filtering noisy gradients.

Run Benchmarks

# Aggregation overhead (Python)
python examples/benchmark_overhead.py

# MNIST poisoning: FedAvg vs Qora-FL under 30% attack
pip install matplotlib scikit-learn
python examples/mnist_poisoning_demo.py

# Criterion benchmarks (Rust)
cargo bench

# Rust examples
cargo run --example quickstart
cargo run --example compare_methods

Background

The aggregation algorithms in Qora-FL (trimmed mean, coordinate-wise median, Krum, Multi-Krum) are standard Byzantine-robust aggregation methods from the federated learning literature [1][2]. The specific design decisions -- deviation-derived reputation, BFP-16 deterministic paths, adaptive trim fraction, and the 0.8 influence cap -- were informed by operational experience during a 181-day autonomous IoT deployment (QRES). The Qora-FL codebase is a clean-room Rust implementation; it does not share code with the original QRES prototype.

Design Philosophy

Qora-FL treats aggregation as a decision process, not a statistical convenience. The system favors explicit constraints, observable signals, and predictable failure modes over opaque optimization.

Roadmap

Weighted robust aggregation (reputation-scaled trimmed mean)
TensorFlow Federated adapter
Formal verification experiments for the deterministic path
Norm-bound verification as a pre-aggregation filter

Requirements

Rust: edition 2021, stable toolchain
Python: >= 3.8 (bindings built with PyO3 abi3)
NumPy: >= 1.21.0
Flower (optional): >= 1.5.0 (pip install qora-fl[flower])

Building from Source

# Rust
cargo build
cargo test
cargo doc --open

# Python bindings
cd bindings/python
pip install maturin
maturin develop --features python

References

Blanchard, P., El Mhamdi, E. M., Guerraoui, R., & Stainer, J. (2017). Machine Learning with Adversaries: Byzantine Tolerant Gradient Descent. NeurIPS.
Yin, D., Chen, Y., Ramchandran, K., & Bartlett, P. (2018). Byzantine-Robust Distributed Learning: Towards Optimal Statistical Rates. ICML.
Krenik, C. (2025). QRES: Resource-Aware Agentic Swarm for Distributed Learning. Zenodo. DOI: 10.5281/zenodo.18474976

License

Licensed under either of Apache License, Version 2.0 or MIT License at your option.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.github/workflows		.github/workflows
benches		benches
bindings/python		bindings/python
docs		docs
examples		examples
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Cargo.toml		Cargo.toml
README.md		README.md
SECURITY.md		SECURITY.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Qora-FL

Problem

How Qora-FL Differs

Architecture

Determinism & Reproducibility

Reputation as a First-Class Primitive

Quick Start (Python + Flower)

Python (Standalone)

Reputation Tracking

Rust

Individual Functions

Aggregation Methods

Benchmarks

Byzantine Robustness

Convergence Under Attack

Run Benchmarks

Background

Design Philosophy

Roadmap

Requirements

Building from Source

References

License

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Qora-FL

Problem

How Qora-FL Differs

Architecture

Determinism & Reproducibility

Reputation as a First-Class Primitive

Quick Start (Python + Flower)

Python (Standalone)

Reputation Tracking

Rust

Individual Functions

Aggregation Methods

Benchmarks

Byzantine Robustness

Convergence Under Attack

Run Benchmarks

Background

Design Philosophy

Roadmap

Requirements

Building from Source

References

License

About

Topics

Resources

Security policy

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages