Distributed Verifiable Model Inference

This project leverages zkSNARKs to enable partial privacy across a distributed set of (untrusted) nodes that collectively run an ML model. Each worker node is assigned a model shard M and can later be prompted to generate a zkSNARK proving that its shard was run correctly, i.e. that it took an input x and produced the output y = M(x). The zkSNARKs for these ML inference runs are created with the ezkl toolkit.

Project Structure

Main files

  • config.py contains important configuration values. As noted in the file, feel free to adjust the first few variables, but we recommend keeping the remaining values as configured.
  • worker.py contains the logic for the worker nodes that run individual shards.
  • coordinator.py contains the logic for the coordinator that assigns the shards to the workers and orchestrates the entire process.

Installation

The use of a Python venv is recommended.

Install dependencies:

pip install -r requirements.txt

Optionally, if you want to use ezkl's CLI version (v12.0.1), install it as follows:

curl https://raw.githubusercontent.com/zkonduit/ezkl/main/install_ezkl_cli.sh | bash -s -- v12.0.1

If you wish to use ezkl's CLI version, keep USE_EZKL_CLI = True in config.py; otherwise set it to False.
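
For orientation, the relevant flag in config.py looks roughly like the sketch below. Only USE_EZKL_CLI is taken from this README; the real file contains additional settings not shown here.

```python
# config.py (excerpt, illustrative sketch -- the actual file contains further settings)

# Use the locally installed ezkl CLI (v12.0.1) instead of the ezkl Python bindings.
# Set this to False if you skipped the CLI installation step above.
USE_EZKL_CLI = True
```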

Run the Project

First spawn one (1) coordinator, then spawn the (N) workers sequentially.

There are four (4) models available in model_training.py:

  1. mlp: A simple MLP-style model with 711 parameters
  2. cnn: A convolutional neural network with 26K parameters
  3. mlp2: A large MLP-style model with 548K parameters
  4. attention: A large attention-style model with 1.19M parameters

When running the system, pass one of the names above to select the model to use.

Warning: mlp2 and attention require several hundred GB of free RAM and take a long time to run. Use with caution!

Sample Usage:

First spawn a coordinator. In this example, the coordinator accepts connections on localhost:8000 and expects a 4-shard setup running the mlp model.

python coordinator.py localhost 8000 4 mlp

The workers must then be spawned in order.

Spawn the first worker. This worker takes on the role FIRST.

python worker.py localhost 8001 localhost 8000 FIRST

Spawn the second worker.

python worker.py localhost 8002 localhost 8000 MIDDLE

Spawn the third worker.

python worker.py localhost 8003 localhost 8000 MIDDLE

Spawn the fourth worker. This worker takes on the role LAST.

python worker.py localhost 8004 localhost 8000 LAST

If you want to run the setup with a single worker, give that worker the role SOLO.
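
To avoid launching five terminals by hand, a small launcher script can spawn the coordinator and workers in the order described above. The sketch below is not part of the repository; it simply wraps the exact commands, ports, and model name from this section using Python's subprocess module.

```python
# launch_example.py -- illustrative sketch, not part of the repository.
# Spawns one coordinator and four workers in the order described above,
# reusing the commands and ports from the sample usage.
import subprocess
import sys
import time

PYTHON = sys.executable
procs = []

# Coordinator: listens on localhost:8000, expects a 4-shard setup with the mlp model.
procs.append(subprocess.Popen([PYTHON, "coordinator.py", "localhost", "8000", "4", "mlp"]))
time.sleep(2)  # give the coordinator a moment to start accepting connections

# Workers must be spawned in order: FIRST, MIDDLE, ..., LAST.
worker_args = [
    ("8001", "FIRST"),
    ("8002", "MIDDLE"),
    ("8003", "MIDDLE"),
    ("8004", "LAST"),
]
for port, role in worker_args:
    procs.append(subprocess.Popen(
        [PYTHON, "worker.py", "localhost", port, "localhost", "8000", role]
    ))
    time.sleep(1)  # spawn workers sequentially, as required

# Wait for all processes to finish.
for p in procs:
    p.wait()
```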

Run Benchmarks

There are two benchmarking scripts in the /benchmarking directory: accuracy_benchmark.py and system_benchmark.py.

We recommend executing them from within the /benchmarking directory:

cd ./benchmarking

Sample Usage:

The following runs the accuracy benchmark for the mlp model and stores the results in the ./tmp-mlp directory. When running several instances of this benchmark in parallel, make sure to use a different results directory for each run, since files in the directory get deleted.

python accuracy_benchmark.py mlp ./tmp-mlp

The following runs the system benchmark for the cnn model; by default, the results are stored in the ./tmp-system-benchmark directory.

python system_benchmark.py cnn
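
If you want to run the accuracy benchmark for several models in one go, a small wrapper like the one below can do so while respecting the rule above about using a separate results directory per run. It is an illustrative sketch, not part of the repository, and assumes it is executed from within /benchmarking.

```python
# run_accuracy_benchmarks.py -- illustrative sketch, not part of the repository.
# Runs accuracy_benchmark.py for several models, one after the other,
# giving each run its own results directory as recommended above.
import subprocess
import sys

# mlp2 and attention are omitted here because of their large RAM requirements.
MODELS = ["mlp", "cnn"]

for model in MODELS:
    out_dir = f"./tmp-{model}"
    print(f"Running accuracy benchmark for {model}, results in {out_dir}")
    subprocess.run(
        [sys.executable, "accuracy_benchmark.py", model, out_dir],
        check=True,
    )
```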

Additional Notes

This codebase is part of a research project. The corresponding paper(s) can be found here: TODO
