Ollama AuxRNG

This community fork of Ollama introduces AuxRNG: an optional auxiliary randomness source that can be specified at runtime for use during token sampling.

With this, Ollama can draw from any byte-producing device node for sampling decisions instead of using the default internal PRNG path. The model weights do not change — only the sampler's randomness stream does.

AuxRNG is meant for experimentation and research work and is especially useful for those interested in supplying "true/quantum randomness" to LLMs. If you don’t explicitly enable it, this Ollama behaves the same as upstream (PRNG).

What is AuxRNG?

AuxRNG is a small integration point that provides an environment hook OLLAMA_AUXRNG_DEV by which the runtime will pull bytes from a specified external device node (e.g. a hardware TRNG like /dev/ttyACM0) and feed those bytes to the sampler.

Does not change model files, prompts, or the REST API schema
Works with any model you run through Ollama
Can be enabled/disabled at runtime (via environment)
Throughput/latency depends on your device and how you read from it

If the environment hook is undefined, or cannot be opened or read, Ollama falls back to default internal PRNG behavior (and logs accordingly).

AuxRNG is device-agnostic; technically it can read from any device node that produces bytes, though its intended use here involves TRNG or QRNG devices.

Process flow

In this iteration, AuxRNG simply feeds fresh bytes "on-demand" to the chooser sampler on a per-token basis. The intention is to provide "live randomness" for selecting each token rather than snapshot determinism from a prefilled buffer.

Runtime logging is somewhat verbose at present; Auxiliary RNG: prints with metadata for each read of your AuxRNG device (~per-token).

Quick Start

1) Identify the node path of your auxiliary device or randomness source

Examples:

/dev/ttyACM0 (typical of TrueRNGpro V2 hardware RNG)
/dev/urandom (Linux kernel-provided cryptographic pseudorandom byte stream)

2) Ensure that filesystem permissions allow the device to be read

On many Linux distros, the user that will serve Ollama needs to be added to the dialout group so that Ollama can read from the serial device:

sudo usermod -aG dialout "$USER"
# then log out/in

3) (Optional but recommended) Install official Ollama to establish GPU support libraries and a convenient side-by-side testing environment.

GPU (CUDA) libraries are included with the official Ollama installation. The custom ollama-auxrng binary will use those. If you don't have it and you want GPU support and/or a side-by-side testing environment, install it.

curl -fsSL https://ollama.com/install.sh | sh

4) Clone AuxRNG, build

git clone https://github.com/orphiceye/ollama-auxrng.git ~/ollama-auxrng
cd ~/ollama-auxrng
go clean -cache
# We will place the ollama-auxrng binary alongside the standard ollama binary - same path use by the official Ollama installation (typically /usr/local/bin/).
# This helps the auxrng find CUDA libs at runtime for GPU support in case you want it.
go build -buildvcs=false -o /usr/local/bin/ollama-auxrng

5) Set the environment and launch

I recommend using an alternative port like 11435 to avoid clashing with another Ollama instance on the default 11434. In this manner they can run side-by-side.

#Environment variables
OLLAMA_HOST=http://0.0.0.0:11435 \
OLLAMA_MODELS=/path/to/your/language/models \
OLLAMA_AUXRNG_DEV=/dev/yourDeviceNode \
/usr/local/bin/ollama-auxrng serve

Name		Name	Last commit message	Last commit date
Latest commit History 4,917 Commits
.github		.github
api		api
app		app
auth		auth
cmd		cmd
convert		convert
discover		discover
docs		docs
envconfig		envconfig
format		format
fs		fs
harmony		harmony
integration		integration
kvcache		kvcache
llama		llama
llm		llm
logutil		logutil
middleware		middleware
ml		ml
model		model
openai		openai
parser		parser
progress		progress
readline		readline
runner		runner
sample		sample
scripts		scripts
server		server
template		template
thinking		thinking
tools		tools
types		types
version		version
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.golangci.yaml		.golangci.yaml
CMakeLists.txt		CMakeLists.txt
CMakePresets.json		CMakePresets.json
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile.sync		Makefile.sync
README.md		README.md
SECURITY.md		SECURITY.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ollama AuxRNG

What is AuxRNG?

Process flow

Quick Start

1) Identify the node path of your auxiliary device or randomness source

2) Ensure that filesystem permissions allow the device to be read

3) (Optional but recommended) Install official Ollama to establish GPU support libraries and a convenient side-by-side testing environment.

4) Clone AuxRNG, build

5) Set the environment and launch

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Ollama AuxRNG

What is AuxRNG?

Process flow

Quick Start

1) Identify the node path of your auxiliary device or randomness source

2) Ensure that filesystem permissions allow the device to be read

3) (Optional but recommended) Install official Ollama to establish GPU support libraries and a convenient side-by-side testing environment.

4) Clone AuxRNG, build

5) Set the environment and launch

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages