Conversation

@vandana-rajan

Adds a new WASM example for running quantized Embedding Gemma 300M models in the browser with WebAssembly.

Available Models:

  1. Q8_0 (approx. 340 MB)
  2. Q4_0 (approx. 297 MB)

Both models are from Unsloth AI. The output from these models is post-processed by two dense layers (provided by Google) and then normalized.
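As a rough sketch of that post-processing stage (names, shapes, and values here are illustrative, not the actual candle implementation), the pooled model output passes through two dense layers and is then L2-normalized:

```rust
// Illustrative sketch only: plain Vec<f32> instead of candle tensors,
// and tiny hypothetical weights, to keep the example self-contained.

/// A dense (fully connected) layer: out[i] = sum_j(w[i][j] * x[j]) + b[i].
fn dense(input: &[f32], weights: &[Vec<f32>], bias: &[f32]) -> Vec<f32> {
    weights
        .iter()
        .zip(bias)
        .map(|(row, b)| row.iter().zip(input).map(|(w, x)| w * x).sum::<f32>() + b)
        .collect()
}

/// L2 normalization: divide by the vector's Euclidean norm.
fn l2_normalize(v: &[f32]) -> Vec<f32> {
    let norm = v.iter().map(|x| x * x).sum::<f32>().sqrt().max(1e-12);
    v.iter().map(|x| x / norm).collect()
}

fn main() {
    // Toy 2-dimensional stand-in for the pooled embedding.
    let pooled = vec![1.0_f32, 2.0];
    let (w1, b1) = (vec![vec![1.0, 0.0], vec![0.0, 1.0]], vec![0.0, 0.0]);
    let (w2, b2) = (vec![vec![2.0, 0.0], vec![0.0, 2.0]], vec![0.0, 0.0]);

    // dense -> dense -> L2 normalize, as described above.
    let h = dense(&pooled, &w1, &b1);
    let out = l2_normalize(&dense(&h, &w2, &b2));
    println!("{:?}", out); // unit-length embedding vector
}
```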

The demo interface follows the BERT WASM example.

Changes

  1. New example in candle-wasm-examples/quant-embed-gemma/
  2. Modifications in candle-transformers/src/models/quantized_gemma3.rs to accommodate Embedding Gemma

Usage

wasm-pack build --target web --release
python3 ./serve.py --port 8000
# Then open http://localhost:8000 in a browser

Note: these changes are built on top of @DrJesseGlass's modifications to quantized_gemma3.rs.

@vandana-rajan
Copy link
Author

Hello @DrJesseGlass

I have added this new WASM example on top of your changes that are yet to be merged. Once your PR is merged, I can simply rebase onto it. In the meantime, if you could kindly review this PR, that would be great.

Thanks,
Vandana
