@DrJesseGlass (Contributor) commented Jan 23, 2026

Adds support for Google's TranslateGemma translation models (55 languages) and consolidates the Gemma model family into a unified module structure.

Reorganization

gemma.rs → gemma/gemma1.rs
Consolidated gemma2.rs, gemma3.rs, quantized_gemma3.rs under gemma/
Added gemma/mod.rs with re-exports for backward compatibility
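
A minimal sketch of what the re-exporting gemma/mod.rs can look like; the exact re-export list in this PR may differ:

```rust
// candle-transformers/src/models/gemma/mod.rs (sketch)
pub mod gemma1;
pub mod gemma2;
pub mod gemma3;
pub mod quantized_gemma3;
pub mod translate_gemma;

// Re-export the old top-level items so existing paths such as
// `models::gemma::Model` keep resolving after the gemma.rs -> gemma/gemma1.rs move.
pub use gemma1::*;
```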

TranslateGemma Support

Added gemma/translate_gemma.rs with prompt formatting utilities and ISO 639-1 language codes (see the sketch after this list)
Added examples/translate-gemma.rs supporting both full precision and quantized inference
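
For illustration, a hypothetical sketch of the kind of helpers translate_gemma.rs provides; the function names, template wording, and tiny language table are assumptions (only the <start_of_turn>/<end_of_turn> markers are standard Gemma chat tokens), not the module's actual API:

```rust
/// Hypothetical prompt-formatting helper; the real template lives in
/// gemma/translate_gemma.rs and may differ.
fn format_translation_prompt(source: &str, target: &str, text: &str) -> String {
    format!(
        "<start_of_turn>user\nTranslate the following text from {source} to {target}:\n{text}<end_of_turn>\n<start_of_turn>model\n"
    )
}

/// Tiny excerpt of an ISO 639-1 lookup; the module covers all 55 languages.
fn language_name(code: &str) -> Option<&'static str> {
    match code {
        "en" => Some("English"),
        "de" => Some("German"),
        "ja" => Some("Japanese"),
        _ => None,
    }
}

fn main() {
    let (src, tgt) = (language_name("en").unwrap(), language_name("de").unwrap());
    println!("{}", format_translation_prompt(src, tgt, "Hello, world!"));
}
```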

Bug Fixes

gemma3.rs: Make KV tensors contiguous before cache append (fixes the "slice-set only supports contiguous tensors" error with certain GQA ratios; see Contiguity Fix Details below)
quantized_gemma3.rs: Added clear_kv_cache() method for multi-turn inference
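
A self-contained sketch of the cache-clearing method; the struct and field names here are assumptions, but candle_nn::kv_cache::KvCache::reset() is the real call that empties a cache:

```rust
use candle_nn::kv_cache::KvCache;

struct Attention { kv_cache: KvCache }
struct Layer { attn: Attention }
struct ModelWeights { layers: Vec<Layer> } // hypothetical field layout

impl ModelWeights {
    /// Reset every layer's KV cache so the next conversation turn
    /// does not attend to stale entries from the previous one.
    pub fn clear_kv_cache(&mut self) {
        for layer in self.layers.iter_mut() {
            layer.attn.kv_cache.reset();
        }
    }
}
```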

Usage Notes

Full-precision models auto-download from the HuggingFace Hub
Quantized inference requires a local GGUF file passed via --model-path (Google publishes no official GGUF conversions; community conversions are available on HuggingFace); both paths are sketched below
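
A sketch of the two loading paths; the repo id and file names are placeholders, not the ones examples/translate-gemma.rs actually uses:

```rust
use candle_core::quantized::gguf_file;
use hf_hub::api::sync::Api;

fn main() -> anyhow::Result<()> {
    // Full precision: fetch weights from the HuggingFace Hub on first run.
    // NOTE: "google/translate-gemma-4b" is a placeholder repo id.
    let repo = Api::new()?.model("google/translate-gemma-4b".to_string());
    let _weights_path = repo.get("model.safetensors")?;

    // Quantized: parse a local GGUF file supplied via --model-path.
    let mut file = std::fs::File::open("translate-gemma-4b-q4_k_m.gguf")?;
    let _gguf = gguf_file::Content::read(&mut file)?;
    Ok(())
}
```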

Known Issue

Outputs differ between the full-precision and quantized models: investigation shows gemma3.rs uses GELU while quantized_gemma3.rs uses SiLU. This is a gemma3.rs issue, not specific to TranslateGemma.
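
The two activations disagree everywhere except near zero, which is enough to make the logits diverge; a quick check using candle's real Tensor::gelu/Tensor::silu ops:

```rust
use candle_core::{Device, Tensor};

fn main() -> candle_core::Result<()> {
    let xs = Tensor::new(&[-2.0f32, -0.5, 0.5, 2.0], &Device::Cpu)?;
    // gelu(x) = x * Phi(x) (tanh approximation in candle); silu(x) = x * sigmoid(x).
    println!("gelu: {:?}", xs.gelu()?.to_vec1::<f32>()?);
    println!("silu: {:?}", xs.silu()?.to_vec1::<f32>()?);
    Ok(())
}
```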

Contiguity Fix Details

Key and value states become non-contiguous after transpose, but KvCache::append() requires contiguous tensors for slice_set. This worked for some model dimensions but failed for others (e.g., TranslateGemma 4B, which has a different GQA ratio).
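
A runnable reproduction of the fix; KvCache::new/append, Tensor::transpose, and Tensor::contiguous are real candle APIs, while the tensor shapes here are illustrative:

```rust
use candle_core::{DType, Device, Tensor};
use candle_nn::kv_cache::KvCache;

fn main() -> candle_core::Result<()> {
    let dev = Device::Cpu;
    // (batch, seq_len, n_kv_heads, head_dim), as produced before the transpose.
    let k = Tensor::zeros((1, 3, 4, 8), DType::F32, &dev)?;
    let v = Tensor::zeros((1, 3, 4, 8), DType::F32, &dev)?;

    // Transposing to (batch, n_kv_heads, seq_len, head_dim) yields a strided
    // view; KvCache::append uses slice_set, which needs contiguous input.
    let k = k.transpose(1, 2)?.contiguous()?; // <- the fix: .contiguous()
    let v = v.transpose(1, 2)?.contiguous()?;

    // The cache appends along the sequence dimension (dim 2 after transpose).
    let mut cache = KvCache::new(2, 64);
    let (_k, _v) = cache.append(&k, &v)?;
    Ok(())
}
```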
@DrJesseGlass marked this pull request as ready for review January 23, 2026 20:56