
OfflineLLM v2.0.0


@jegly jegly released this 04 Apr 07:41
· 29 commits to main since this release
84309ed


A fully offline, private AI chat app for Android. All inference runs on-device via llama.cpp. Zero network permissions.

What's New in v2.0.0

  • Advanced Sampling Parameters — Full control over Temperature, Top-P, Top-K, Min-P, and Repeat Penalty with slider UI and plain-English explanations
  • Context Size Slider — Adjustable from 512 to 16384 tokens
  • Text-to-Speech — Read AI responses aloud (speaker icon on assistant messages)
  • Chat Search — Search messages within conversations
  • Delete Individual Messages — Long-press any message to delete
  • Auto-Title Conversations — Chat titles set automatically from your first message
  • Theme Selector — System Default / Light / Dark / AMOLED Black
  • Accent Colour Picker — 9 colour options
  • Thinking Tag Stripping — Hides `<think>…</think>` blocks emitted by reasoning models
  • Empty Response Fix — No more blank message bubbles
  • Help Screen — Built-in guide for downloading models from HuggingFace
  • About Screen — Version info, license, links
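For readers curious how the sampling parameters above interact, here is a minimal sketch of a llama.cpp-style sampling chain. The parameter names (Temperature, Top-P, Top-K, Min-P, Repeat Penalty) come from the feature list; the chain ordering, default values, and function name are illustrative assumptions, not the app's actual implementation:

```python
import math
import random

def sample_next_token(logits, temperature=0.8, top_k=40, top_p=0.95,
                      min_p=0.05, repeat_penalty=1.1, recent_tokens=()):
    """Illustrative sampling chain: repeat penalty -> Top-K ->
    temperature/softmax -> Top-P -> Min-P -> random draw."""
    logits = dict(logits)  # token -> raw logit

    # Repeat penalty: dampen logits of recently generated tokens.
    for t in recent_tokens:
        if t in logits:
            l = logits[t]
            logits[t] = l / repeat_penalty if l > 0 else l * repeat_penalty

    # Top-K: keep only the K highest-logit tokens.
    items = sorted(logits.items(), key=lambda kv: kv[1], reverse=True)[:top_k]

    # Temperature scaling, then softmax to probabilities.
    mx = max(l for _, l in items)
    exps = [(t, math.exp((l - mx) / temperature)) for t, l in items]
    z = sum(e for _, e in exps)
    probs = sorted(((t, e / z) for t, e in exps),
                   key=lambda kv: kv[1], reverse=True)

    # Top-P (nucleus): keep the smallest prefix whose mass reaches top_p.
    kept, mass = [], 0.0
    for t, p in probs:
        kept.append((t, p))
        mass += p
        if mass >= top_p:
            break

    # Min-P: drop tokens below min_p times the best token's probability.
    floor = min_p * kept[0][1]
    kept = [(t, p) for t, p in kept if p >= floor]

    # Renormalize the surviving tokens and sample one.
    z = sum(p for _, p in kept)
    r = random.random() * z
    for t, p in kept:
        r -= p
        if r <= 0:
            return t
    return kept[-1][0]
```

Lower Temperature and smaller Top-K/Top-P make the draw more deterministic; the Repeat Penalty discourages the model from looping on recent tokens.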

Downloads

  • OfflineLLM-v2.0.0-release.apk — Install directly on any Android 14+ device
  • gemma-3-270m-it-Q4_K_M.gguf — Bundled model, fast on 4GB RAM devices (~300MB)

Install

  1. Download the APK and (optionally) a model file
  2. Enable "Install unknown apps" in Android settings
  3. Install the APK, complete onboarding
  4. Import the GGUF model from Settings → Import GGUF Model
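If a computer is available, the same install can be done over USB with adb. This sketch assumes Android platform-tools are installed, USB debugging is enabled, and that `/sdcard/Download/` is a convenient place to stage the model for import (the app's expected import location may differ):

```shell
# Sideload the release APK onto the connected device.
adb install OfflineLLM-v2.0.0-release.apk

# Stage the bundled model in Downloads, then import it in-app
# via Settings -> Import GGUF Model.
adb push gemma-3-270m-it-Q4_K_M.gguf /sdcard/Download/
```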

Recommended Models

| Model | Size | Best For |
| --- | --- | --- |
| Gemma 3 270M Q4_K_M | ~300 MB | 4GB RAM, fast responses |
| Qwen3.5 0.8B Q4_K_M | ~530 MB | 4-6GB RAM, good balance |
| Gemma 3 1B Q4_K_M | ~750 MB | 6-8GB RAM |
| Qwen3.5 4B Q4_K_M | ~2.5 GB | 8GB+ RAM, best quality |