🎥 KMP-Llama: SmolVLM Camera App

This repository is a simple demo for how to use llama.cpp server and mobile application with SmolVLM 500M to get real-time object detection

How to setup on Laptop <> Android

Install llama.cpp
Run llama-server -hf ggml-org/SmolVLM-500M-Instruct-GGUF --host 0.0.0.0 --port 8080
Note: you may need to add -ngl 99 to enable GPU (if you are using NVidia/AMD/Intel GPU)
Note (2): You can also try other models here
Run ifconfig | grep "inet"to get the LAN (Wi-Fi) address Example: inet 127.0.0.1 netmask 0xff000000 inet 192.168.0.244 netmask 0xffffff00 broadcast 192.168.0.255
Run KMP App project (eg. Android)
Optionally change the instruction (for example, make it returns JSON)
Click on "Start" and enjoy

How to setup on Local Android

Install Termux from Google Play
pkg update && pkg upgrade
pkg install cmake clang make git wget
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp
mkdir build cd build cmake .. cmake - -build . - -config Release
./bin/llama-server -hf ggml-org/SmolVLM-500M-Instruct-GGUF

Clone and Run

git clone <repository-url>
cd kmp-llama

# Android
./gradlew :composeApp:installDebug

# Desktop
./gradlew :composeApp:run

# iOS
open iosApp/iosApp.xcodeproj

🔌 API Integration

SmolVLM Configuration

{
  "server_url": "http://192.168.0.244:8080",
  "endpoint": "/v1/chat/completions",
  "format": "OpenAI-compatible"
}

Request Format

VisionRequest(
  model = "smolvlm",
  messages = [
    Message(
      role = "user",
      content = [
        Content(type = "text", text = "What do you see?"),
        Content(type = "image_url", 
               imageUrl = ImageUrl("data:image/jpeg;base64,..."))
      ]
    )
  ]
)

📱 Platform Implementation Status

Platform	UI	Camera	API	Status
Android	✅	✅ CameraX	✅ Ktor	Complete
iOS	✅	🔄 AVFoundation	✅ Ktor	UI Ready
Desktop	✅	🔄 Webcam	✅ Ktor	UI Ready

🎯 Roadmap

Immediate (v1.1)

iOS camera implementation with AVFoundation
Desktop webcam integration
Image gallery and history
Offline model support

Future (v2.0)

Multi-model support (GPT-4V, Claude Vision)
Voice commands and audio responses
Real-time object tracking
AR overlay integration
Cloud sync and sharing

Future (v3.0)

mobile local LLM (inspired by https://github.com/a-ghorbani/pocketpal-ai)

🤝 Contributing

Fork the repository
Create feature branch: git checkout -b feature/amazing-feature
Commit changes: git commit -m 'Add amazing feature'
Push to branch: git push origin feature/amazing-feature
Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Please use this bibtex if you want to cite this repository in your publications:

@misc{kmpllama,
   author = {Sholichin, Fauzi},
   title = {KMP-Llama: SmolVLM Camera App},
   year = {2025},
   publisher = {GitHub},
   journal = {GitHub repository},
   howpublished = {\url{https://github.com/fauzisho/kmp-llama}},
  }

Built with ❤️ using Kotlin Multiplatform

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
composeApp		composeApp
gradle		gradle
iosApp		iosApp
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
build.gradle.kts		build.gradle.kts
demo.gif		demo.gif
demo2.gif		demo2.gif
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
settings.gradle.kts		settings.gradle.kts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎥 KMP-Llama: SmolVLM Camera App

How to setup on Laptop <> Android

How to setup on Local Android

Clone and Run

🔌 API Integration

SmolVLM Configuration

Request Format

📱 Platform Implementation Status

🎯 Roadmap

Immediate (v1.1)

Future (v2.0)

Future (v3.0)

🤝 Contributing

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎥 KMP-Llama: SmolVLM Camera App

How to setup on Laptop <> Android

How to setup on Local Android

Clone and Run

🔌 API Integration

SmolVLM Configuration

Request Format

📱 Platform Implementation Status

🎯 Roadmap

Immediate (v1.1)

Future (v2.0)

Future (v3.0)

🤝 Contributing

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages