runanywhere.ai and cactuscompute.com allows users to download models and use them on-device.
the current LLAMA and MNN providers work well enough, but they need the user to get the gguf file.
whereas, Runanywhere and CactusCompute allow the users to download any of the models that they provide.
thanks. @AAswordman