ramalama-stack is currently only built/tested to be a remote provider, but it's worth exploring the possibility of also adding support to be an inline provider
See https://llama-stack.readthedocs.io/en/latest/providers/index.html for more information about remote versus inline providers