Skip to content
Discussion options

You must be logged in to vote

On Apple Silicon we leverage the GPU (not cpu) via the ONNX Runtime WebGpuExecutionProvider. When you run the model you should see that it is pulling model-generic-gpu - what this means is that the ONNX graph has been optimized for any GPU. For some additional providers like CudaExecutionProvider the ONNX graph has some additional optimizations for CUDA that can be leveraged in the ONNX Runtime.

The WebGPU Execution Provider uses a library called Dawn that converts from the WebGPU shader language to Metal. Therefore, to directly answer the question.... "Foundry Local already supports Metal".

Replies: 3 comments 3 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@samuel100
Comment options

Answer selected by cicorias
@cicorias
Comment options

Comment options

You must be logged in to vote
1 reply
@cicorias
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
4 participants
Converted from issue

This discussion was converted from issue #154 on June 10, 2025 09:39.