Many of the underlying operators don't support CUDA, so dfdx-mamba doesn't either. Note: this refers to minimal CUDA support built on the underlying operators, not to a fused CUDA kernel.