Skip to content

Commit 33c867f

Browse files
perf: reduce pre-GEMM overhead in einsum2 prepare path (#95)
* Optimize ND kernel fallback by replacing recursion with iterative carry loop * fmt: apply rustfmt for iterative ND kernel test * perf: reduce pre-gemm overhead in einsum2 prepare path
1 parent 8e6cab2 commit 33c867f

5 files changed

Lines changed: 243 additions & 45 deletions

File tree

strided-einsum2/src/backend.rs

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ pub trait BackendConfig {
2626
/// (e.g., tropical semiring) and pass the backend to [`einsum2_with_backend_into`].
2727
///
2828
/// [`einsum2_with_backend_into`]: crate::einsum2_with_backend_into
29-
pub trait BgemmBackend<T> {
29+
pub trait BgemmBackend<T: crate::ScalarBase> {
3030
/// Execute batched GEMM: `C = alpha * A * B + beta * C` for each batch.
3131
///
3232
/// - `c`: mutable output operand (batch x m x n)

0 commit comments

Comments
 (0)