A collection of algorithms and data structures
-
Updated
Dec 30, 2024 - Java
A collection of algorithms and data structures
BLAS-like Library Instantiation Software Framework
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Acceleration package for neural networks on multi-core CPUs
Tuned OpenCL BLAS
Fast Clojure Matrix Library
Introduction to PyTorch, covering tensor initialization, operations, indexing, and reshaping.
BLISlab: A Sandbox for Optimizing GEMM
Multi-Threaded FP32 Matrix Multiplication on x86 CPUs
The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
A library and extension that provides objects for scientific computing in PHP.
[DEPRECATED] Moved to ROCm/rocm-libraries repo
💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
Accelerated General (FP32) Matrix Multiplication from scratch in CUDA
Sparse matrix formats for linear algebra supporting scientific and machine learning applications
DBCSR: Distributed Block Compressed Sparse Row matrix library
Add a description, image, and links to the matrix-multiplication topic page so that developers can more easily learn about it.
To associate your repository with the matrix-multiplication topic, visit your repo's landing page and select "manage topics."