PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioners, a low-rank approximation preconditioner, and more)
Updated Jan 11, 2026 · Python
A learning resource for the Muon optimizer: Newton-Schulz orthogonalization, theory, code examples, and production guides
High-performance CUDA implementation of Muon optimizer for LLM training. Features Newton-Schulz polar decomposition, cuBLAS acceleration, and transpose optimization for 8x FLOP savings on transformer FFN layers. Benchmarked on NVIDIA A100 with Llama 3.1 8B architectures (4096×11008 weights).
A performance-optimized Muon optimizer implementation for PyTorch
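Several of the entries above center on Newton-Schulz orthogonalization, the iteration Muon uses to approximate the orthogonal factor of a gradient matrix's polar decomposition. A minimal NumPy sketch may help situate them; the quintic coefficients below follow the widely circulated Muon reference implementation, and `newton_schulz_orthogonalize` is an illustrative name, not an API from any repository listed here.

```python
import numpy as np

def newton_schulz_orthogonalize(G, steps=5, eps=1e-7):
    """Approximately orthogonalize G via a quintic Newton-Schulz iteration.

    Illustrative sketch only: coefficients (a, b, c) are those commonly
    used in Muon implementations; they drive the singular values of G
    toward a band around 1 rather than converging exactly.
    """
    a, b, c = 3.4445, -4.7750, 2.0315
    # Normalize by the Frobenius norm so all singular values start below 1,
    # which is required for the iteration to behave.
    X = G / (np.linalg.norm(G) + eps)
    transposed = X.shape[0] > X.shape[1]
    if transposed:
        X = X.T  # work with the wide orientation, as Muon implementations do
    for _ in range(steps):
        A = X @ X.T
        X = a * X + (b * A + c * (A @ A)) @ X
    return X.T if transposed else X

rng = np.random.default_rng(0)
G = rng.standard_normal((4, 6))
O = newton_schulz_orthogonalize(G)
# After a few steps the singular values of O cluster near 1.
s = np.linalg.svd(O, compute_uv=False)
```

Because the quintic polynomial trades exact convergence for speed, the result is only approximately orthogonal; that approximation is reportedly sufficient for Muon's update step.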