Popular repositories Loading
-
RWKV-LM
RWKV-LM PublicForked from BlinkDL/RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
Python
-
Muon-RMS-Norm
Muon-RMS-Norm PublicThis version of Muon converges slightly faster than the Muon from modded-nanogpt in some cases. The change is RMS-Norm after orthogonalization over the first dimension of the weight matrix (last di…
Python 2
-
-
-
Qwen3-VL
Qwen3-VL PublicForked from QwenLM/Qwen3-VL
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Jupyter Notebook
If the problem persists, check the GitHub status page or contact support.


