Experimental small LLM project for learning and testing—outputs are often nonsensical.
A lightweight, end-to-end implementation of Stable Diffusion built from first principles on a single T4 GPU. Features a custom 192-channel U-Net, a VAE, and a CLIP encoder, optimized for consumer hardware and trained on approximately 168k images.
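For context, a minimal sketch of how these three components compose at sampling time in a latent-diffusion pipeline is shown below. The function and parameter names (`sample`, `scheduler`, `latent_shape`) and the component interfaces are assumptions for illustration, not this repository's actual API.

```python
# Minimal sketch (assumed interfaces) of a latent-diffusion sampling loop:
# CLIP text conditioning -> U-Net denoising in latent space -> VAE decoding.
import torch


@torch.no_grad()
def sample(unet, vae, clip_text_encoder, tokenizer, prompt: str,
           scheduler, steps: int = 50, latent_shape=(1, 4, 32, 32)):
    # 1. Encode the prompt with the CLIP text encoder (hypothetical interface).
    token_ids = tokenizer(prompt)
    context = clip_text_encoder(token_ids)            # (1, seq_len, d_text)

    # 2. Start from Gaussian noise in the VAE latent space.
    latents = torch.randn(latent_shape)

    # 3. Iteratively denoise with the U-Net, conditioned on the text embedding.
    for t in scheduler.timesteps(steps):
        noise_pred = unet(latents, t, context)        # predict the added noise
        latents = scheduler.step(noise_pred, t, latents)

    # 4. Decode the final latents back to pixel space with the VAE decoder.
    return vae.decode(latents)
```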
A from-scratch PyTorch LLM implementing a Sparse Mixture-of-Experts (MoE) architecture with Top-2 gating. Integrates modern Llama-3 components (RMSNorm, SwiGLU, RoPE, GQA) and a custom byte-level BPE tokenizer. Pre-trained on a curated corpus of existential and dark philosophical literature.
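For readers unfamiliar with Top-2 gating, here is a minimal sketch of a sparse MoE layer in PyTorch. The class and parameter names (`SparseMoE`, `Expert`, `n_experts`, `d_hidden`) are illustrative assumptions, not code from the repository, and the auxiliary load-balancing loss used in real MoE training is omitted.

```python
# Minimal sketch of Top-2 sparse MoE routing (illustrative, not the repo's code).
import torch
import torch.nn as nn
import torch.nn.functional as F


class Expert(nn.Module):
    """A small SwiGLU-style feed-forward expert."""
    def __init__(self, d_model: int, d_hidden: int):
        super().__init__()
        self.w_gate = nn.Linear(d_model, d_hidden, bias=False)
        self.w_up = nn.Linear(d_model, d_hidden, bias=False)
        self.w_down = nn.Linear(d_hidden, d_model, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.w_down(F.silu(self.w_gate(x)) * self.w_up(x))


class SparseMoE(nn.Module):
    """Routes each token to its top-2 experts and mixes their outputs."""
    def __init__(self, d_model: int, d_hidden: int, n_experts: int = 8, top_k: int = 2):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList([Expert(d_model, d_hidden) for _ in range(n_experts)])
        self.top_k = top_k

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        tokens = x.reshape(-1, x.shape[-1])                 # flatten (batch, seq) -> tokens
        logits = self.router(tokens)                        # (n_tokens, n_experts)
        weights, indices = logits.topk(self.top_k, dim=-1)  # top-2 experts per token
        weights = F.softmax(weights, dim=-1)                # renormalize over the chosen 2
        out = torch.zeros_like(tokens)
        for expert_id, expert in enumerate(self.experts):
            mask = indices == expert_id                     # (n_tokens, top_k)
            token_ids, slot = mask.nonzero(as_tuple=True)
            if token_ids.numel() == 0:
                continue
            # token_ids are unique here because topk indices are distinct per token
            out[token_ids] += weights[token_ids, slot].unsqueeze(-1) * expert(tokens[token_ids])
        return out.reshape_as(x)
```

Because only two of the eight experts run per token, the layer adds parameters without a proportional increase in per-token compute, which is the usual motivation for sparse MoE over a single dense feed-forward block.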