A high-performance, educational C++ inference runtime for Large Language Models (LLMs), inspired by projects like llm.c and llama.cpp.
- Custom Tensor and Matrix Multiplication implementation
- 12-layer GPT-2 transformer forward pass
- Tokenizer with vocab loading
- Sampling strategies: greedy, top-k, top-p
- Interactive mode for text generation
Ensure Visual Studio Build Tools are installed, then run:
- `.\compile.bat` builds the engine.
- `.\llm_engine.exe` launches interactive mode.

Enter prompts like "Once upon a time" and press Enter to generate continuations.
Place these in the project root (not included in repo):
- `gpt2_weights.bin`
- `gpt2_config.json`
- `vocab.json`
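The exact loading code depends on how the weights were exported. As a minimal sketch, assuming `gpt2_weights.bin` is a flat array of little-endian float32 values (an assumption; check your export script for the real layout), reading it could look like this:

```cpp
#include <fstream>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical loader: assumes the file is a raw dump of float32 values.
// The real export format (headers, per-tensor offsets) may differ.
std::vector<float> load_weights(const std::string& path) {
    std::ifstream file(path, std::ios::binary | std::ios::ate);
    if (!file) throw std::runtime_error("cannot open " + path);
    std::streamsize bytes = file.tellg();
    file.seekg(0, std::ios::beg);
    std::vector<float> weights(static_cast<size_t>(bytes) / sizeof(float));
    file.read(reinterpret_cast<char*>(weights.data()), bytes);
    return weights;
}
```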
The development roadmap is organized into phases:
Phase 1: Foundation
- Tensor class with strided memory (see the sketch after this list)
- Matrix multiplication kernels
- Basic tensor operations
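To make the Phase 1 goals concrete, here is a minimal sketch of a strided tensor and a naive matmul kernel. The layout and names are illustrative, not necessarily the project's actual API:

```cpp
#include <cstddef>
#include <vector>

// Illustrative 2-D tensor with row-major strided storage.
struct Tensor {
    std::vector<float> data;
    size_t rows, cols;
    size_t stride;  // elements between the starts of consecutive rows

    Tensor(size_t r, size_t c) : data(r * c, 0.0f), rows(r), cols(c), stride(c) {}
    float& at(size_t i, size_t j) { return data[i * stride + j]; }
    float at(size_t i, size_t j) const { return data[i * stride + j]; }
};

// Naive O(n^3) matmul kernel: C = A * B. The i-k-j loop order keeps
// the inner loop walking rows of B and C contiguously.
Tensor matmul(const Tensor& a, const Tensor& b) {
    Tensor c(a.rows, b.cols);
    for (size_t i = 0; i < a.rows; ++i)
        for (size_t k = 0; k < a.cols; ++k) {
            float aik = a.at(i, k);
            for (size_t j = 0; j < b.cols; ++j)
                c.at(i, j) += aik * b.at(k, j);
        }
    return c;
}
```

Keeping `stride` separate from `cols` is what later makes zero-copy views (row slices, padded buffers) possible.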
Phase 2: Architecture
- LayerNorm implementation (see the sketch after this list)
- Linear layers
- GELU activation
- Multi-head attention mechanism
- KV cache support
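Two of these blocks are compact enough to sketch inline: LayerNorm normalizes each vector to zero mean and unit variance before applying a learned scale and shift, and GPT-2 uses the tanh approximation of GELU. Signatures here are illustrative:

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// LayerNorm over a single vector x, with learned gamma (scale) and beta (shift).
void layernorm(std::vector<float>& x, const std::vector<float>& gamma,
               const std::vector<float>& beta, float eps = 1e-5f) {
    float mean = 0.0f;
    for (float v : x) mean += v;
    mean /= x.size();
    float var = 0.0f;
    for (float v : x) var += (v - mean) * (v - mean);
    var /= x.size();
    const float inv_std = 1.0f / std::sqrt(var + eps);
    for (size_t i = 0; i < x.size(); ++i)
        x[i] = (x[i] - mean) * inv_std * gamma[i] + beta[i];
}

// Tanh approximation of GELU, as used by GPT-2.
float gelu(float x) {
    const float k = 0.7978845608f;  // sqrt(2 / pi)
    return 0.5f * x * (1.0f + std::tanh(k * (x + 0.044715f * x * x * x)));
}
```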
Phase 3: Runtime
- BPE tokenizer improvements
- Greedy sampling
- Temperature-based sampling
- Top-k and top-p sampling (top-k sketched after this list)
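As an illustration of the sampling step, here is a sketch of top-k sampling with temperature; top-p works the same way except it truncates by cumulative probability rather than by count. Names and signatures are hypothetical:

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <random>
#include <vector>

// Keep the k highest logits, apply a temperature-scaled softmax over
// them, and draw one token id from the resulting distribution.
size_t sample_top_k(const std::vector<float>& logits, size_t k,
                    float temperature, std::mt19937& rng) {
    std::vector<size_t> idx(logits.size());
    for (size_t i = 0; i < idx.size(); ++i) idx[i] = i;
    k = std::min(k, logits.size());
    std::partial_sort(idx.begin(), idx.begin() + k, idx.end(),
                      [&](size_t a, size_t b) { return logits[a] > logits[b]; });

    // Softmax over the top k; subtracting the max keeps exp() stable.
    std::vector<float> weights(k);
    const float max_logit = logits[idx[0]];
    for (size_t i = 0; i < k; ++i)
        weights[i] = std::exp((logits[idx[i]] - max_logit) / temperature);

    std::discrete_distribution<size_t> dist(weights.begin(), weights.end());
    return idx[dist(rng)];
}
```

Greedy sampling is the no-randomness special case: just take the argmax of the logits.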
Phase 4: Optimization
- SIMD (AVX/NEON) intrinsics (see the sketch after this list)
- Multithreading with OpenMP
- Memory optimizations
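To give a flavor of the SIMD work, here is a sketch of an AVX dot product, the inner kernel of a matmul. It assumes an x86 target built with AVX enabled (e.g. `/arch:AVX` under MSVC); a NEON version would use `<arm_neon.h>` instead:

```cpp
#include <cstddef>
#include <immintrin.h>  // AVX intrinsics (x86)

// Processes 8 floats per iteration, then reduces and handles the tail.
float dot_avx(const float* a, const float* b, size_t n) {
    __m256 acc = _mm256_setzero_ps();
    size_t i = 0;
    for (; i + 8 <= n; i += 8) {
        __m256 va = _mm256_loadu_ps(a + i);
        __m256 vb = _mm256_loadu_ps(b + i);
        acc = _mm256_add_ps(acc, _mm256_mul_ps(va, vb));
    }
    // Horizontal reduction of the 8 partial sums.
    float tmp[8];
    _mm256_storeu_ps(tmp, acc);
    float sum = tmp[0] + tmp[1] + tmp[2] + tmp[3]
              + tmp[4] + tmp[5] + tmp[6] + tmp[7];
    for (; i < n; ++i) sum += a[i] * b[i];  // scalar tail
    return sum;
}
```

OpenMP then layers on top: a `#pragma omp parallel for` over the output rows of the matmul distributes these kernels across cores.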
Phase 5: Advanced Features
- Flash Attention
- Quantization (Int8/4-bit) (int8 sketched after this list)
- CUDA/GPU backend
- Distributed inference
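Of these, int8 quantization is the most self-contained to sketch. A symmetric per-tensor scheme maps [-max_abs, +max_abs] onto [-127, 127] with a single scale factor (illustrative, not the project's final design):

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

struct QuantizedTensor {
    std::vector<int8_t> data;
    float scale;  // dequantize with: value = data[i] * scale
};

// Symmetric per-tensor int8 quantization.
QuantizedTensor quantize_int8(const std::vector<float>& x) {
    float max_abs = 0.0f;
    for (float v : x) max_abs = std::max(max_abs, std::fabs(v));
    QuantizedTensor q;
    q.scale = (max_abs > 0.0f) ? max_abs / 127.0f : 1.0f;
    q.data.reserve(x.size());
    for (float v : x)
        q.data.push_back(static_cast<int8_t>(std::lround(v / q.scale)));
    return q;
}
```

Per-tensor scaling is the simplest variant; per-row (per-channel) scales usually recover noticeably more accuracy at the same bit width.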
Contributions are welcome; see CONTRIBUTING.md for guidelines.

Licensed under the MIT License.