Easy LLM inference with quantization using tinyops, with tinygrad as the backend.
```python
from tinyops import LLaMa, Gemma
from tinygrad import Device
from pathlib import Path

model_path = Path("Path to model")
tokenizer_path = Path("Path to tokenizer.model file")
shard = 1  # number of devices to shard the model across
device = tuple(f"{Device.DEFAULT}:{i}" for i in range(shard)) if shard > 1 else Device.DEFAULT

# tinyops LLaMa options -> gen = [1, 2, 3], size = [7B, 13B, 70B], quant = [nf4, int8]
llama = LLaMa.build(model_path, tokenizer_path, model_gen="1", model_size="7B", quantize="nf4", device=device)
output = LLaMa.generate(llama, 5, "I am batman", 0.7, device)
print(output)
```
```python
# Gemma support
gemm = Gemma.build(model_path, tokenizer_path, model_gen="gemma", model_size="2B", device=device)
output = Gemma.Benchmark(gemm, 10, 0.7, device)
# output = Gemma.generate(gemm, 5, "I am batman", 0.7, device)
```

NOTE: GGUF loading dequantizes the model and then loads it.
```python
from tinyops import GGUF_load
from tinygrad import Device
from pathlib import Path

model_path = Path("Path to model")
tokenizer_path = Path("Path to tokenizer.model file")
shard = 1  # number of devices to shard the model across
device = tuple(f"{Device.DEFAULT}:{i}" for i in range(shard)) if shard > 1 else Device.DEFAULT

load_gguf = GGUF_load.build(model_path, tokenizer_path, device=device)
load_gguf.Benchmark(load_gguf, 10, 0.1, device)
```

Supported models and quantization levels:

| Model | Completion | 8-bit | 4-bit | 2-bit |
|---|---|---|---|---|
| llama-2 | ✅ | ✅ | ✅ | - |
| llama-3 | ✅ | ✅ | ✅ | - |
| gemma | ✅ | - | - | - |
| mixtral | ✅ | - | - | - |
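Per the table, llama-2 and llama-3 support completion with 8-bit and 4-bit quantization. In the builder API above, the listed `quant` values are `nf4` (a 4-bit format) and `int8` (8-bit). A minimal sketch of an 8-bit llama-3 build, assuming `model_gen="3"` selects llama-3 as implied by `gen = [1, 2, 3]` (paths are placeholders):

```python
from tinyops import LLaMa
from tinygrad import Device
from pathlib import Path

# Assumptions: model_gen="3" selects llama-3, and quantize="int8" corresponds
# to the table's 8-bit column (nf4 would be the 4-bit one); model_size is
# taken from the size list quoted above.
llama3 = LLaMa.build(Path("Path to model"), Path("Path to tokenizer.model file"),
                     model_gen="3", model_size="7B", quantize="int8", device=Device.DEFAULT)
print(LLaMa.generate(llama3, 5, "I am batman", 0.7, Device.DEFAULT))
```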
To run the TinyAutoML CNN example on a train/test dataset split:

```sh
python3 examples/TinyAutoML_CNN.py -e 1 dataset/train dataset/test
```
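The command passes one flag and two positional dataset paths. A hypothetical sketch of how such a CLI could be parsed; the `--epochs` meaning of `-e` and all names here are assumptions for illustration, not the script's actual code:

```python
import argparse

# Hypothetical re-creation of the example's command-line interface, for
# illustration only; examples/TinyAutoML_CNN.py may parse arguments differently.
parser = argparse.ArgumentParser(description="TinyAutoML CNN example (sketch)")
parser.add_argument("-e", "--epochs", type=int, default=1,
                    help="number of training epochs (assumed meaning of -e)")
parser.add_argument("train_dir", help="path to the training dataset")
parser.add_argument("test_dir", help="path to the test dataset")
args = parser.parse_args()
print(f"epochs={args.epochs} train={args.train_dir} test={args.test_dir}")
```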