
Mantissagithub/Mantissagithub


Pradheep P

CS Undergrad

Portfolio | LinkedIn | X


messing with rl training on a100s rn: benchmarking efficiency, vram hacks, and ways to speed it up. a speculative decoding history is in the works too.

Check out the organization we're building: HyperKuvid-Labs (https://github.com/HyperKuvid-Labs)

a100s are the sweet spot for me: enough vram for my experiments at a low cost. also renting gpus from primeintellect.ai for bigger rl runs.


Tech Stack:

stuff i'm using: python, pytorch, cuda, trl, unsloth, a100 gpus... benchmarking and tweaking for faster rl loops.


