Skip to content
View Mantissagithub's full-sized avatar
🙂
🙂

Highlights

  • Pro

Organizations

@HyperKuvid-Labs

Block or report Mantissagithub

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Mantissagithub/README.md

Pradheep P

CS Undergrad

PortfolioLinkedInX


messing with rl training on a100s rn — benchmarking efficiency, vram hacks, and ways to speed it up. speculative decoding history in the works too.

Check out the organization we're building: HyperKuvid-Labshttps://github.com/HyperKuvid-Labs

have a sweet spot for a100s — matching the vram needs perfectly at low cost for experiments. also renting gpus from primeintellect.ai for bigger rl runs.


Tech Stack:

stuff i'm using: python, pytorch, cuda, trl, unsloth, a100 gpus... benchmarking and tweaking for faster rl loops.


github stats (because why not)

Isometric commit calendar

Pinned Loading

  1. HyperKuvid-Labs/FrugalSOT HyperKuvid-Labs/FrugalSOT Public

    An adaptive model selection system for efficient on-device NLP inference, enhancing speed, privacy, and resource use on edge devices.

    TypeScript 1

  2. HyperKuvid-Labs/AlphaDesign HyperKuvid-Labs/AlphaDesign Public

    Hybrid AI framework combining reinforcement learning and genetic algorithms to optimize Formula 1 front wing aerodynamic designs. Features neural network-guided optimization, CFD analysis, structur…

    Python 1 1

  3. HyperKuvid-Labs/energy_throttling_llms HyperKuvid-Labs/energy_throttling_llms Public

    Energy-aware DDPG RL framework that dynamically optimizes LLM speculative decoding parameters based on real-time hardware metrics (CPU/GPU temps, battery). Maintains 95-98% energy utilization to ma…

    Python

  4. HyperKuvid-Labs/Gideon HyperKuvid-Labs/Gideon Public

    Emotion-aware LLM chat interface with Ollama, FastAPI, and React, featuring real-time image generation and project automation via Alpha<Stack> agent. Accelerates full-stack and blockchain dev by 60…

    Python

  5. HyperKuvid-Labs/alpha-stack HyperKuvid-Labs/alpha-stack Public

    A universal, AI-powered development agent that supports any tech stack—battle-tested across 25+ full dev cycles. It intelligently scaffolds and iterates on complex projects with automated feedback …

    Python 3 1

  6. HyperKuvid-Labs/AlphaD-RL HyperKuvid-Labs/AlphaD-RL Public

    Multi-Teacher Monte Carlo Tree Search (MT-MCTS) for code generation, where 3+ diverse teacher models (DeepSeek-Coder, CodeLlama, Qwen2.5-Coder) propose token paths that form the MCTS search tree. T…

    Python