AI DevTech Engineer @NVIDIA.
PhD in HPC @ Georgia Tech.
-
NVIDIA
- Santa Clara, CA
-
23:09
(UTC -08:00) - huanghua1994.github.io
- in/hua-huang-146a1b104
Highlights
- Pro
Pinned Loading
-
-
NVIDIA/cutlass
NVIDIA/cutlass PublicCUDA Templates and Python DSLs for High-Performance Linear Algebra
-
NVIDIA/TransformerEngine
NVIDIA/TransformerEngine PublicA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
-
NVIDIA/cutile-python
NVIDIA/cutile-python PubliccuTile is a programming model for writing parallel kernels for NVIDIA GPUs
-
NVIDIA/TileGym
NVIDIA/TileGym PublicHelpful kernel tutorials and examples for tile-based GPU programming
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.




