model-kernels

CUDA kernels created with the kernels library.

attention-int8: A high-performance INT8 fused attention kernel for diffusion transformers.
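
To give a rough idea of what "INT8 attention" means, here is a minimal pure-Python sketch. It is not the code in this repository and it is not fused: it only illustrates the general idea of quantizing Q and K to the INT8 range, accumulating the dot products as integers, and rescaling before the softmax. The function names (`quantize_int8`, `attention_int8`, `attention_fp`) and the symmetric per-tensor scaling scheme are assumptions chosen for illustration.

```python
import math

def quantize_int8(rows):
    """Symmetric per-tensor quantization (illustrative assumption).
    Returns integers clamped to [-127, 127] and the scale to dequantize them."""
    max_abs = max(abs(x) for row in rows for x in row)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = [[max(-127, min(127, round(x / scale))) for x in row] for row in rows]
    return q, scale

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def weighted_sum(probs, v):
    """Combine the rows of v using the attention probabilities."""
    return [sum(p * row[c] for p, row in zip(probs, v)) for c in range(len(v[0]))]

def attention_int8(q, k, v):
    """Attention where Q.K^T is computed on quantized integers."""
    d = len(q[0])
    q_i8, q_scale = quantize_int8(q)
    k_i8, k_scale = quantize_int8(k)
    out = []
    for qi in q_i8:
        # Integer dot product, then dequantize with the product of both scales.
        logits = [
            sum(a * b for a, b in zip(qi, kj)) * q_scale * k_scale / math.sqrt(d)
            for kj in k_i8
        ]
        out.append(weighted_sum(softmax(logits), v))
    return out

def attention_fp(q, k, v):
    """Floating-point reference for comparison."""
    d = len(q[0])
    out = []
    for qi in q:
        logits = [sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d) for kj in k]
        out.append(weighted_sum(softmax(logits), v))
    return out

# Small hypothetical inputs: 2 queries, 3 keys/values, head dimension 3.
Q = [[0.1, -0.4, 0.8], [0.5, 0.2, -0.3]]
K = [[0.3, 0.7, -0.2], [-0.6, 0.1, 0.4], [0.2, -0.5, 0.9]]
V = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]

approx = attention_int8(Q, K, V)
exact = attention_fp(Q, K, V)
max_err = max(abs(a - b) for ra, rb in zip(approx, exact) for a, b in zip(ra, rb))
print(f"max abs difference vs. floating point: {max_err:.6f}")
```

A real fused kernel would do the quantization, the matrix products, and the softmax in a single GPU pass instead of materializing the intermediate logits; the sketch above only shows the arithmetic.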