Skip to content

v0.3.0

Choose a tag to compare

@SystemPanic SystemPanic released this 01 Sep 18:43
· 672 commits to main since this release

Windows build for Cuda 12.4, Pytorch 2.6.0 nightly with Gloo distributed backend, cuDSS, cuDNN and cuBLAS.

AOT version (flashinfer_python-0.3.0-cp39-abi3-win_amd64.whl) has all the kernels precompiled inside the wheel (for production).
Built with MOE, Gemma, OAI-Oss, misc and activation kernels.

JIT version (flashinfer_python-0.3.0-py3-none-any.whl) compiles the required kernels at runtime (for development).