Releases: SystemPanic/flashinfer-windows

v0.2.3

22 Mar 03:37

Windows build for CUDA 12.4 and PyTorch 2.6.0 nightly, with the Gloo distributed backend, cuDSS, cuDNN and cuBLAS.

The AOT (ahead-of-time) wheel (flashinfer_python-0.2.3+cu124torch2.6-cp312-cp312-win_amd64.whl) ships with all kernels precompiled and is intended for production. Its tags tie it to CPython 3.12 on 64-bit Windows.

The JIT (just-in-time) wheel (flashinfer_python-0.2.3+cu124torch2.6-py3-none-any.whl) compiles the required kernels at runtime and is intended for development.
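
The two filenames differ only in their trailing tags, which follow the standard wheel naming scheme (name, version, Python tag, ABI tag, platform tag). As a rough sketch, the snippet below splits each filename into those fields and reports whether the wheel is interpreter- and platform-specific. The helper names `parse_wheel_name` and `is_prebuilt` are hypothetical and not part of FlashInfer or pip, and the parser assumes no optional build tag is present, which holds for both files in this release.

```python
AOT_WHEEL = "flashinfer_python-0.2.3+cu124torch2.6-cp312-cp312-win_amd64.whl"
JIT_WHEEL = "flashinfer_python-0.2.3+cu124torch2.6-py3-none-any.whl"


def parse_wheel_name(filename):
    """Split a wheel filename into its five standard fields.

    Hypothetical helper: assumes the filename has no optional build tag.
    """
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel filename: {filename}")
    stem = filename[: -len(".whl")]
    name, version, python_tag, abi_tag, platform_tag = stem.split("-")
    return {
        "name": name,
        "version": version,
        "python": python_tag,
        "abi": abi_tag,
        "platform": platform_tag,
    }


def is_prebuilt(filename):
    """Return True if the wheel targets a specific ABI or platform.

    A pure-Python wheel carries the tags 'none' (ABI) and 'any' (platform).
    """
    tags = parse_wheel_name(filename)
    return tags["abi"] != "none" or tags["platform"] != "any"


if __name__ == "__main__":
    for wheel in (AOT_WHEEL, JIT_WHEEL):
        tags = parse_wheel_name(wheel)
        kind = "AOT (precompiled)" if is_prebuilt(wheel) else "JIT (pure Python)"
        print(f"{kind}: python={tags['python']} abi={tags['abi']} platform={tags['platform']}")
```

In practice pip performs this tag matching itself when given a wheel file, so the sketch is only meant to show why the AOT wheel is restricted to one interpreter and platform while the JIT wheel is not.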