Releases · SystemPanic/flashinfer-windows

AOT version (flashinfer_python-0.3.0-cp39-abi3-win_amd64.whl) has all the kernels precompiled inside the wheel (for production).
Built with MOE, Gemma, OAI-Oss, misc and activation kernels.

JIT version (flashinfer_python-0.3.0-py3-none-any.whl) compiles the required kernels at runtime (for development).

Assets 4

01 Sep 05:28

SystemPanic

v0.2.14.post1

702e3c1

v0.2.14.post1

Windows build for Cuda 12.4, Pytorch 2.6.0 nightly with Gloo distributed backend, cuDSS, cuDNN and cuBLAS.

AOT version (flashinfer_python-0.2.14.post1-cp39-abi3-win_amd64.whl) has all the kernels precompiled inside the wheel (for production).

JIT version (flashinfer_python-0.2.14.post1-py3-none-any.whl) compiles the required kernels at runtime (for development).

Assets 4

26 Jul 18:26

SystemPanic

v0.2.8

e09d10e

v0.2.8

Windows build for Cuda 12.4, Pytorch 2.6.0 nightly with Gloo distributed backend, cuDSS, cuDNN and cuBLAS.

AOT version (flashinfer_python-0.2.8-cp39-abi3-win_amd64.whl) has all the kernels precompiled inside the wheel (for production).

JIT version (flashinfer_python-0.2.8-py3-none-any.whl) compiles the required kernels at runtime (for development).

Assets 4

26 Jul 02:24

SystemPanic

v0.2.7.post1

00159d3

v0.2.7.post1

Windows build for Cuda 12.4, Pytorch 2.6.0 nightly with Gloo distributed backend, cuDSS, cuDNN and cuBLAS.

AOT version (flashinfer_python-0.2.7.post1-cp39-abi3-win_amd64.whl) has all the kernels precompiled inside the wheel (for production).

JIT version (flashinfer_python-0.2.7.post1-py3-none-any.whl) compiles the required kernels at runtime (for development).

Assets 4

22 Jun 00:25

SystemPanic

v0.2.6.post1

bb16091

v0.2.6.post1

Windows build for Cuda 12.4, Pytorch 2.6.0 nightly with Gloo distributed backend, cuDSS, cuDNN and cuBLAS.

AOT version (flashinfer_python-0.2.6.post1-cp39-abi3-win_amd64.whl) has all the kernels precompiled inside the wheel (for production).

JIT version (flashinfer_python-0.2.6.post1-py3-none-any.whl) compiles the required kernels at runtime (for development).

Assets 4

04 May 00:44

SystemPanic

v0.2.2.post1

98129f1

v0.2.2.post1

Windows build for Cuda 12.4, Pytorch 2.6.0 nightly with Gloo distributed backend, cuDSS, cuDNN and cuBLAS.

AOT version (flashinfer_python-0.2.2.post1-cp312-abi3-win_amd64.wheel) has all the kernels precompiled inside the wheel (for production).

JIT version (flashinfer_python-0.2.2.post1-py3-none-any.whl) compiles the required kernels at runtime (for development).

Assets 4

07 Apr 23:15

SystemPanic

v0.2.5

86876f6

v0.2.5

Windows build for Cuda 12.4, Pytorch 2.6.0 nightly with Gloo distributed backend, cuDSS, cuDNN and cuBLAS.

AOT version (flashinfer_python-0.2.5-cp312-abi3-win_amd64) has all the kernels precompiled inside the wheel (for production).

JIT version (flashinfer_python-0.2.5-py3-none-any.whl) compiles the required kernels at runtime (for development).

Assets 4

Releases: SystemPanic/flashinfer-windows

v0.6.7.post1

Uh oh!

v0.6.3

Uh oh!

v0.4.1

Uh oh!

v0.3.0

Uh oh!

v0.2.14.post1

Uh oh!

v0.2.8

Uh oh!

v0.2.7.post1

Uh oh!

v0.2.6.post1

Uh oh!

v0.2.2.post1

Uh oh!

v0.2.5

Uh oh!