Bump torch to 2.9.1 with auto-patched cuDNN 9.17 #321

yondonfu · 2026-01-08T20:34:38Z

Summary

Upgrade torch 2.8.0 → 2.9.1, torchvision 0.23.0 → 0.24.1
Upgrade torchao 0.13.0 → 0.15.0, triton 3.4 → 3.5.1
Bump Python 3.10 → 3.12 (flash-attn 2.8.3 wheels for torch 2.9 only support cp312)
Add nvidia-cudnn-cu12 override (9.15+) to fix Conv3D bf16 performance regression
Add automatic cuDNN patching for Windows via .pth file

Why

PyTorch 2.9.1 has a Conv3D bf16 performance regression with cuDNN < 9.15. On Windows, PyTorch bundles cuDNN in torch/lib and loads it by full path, ignoring pip packages. The .pth file automatically copies the newer cuDNN DLLs at Python startup.

The flash-attn 2.8.3 prebuilt wheels for torch 2.9 are only available for Python 3.12 (cp312), requiring the Python version bump.

Changes

.python-version: 3.10.12 → 3.12.8
pyproject.toml:
- requires-python → >=3.12
- Bump deps + add cuDNN override + force-include for .pth
- Update wheel URLs for cp312 (flash-attn, sageattention)
- ruff target-version → py312
.github/workflows/lint.yml: Python 3.10 → 3.12
src/scope/core/patches/cudnn.py: Use importlib.util.find_spec() to find package paths WITHOUT importing torch (prevents DLL locking)
patches.pth: Installed to site-packages, runs at Python startup

Test plan

🤖 Generated with Claude Code

Signed-off-by: Yondon Fu <yondon.fu@gmail.com>

yondonfu · 2026-01-14T22:25:11Z

Linux benchmarking

LongLive test script comparable throughput + latency vs. main

Windows benchmarking

LongLive, StreamDiffusionV2, MemFlow, Krea comparable throughput + latency vs main

Bump to torch 2.9.1 + patch cudnn to fix Conv3D perf regression

1ecc7c1

Signed-off-by: Yondon Fu <yondon.fu@gmail.com>

yondonfu marked this pull request as draft January 8, 2026 20:35

yondonfu added 2 commits January 13, 2026 14:27

Find cudnn package without importing torch to prevent file locking

3c3d58e

Signed-off-by: Yondon Fu <yondon.fu@gmail.com>

Bump to Python 3.12.8

5ca8d2b

Signed-off-by: Yondon Fu <yondon.fu@gmail.com>

yondonfu force-pushed the scope-torch-bump branch from 82e9d74 to 5ca8d2b Compare January 13, 2026 20:23

yondonfu marked this pull request as ready for review January 14, 2026 22:25

yondonfu merged commit b35751a into main Jan 14, 2026
5 checks passed

yondonfu deleted the scope-torch-bump branch January 14, 2026 22:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bump torch to 2.9.1 with auto-patched cuDNN 9.17 #321

Bump torch to 2.9.1 with auto-patched cuDNN 9.17 #321

yondonfu commented Jan 8, 2026 •

edited

Loading

Uh oh!

yondonfu commented Jan 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Bump torch to 2.9.1 with auto-patched cuDNN 9.17 #321

Bump torch to 2.9.1 with auto-patched cuDNN 9.17 #321

Conversation

yondonfu commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Why

Changes

Test plan

Uh oh!

yondonfu commented Jan 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yondonfu commented Jan 8, 2026 •

edited

Loading