Auto-patch torch_python.dll for StaticCudaLauncher overflow fix #337

yondonfu · 2026-01-13T20:25:25Z

Summary

Adds automatic binary patching of torch_python.dll at Python startup to fix OverflowError in torch.compile with reduce-overhead mode on Windows
Extracts shared find_package_path() utility to _utils.py for reuse between cudnn and static_cuda_launcher patches
Patches format specifier from l (signed long) to K (unsigned long long) for CUDA stream parsing

Test plan

Fresh uv sync installs patches.pth to site-packages
First Python startup automatically patches torch_python.dll
Subsequent startups detect "already patched" and skip
Server starts without errors

🤖 Generated with Claude Code

Fixes unsigned long issue on Windows when using torch.compile Signed-off-by: Yondon Fu <yondon.fu@gmail.com>

Auto patch torch static cuda launcher

5007edc

Fixes unsigned long issue on Windows when using torch.compile Signed-off-by: Yondon Fu <yondon.fu@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Auto-patch torch_python.dll for StaticCudaLauncher overflow fix #337

Auto-patch torch_python.dll for StaticCudaLauncher overflow fix #337

yondonfu commented Jan 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Auto-patch torch_python.dll for StaticCudaLauncher overflow fix #337

Are you sure you want to change the base?

Auto-patch torch_python.dll for StaticCudaLauncher overflow fix #337

Conversation

yondonfu commented Jan 13, 2026

Summary

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants