
Conversation

@guoqingbao

The current version does not support multiple builder instances, and a builder instance cannot be reused for another build (as also mentioned in #2). I have fixed this by making the builder reusable, e.g., building PTX after building the lib.

The tested use case:

fn main() {
    println!("cargo:rerun-if-changed=build.rs");
    println!("cargo:rerun-if-changed=src/attention.cu");
    println!("cargo:rerun-if-changed=src/copy_blocks_kernel.cu");
    println!("cargo:rerun-if-changed=src/reshape_and_cache_kernel.cu");
    println!("cargo:rerun-if-changed=src/rotary_embedding_kernel.cu");
    let builder = bindgen_cuda::Builder::default();
    println!("cargo:info={builder:?}");
    builder.build_lib("libattention.a");

    let bindings = builder.build_ptx().unwrap();
    bindings.write("src/lib.rs").unwrap();

    println!("cargo:rustc-link-lib=attention"); 
}

@guoqingbao
Author

This is a different solution: it makes the builder reusable instead of using multiple thread pools as in #3.

@guoqingbao
Author

@ivarflakstad Hi Ivar, are you able to review this PR? I believe the PR at huggingface/candle#3221 depends on this.

authors = ["Nicolas Patry <patry.nicolas@protonmail.com>"]
name = "bindgen_cuda"
-version = "0.1.5"
+version = "0.1.7"
Collaborator

Suggested change
-version = "0.1.7"
+version = "0.1.5"

We can increment the version in a separate PR when creating a new release.

true
};
let ccbin_env = std::env::var("NVCC_CCBIN");
let nvcc_binary = if std::path::Path::new("/usr/local/cuda/bin/nvcc").exists() {
Collaborator

Nit: could create a utility à la get_nvcc_binary and avoid the repetition.

Owner

This is horrible. Let users set up their own path. This location is hardcoded for specific Linux distributions; it does not belong here.

If you really want, allow environment overrides. That's pretty bad too, but at least it won't be hardcoded.

Author

> This is horrible. Let users set up their own path. This location is hardcoded for specific Linux distributions; it does not belong here.
>
> If you really want, allow environment overrides. That's pretty bad too, but at least it won't be hardcoded.

This is intended as a fallback to the default nvcc path. At the moment, bindgen_cuda can panic even when nvcc is available in the standard installation location (/usr/local/cuda/bin).
Given that, we either provide a reasonable fallback here, or require every user to manually set and hardcode the path in their own environment. A get_nvcc_binary helper may be needed, as suggested by @ivarflakstad.
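
As a sketch only, such a helper could combine an environment override (per the owner's preference) with the standard-location fallback discussed above. The NVCC_PATH variable name and the function body are illustrative assumptions, not existing bindgen_cuda behavior:

use std::path::PathBuf;

// Hypothetical helper, not part of bindgen_cuda: resolve nvcc once instead of
// repeating the path check at every call site.
fn get_nvcc_binary() -> PathBuf {
    // An explicit override wins; NVCC_PATH is an illustrative name.
    if let Ok(path) = std::env::var("NVCC_PATH") {
        return PathBuf::from(path);
    }
    // Fall back to the conventional Linux install location.
    let default = PathBuf::from("/usr/local/cuda/bin/nvcc");
    if default.exists() {
        return default;
    }
    // Otherwise rely on nvcc being available on PATH.
    PathBuf::from("nvcc")
}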

-pub fn build_ptx(self) -> Result<Bindings, Error> {
-    let cuda_root = self.cuda_root.expect("Could not find CUDA in standard locations, set it manually using Builder().set_cuda_root(...)");
+pub fn build_ptx(&self) -> Result<Bindings, Error> {
+    let mut cuda_include_dir = PathBuf::from("/usr/local/cuda/include");
Owner

This does not belong here.

Comment on lines +85 to +92
    /// Force to use a given compute capability
    pub fn set_compute_cap(&mut self, cap: usize) {
        self.compute_cap = Some(cap);
    }

    pub fn get_compute_cap(&self) -> Option<usize> {
        self.compute_cap
    }
Owner

They already exist, remove this.

Author
@guoqingbao Jan 21, 2026

Sorry, this shouldn't be here; I was pushing to my own forked repo and forgot it had been pushed for a merge. That said, these interfaces address a real need: we haven't exposed compute_cap (initialized by default) to users, and sometimes they need a specific CUDA compute capability to build certain kernels, instead of relying on the env or nvidia-smi. For example, flash attention v2 kernels need to be compiled with sm_90 or below even when other kernels are compiled with sm_120+. Related issue: guoqingbao/vllm.rs#194
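
To make that use case concrete, here is a minimal build.rs sketch assuming the set_compute_cap method from this diff (90 meaning sm_90); the kernel set and library name are illustrative:

fn main() {
    println!("cargo:rerun-if-changed=build.rs");
    let mut builder = bindgen_cuda::Builder::default();
    // Assumes the set_compute_cap from this diff: force sm_90 for the flash
    // attention v2 kernels even if the rest of the project targets sm_120+.
    builder.set_compute_cap(90);
    builder.build_lib("libflashattention.a"); // illustrative library name
    println!("cargo:rustc-link-lib=flashattention");
}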

/// println!("cargo:rustc-link-lib=flash");
/// ```
-pub fn build_lib<P>(self, out_file: P)
+pub fn build_lib<P>(&self, out_file: P)
Owner

There were reasons to consume the builder...

Everything already works; instead of changing build_lib to take &self, you should clone the builder directly (i.e., implement Clone).
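
For comparison, a sketch of what that alternative would look like in a downstream build.rs, assuming Builder implemented Clone (it does not today) and kept the consuming build_lib(self) signature:

fn main() {
    let builder = bindgen_cuda::Builder::default();
    // The clone is consumed by build_lib(self); the original stays usable.
    builder.clone().build_lib("libattention.a");
    let bindings = builder.build_ptx().unwrap();
    bindings.write("src/lib.rs").unwrap();
    println!("cargo:rustc-link-lib=attention");
}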
