
Conversation

@CodersAcademy006

CUDA: add config tests

This PR adds kernel-based tests for device-side read-only access to CUDA config values in Numba-CUDA:

  • cuda.config.WARP_SIZE
  • cuda.config.MAX_THREADS_PER_BLOCK
  • Use of config values in kernel control flow

Key features:

  • Tests are skipped under cudasim due to backend-specific semantics
  • NumPy is used as the reference oracle
  • Scope is intentionally limited to safe, well-defined CUDA config semantics (no mutation or non-CUDA targets)

This continues the systematic porting of CPU-side tests to CUDA, directly contributing to issue #515.
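As a rough sketch, one of these tests might look like the following. This is illustrative only: cuda.config.WARP_SIZE is the proposed attribute under test (not yet implemented), and the expected warp size of 32 is an assumption that holds on current NVIDIA hardware.

import numpy as np
from numba import cuda
from numba.cuda.testing import CUDATestCase, skip_on_cudasim


@skip_on_cudasim("config constants have backend-specific semantics in cudasim")
class TestCudaConfig(CUDATestCase):
    def test_warp_size_broadcast(self):
        @cuda.jit
        def warp_size_kernel(out):
            i = cuda.grid(1)
            if i < out.size:
                out[i] = cuda.config.WARP_SIZE  # proposed device-side constant

        out = np.zeros(128, dtype=np.int32)
        warp_size_kernel[4, 32](out)
        # NumPy as the reference oracle; 32 assumed for current NVIDIA GPUs
        np.testing.assert_array_equal(out, np.full(128, 32, dtype=np.int32))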

@copy-pr-bot

copy-pr-bot bot commented Jan 20, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@greptile-apps
Contributor

greptile-apps bot commented Jan 20, 2026

Greptile Summary

This PR adds tests for device-side CUDA config constants (cuda.config.WARP_SIZE and cuda.config.MAX_THREADS_PER_BLOCK) that do not exist in the codebase. The implementation is missing.

Critical issues:

  • cuda.config.WARP_SIZE and cuda.config.MAX_THREADS_PER_BLOCK are not defined anywhere
  • No resolve_config method in CudaModuleTemplate (cudadecl.py:461)
  • The only existing references are CU_DEVICE_ATTRIBUTE_WARP_SIZE in enums.py (driver constant) and driver.get_device().MAX_THREADS_PER_BLOCK in transpose.py (host-side API)
  • All three tests will fail immediately with AttributeError when trying to access these non-existent attributes

What needs to happen:
The implementation PR must be merged first, or this PR needs to include the implementation alongside the tests.
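For context, device-visible module attributes in Numba are typed through resolve_<attr> hooks on an AttributeTemplate. The sketch below shows the shape of the missing piece; CudaConfigTemplate and the cuda.config namespace are hypothetical, and a matching lowering that emits the constant value in device code would also be required.

from numba import cuda
from numba.core import types
from numba.core.typing.templates import AttributeTemplate, Registry

registry = Registry()


@registry.register_attr
class CudaConfigTemplate(AttributeTemplate):
    # Hypothetical: assumes a cuda.config namespace object exists to key on
    key = types.Module(cuda.config)

    def resolve_WARP_SIZE(self, mod):
        # Typing half only: declares the attribute as a 32-bit integer
        return types.int32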

Confidence Score: 0/5

  • This PR cannot be merged - it tests functionality that doesn't exist and will immediately fail
  • The PR adds tests for cuda.config.WARP_SIZE and cuda.config.MAX_THREADS_PER_BLOCK, but these constants are not implemented anywhere in the codebase. Comprehensive search confirms no device-side or host-side implementation exists. Tests will fail with AttributeError.
  • The test file requires a corresponding implementation PR to be merged first, or this PR must include the implementation

Important Files Changed

Filename: numba_cuda/numba/cuda/tests/test_config.py
Overview: Tests reference the non-existent cuda.config.WARP_SIZE and cuda.config.MAX_THREADS_PER_BLOCK; these device-side constants are not implemented anywhere in the codebase.

@greptile-apps bot left a comment

1 file reviewed, 3 comments

def warp_size_kernel(out):
    i = cuda.grid(1)
    if i < out.size:
        out[i] = cuda.config.WARP_SIZE

logic: cuda.config.WARP_SIZE is not implemented in the codebase. Searched the entire repository and found no device-side constant definition for this attribute. The cuda module's CudaModuleTemplate in cudadecl.py has no resolve_config method, and there's no mechanism to expose config values inside kernels.

def max_threads_kernel(out):
    i = cuda.grid(1)
    if i < out.size:
        out[i] = cuda.config.MAX_THREADS_PER_BLOCK

logic: cuda.config.MAX_THREADS_PER_BLOCK is not implemented. While driver.get_device().MAX_THREADS_PER_BLOCK exists for host-side access (see kernels/transpose.py:35), there's no device-side constant accessible within kernels.
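For comparison, the host-side query cited above looks roughly like this (a minimal sketch; the import path is an assumption based on kernels/transpose.py):

from numba.cuda.cudadrv.driver import driver

# Host-side only: queries the device through the driver; not usable in kernels
max_threads = driver.get_device().MAX_THREADS_PER_BLOCK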

    (d_inp, d_out),
    inp.size,
)
expected = inp * 2 if cuda.config.WARP_SIZE >= 32 else inp

logic: This line attempts host-side access to cuda.config.WARP_SIZE, but this also doesn't exist. The test references a non-existent API on both device and host sides.
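If the test needs a host-side value today, a hedged workaround (assuming Numba's standard device-attribute lookup rather than the proposed cuda.config API) would be:

from numba import cuda

# Assumption: Device objects expose CUDA device attributes such as WARP_SIZE
warp = cuda.get_current_device().WARP_SIZE
expected = inp * 2 if warp >= 32 else inp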

@CodersAcademy006
Author

@atmnp Could you please advise on the preferred approach for exposing these constants to CUDA kernels, or provide guidance on the implementation plan? Once the config attributes are available, I can update and validate these tests accordingly.
Thank you!
