kernel-builder: improve GPU arch handling by danieldk · Pull Request #579 · huggingface/kernels

danieldk · 2026-05-22T09:56:53Z

Add a bunch of improvements to the GPU arch handling code:

Completely remove arch.nix. This file was originally used to for compiling Torch and for determining supported archs list. However, we repackage Torch binaries and we use our own arch list.
Completely separate CUDA and HIP CMake code generation. This is cleaner and fixes an issue where when compiling with ROCm, the CUDA code would also get called, since it was not guarded.
Improve per-kernel GPU arch reporting.

Add a bunch of improvements to the GPU arch handling code: - Completely remove `arch.nix`. This file was originally used to for compiling Torch and for determining supported archs list. However, we repackage Torch binaries and we use our own arch list. - Completely separate CUDA and HIP CMake code generation. This is cleaner and fixes an issue where when compiling with ROCm, the CUDA code would also get called, since it was not guarded. - Improve per-kernel GPU arch reporting.

danieldk · 2026-05-22T09:58:28Z

+    else()
+        set(_KERNEL_ARCHS "${CUDA_KERNEL_ARCHS}")
+    endif()
+    message(STATUS "CUDA kernel: ${KERNEL_NAME}, capabilities: ${_KERNEL_ARCHS}")


This is the only real change here, the rest is just moving the code out of the conditional block due to the CUDA/HIP split.

Would it make sense to split the CUDA and HIP functions into their own scripts and use them here or too much moving around?

I considered that in the big CMake refactor a few months back, but in the end we need all those functions anyway (since the kernel may be multi-backend), so I decided to put them together and do the variable substitution in cuda.cmake, cpu.cmake, xpu.cmake, etc.

danieldk · 2026-05-22T09:58:49Z

+    else()
+        set(_KERNEL_ARCHS "${ROCM_ARCHS}")
    endif()
+    message(STATUS "ROCm kernel: ${KERNEL_NAME}, archs: ${_KERNEL_ARCHS}")


This is the only real change here, the rest is just moving the code out of the conditional block due to the CUDA/HIP split.

danieldk · 2026-05-22T09:59:07Z

-#
-# Note: this is defined as a macro since it updates `CMAKE_CUDA_FLAGS`.
-#
-macro(override_gpu_arches GPU_ARCHES GPU_LANG GPU_SUPPORTED_ARCHES)


Unused now, remove.

sayakpaul

Awesome, left some questions.

sayakpaul · 2026-05-22T11:15:56Z

@@ -0,0 +1,10 @@
+if(GPU_LANG STREQUAL "HIP")


Nice separation.

sayakpaul · 2026-05-22T11:22:30Z

+    else()
+        set(_KERNEL_ARCHS "${CUDA_KERNEL_ARCHS}")
+    endif()
+    message(STATUS "CUDA kernel: ${KERNEL_NAME}, capabilities: ${_KERNEL_ARCHS}")


Would it make sense to split the CUDA and HIP functions into their own scripts and use them here or too much moving around?

sayakpaul · 2026-05-22T11:23:34Z

            cuda_capabilities.as_deref(),
-            None,
            cuda_flags.as_deref(),
-            None,
            cuda_minver.as_ref(),
        ),
-        Kernel::Rocm {
-            rocm_archs,
-            hip_flags,
-            ..
-        } => (
-            None,
-            rocm_archs.as_deref(),
-            None,
-            hip_flags.as_deref(),
-            None,


Very nice. Way less confusing.

danieldk added 2 commits May 22, 2026 09:52

Update ReLU example archs

e1c93aa

danieldk commented May 22, 2026

View reviewed changes

sayakpaul reviewed May 22, 2026

View reviewed changes

nix-builder: remove out commented-out conditional

4c052f6

danieldk temporarily deployed to testpypi May 22, 2026 12:15 — with GitHub Actions Inactive

sayakpaul approved these changes May 22, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kernel-builder: improve GPU arch handling#579

kernel-builder: improve GPU arch handling#579
danieldk wants to merge 3 commits into
mainfrom
gpu-archs-cleanup

danieldk commented May 22, 2026

Uh oh!

danieldk May 22, 2026

Uh oh!

sayakpaul May 22, 2026

Uh oh!

danieldk May 22, 2026

Uh oh!

danieldk May 22, 2026

Uh oh!

danieldk May 22, 2026

Uh oh!

sayakpaul left a comment

Uh oh!

sayakpaul May 22, 2026

Uh oh!

sayakpaul May 22, 2026

Uh oh!

sayakpaul May 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

danieldk commented May 22, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants