gpuserver: named semaphore to fix 100% idle CPU from sched_yield() by antonvnv · Pull Request #1101 · soedinglab/MMseqs2

antonvnv · 2026-04-21T23:39:07Z

The previous busy-wait loop used sched_yield(), which yields the thread's timeslice but immediately reschedules it if no other thread is waiting on the same core. On a machine with enough cores (typical for GPU servers), the OS has no reason to deschedule the thread, so it spins at 100% CPU while idle.

Replace with POSIX named semaphore so gpuserver blocks in sem_wait() and uses ~0% CPU when idle. sem_wait() uses an in-kernel futex, so the thread sleeps without context switches until the client posts.

Add GPUSharedMemorySem class to GpuUtil.h that owns the sem_t* internally; call sites are ifdef-free.

USE_GPU_SEM is automatically enabled when ENABLE_CUDA=1 in cmake. Disable with -DUSE_GPU_SEM=OFF.

The previous busy-wait loop used sched_yield(), which yields the thread's timeslice but immediately reschedules it if no other thread is waiting on the same core. On a machine with enough cores (typical for GPU servers), the OS has no reason to deschedule the thread, so it spins at 100% CPU while idle. Replace with POSIX named semaphore so gpuserver blocks in sem_wait() and uses ~0% CPU when idle. sem_wait() uses an in-kernel futex, so the thread sleeps without context switches until the client posts. Add GPUSharedMemorySem class to GpuUtil.h that owns the sem_t* internally; call sites are ifdef-free. USE_GPU_SEM is automatically enabled when ENABLE_CUDA=1 in cmake. Disable with -DUSE_GPU_SEM=OFF.

milot-mirdita · 2026-04-22T00:55:37Z

Is there any downside to making this default enabled on all CUDA builds?

antonvnv · 2026-04-23T01:57:12Z

Is there any downside to making this default enabled on all CUDA builds?

It should be enabled by default in this PR for all CUDA builds... To my knowledge there should no downsides other than the fact that so far I had it only under limited testing [and it seems to be working fine so far]... I'll be testing it more in the coming days.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gpuserver: named semaphore to fix 100% idle CPU from sched_yield()#1101

gpuserver: named semaphore to fix 100% idle CPU from sched_yield()#1101
antonvnv wants to merge 1 commit intosoedinglab:masterfrom
antonvnv:gpusem

antonvnv commented Apr 21, 2026

Uh oh!

milot-mirdita commented Apr 22, 2026

Uh oh!

antonvnv commented Apr 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

antonvnv commented Apr 21, 2026

Uh oh!

milot-mirdita commented Apr 22, 2026

Uh oh!

antonvnv commented Apr 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants