sd: support for CLIP and VAE on different devices by wbruna · Pull Request #2184 · LostRuins/koboldcpp

wbruna · 2026-05-03T22:51:57Z

Support for placing CLIP or VAE on separate devices (e.g. diffusion on Vulkan0, VAE on Vulkan1). It also enables keeping the diffusion model itself on CPU.

The first two commits adapt the C++ code: the interface receives device numbers instead of booleans, with -1 for "main device" and -2 for "CPU", and the backend includes a global config to choose which model gets which device. The last commit changes the sdclipgpu and sdvaecpu boolean parameters to accept "CPU", "main" or a device number.
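As a rough illustration of the value mapping described above, here is a minimal sketch that turns "CPU", "main" or a device number into an integer id. It is not the PR's code: the function name `parse_device_setting` and the constant names are hypothetical. Only the sentinel values (-1 for the main device, -2 for CPU) come from the description.

```python
# Sentinel values taken from the PR description; the constant names are hypothetical.
MAIN_DEVICE = -1  # "use whatever device the diffusion model runs on"
CPU_DEVICE = -2   # "keep this component on the CPU"


def parse_device_setting(value):
    """Map "CPU", "main", or a device number to an integer device id.

    Hypothetical helper: the real option parsing in the PR may differ.
    """
    text = str(value).strip().lower()
    if text == "cpu":
        return CPU_DEVICE
    if text == "main":
        return MAIN_DEVICE
    try:
        index = int(text)
    except ValueError:
        raise ValueError(
            f"expected 'CPU', 'main' or a device number, got {value!r}"
        ) from None
    # Negative numbers are reserved for the sentinels, so reject them as input.
    if index < 0:
        raise ValueError(f"device number must be non-negative, got {index}")
    return index
```

In this sketch the keywords are matched case-insensitively, and whether user-facing device numbers are 0-based or 1-based is deliberately left open.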

Tested on Vulkan with my GPU and iGPU. Seems to work fine with command-line and config settings; however, I wasn't able to fully test the launcher, because there doesn't seem to be a way to select a discrete GPU and an iGPU through it (so I likely got its 1-based indexes wrong).

LostRuins · 2026-05-07T14:54:27Z

Merged your other PR, so now this conflicts.

wbruna · 2026-05-07T15:38:34Z

Fixed.

LostRuins · 2026-05-11T07:01:15Z

So I looked through this PR and it does seem like quite a lot of changes and complexity for something that doesn't seem too useful, in my opinion.

Especially with the ability to already use offload_cpu (runtime load/unload for each component), it doesn't really seem too useful compared to simply using the same GPU for all image gen components. Is there something I'm missing?

Also it does modify a bunch of extra upstream code too.

wbruna · 2026-05-11T10:44:28Z

> So I looked through this PR and it does seem like quite a lot of changes and complexity for something that doesn't seem too useful, in my opinion.
>
> Especially with the ability to already use offload_cpu (runtime load/unload for each component), it doesn't really seem too useful compared to simply using the same GPU for all image gen components. Is there something I'm missing?

offload_cpu doesn't help in situations that benefit from a second GPU for the same gen: a weaker card with more memory (like an iGPU) could, e.g., run a video VAE which wouldn't fit on the main GPU. Also, its cost isn't trivial: it pins a lot of extra system RAM and introduces extra latency for all generations.

We (and upstream) do get requests for this functionality from time to time.
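The second-GPU scenario above can be sketched as a per-component placement table that is resolved against a list of available devices. This is only an illustration under stated assumptions: `resolve_device`, the `placement` dictionary and the device names are hypothetical, and the sentinel values (-1 for the main device, -2 for CPU) follow the PR description rather than the actual backend code.

```python
# Sentinel values from the PR description; everything else here is hypothetical.
MAIN_DEVICE = -1
CPU_DEVICE = -2


def resolve_device(setting, main_index, device_names):
    """Translate a per-component setting into a concrete device name."""
    if setting == CPU_DEVICE:
        return "CPU"
    # The "main" sentinel falls back to the device used for diffusion.
    index = main_index if setting == MAIN_DEVICE else setting
    if not 0 <= index < len(device_names):
        raise IndexError(f"no device with index {index}")
    return device_names[index]


# Example: a discrete GPU (Vulkan0) plus an iGPU (Vulkan1) with more memory.
devices = ["Vulkan0", "Vulkan1"]
placement = {"diffusion": MAIN_DEVICE, "clip": MAIN_DEVICE, "vae": 1}

resolved = {
    component: resolve_device(setting, main_index=0, device_names=devices)
    for component, setting in placement.items()
}
```

Here the VAE lands on the second device while CLIP follows the diffusion model, which is the kind of split that offload_cpu alone would not provide.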

> Also it does modify a bunch of extra upstream code too.

Kind of? It's mostly the unavoidable device initialization for each component. I expect that code to change upstream when multi-device support gets implemented, but in that case dropping our changes would be simple enough.

Edit: rebased on top of #2204 to avoid conflicts.

wbruna force-pushed the kcpp_sd_multi_device_backend branch 2 times, most recently from 1517922 to 197cc2f Compare May 4, 2026 10:23

wbruna marked this pull request as ready for review May 4, 2026 10:24

wbruna mentioned this pull request May 5, 2026

sd: build each source file separately #2188

Merged

wbruna force-pushed the kcpp_sd_multi_device_backend branch from 197cc2f to ed81427 Compare May 7, 2026 15:37

wbruna force-pushed the kcpp_sd_multi_device_backend branch from ed81427 to bd5330a Compare May 9, 2026 12:08

wbruna added 5 commits May 12, 2026 14:47

6b965c9 sd: reuse source lists between make and cmake
f46d624 sd: sync to master-596-90e87bc
696fde4 sd: generalize internal interfaces to place generation on CPU
96ba1a0 sd: backend support for multi-device selection
49ca0ac sd: frontend support for multi-device selection

wbruna force-pushed the kcpp_sd_multi_device_backend branch from bd5330a to 49ca0ac Compare May 12, 2026 22:00

wbruna wants to merge 5 commits into LostRuins:concedo_experimental from wbruna:kcpp_sd_multi_device_backend
