Are my assumptions correct that I can now compile for any number of GPU and CPU backends and use whichever combination I want together (given the correct hardware, of course)? Is it as simple as passing in an option to llama-server or llama-cli to instruct which CPU and GPU backends I want to use? What are these options?
Replies: 1 comment
You can build any number of backends, although there may be some incompatibilities with backends that require using a different C/C++ compiler. In that case, it would still be possible to build these backends separately using `GGML_BACKEND_DL` and then bundle them together. You can use `--device` to select the devices you want to use, and `--list-devices` to view a list of available devices.
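A rough sketch of what that workflow might look like (the specific backend CMake flags and device names below are illustrative, not a definitive recipe; use whatever backends apply to your hardware and the names printed by `--list-devices`):

```sh
# Assumed example: build with backends as dynamically loadable libraries.
# GGML_CUDA / GGML_VULKAN stand in for whichever backends you actually want.
cmake -B build -DGGML_BACKEND_DL=ON -DGGML_CUDA=ON -DGGML_VULKAN=ON
cmake --build build --config Release

# Show the devices detected at runtime.
./build/bin/llama-cli --list-devices

# Run on a chosen combination of devices; the names "CUDA0" and "Vulkan0"
# are placeholders for whatever --list-devices reported on your machine.
./build/bin/llama-server -m model.gguf --device CUDA0,Vulkan0
```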