feat: allow setting num_gpu parameter #2877

Open
mherrmann3 opened this issue Jun 6, 2024 · 3 comments


@mherrmann3

Is your feature request related to a problem? Please describe.
To avoid having to create a new modelfile just to adjust or fine-tune the number of layers offloaded to the GPU, make this setting (num_gpu) user-configurable; Ollama lists it among the most common parameters.

Describe the solution you'd like
Implement and add 'num_gpu (Ollama)' in the 'Advanced Params' section of a model in Workspace > Models.
(I would not add it to the 'Advanced Parameters' section of Settings > General, as the number and size of layers are model- and quant-specific.)

Describe alternatives you've considered
Creating a new Ollama model(file) with an adjusted num_gpu. This works, but it is cumbersome if one wants to iterate on num_gpu (or change it quickly when the GPU is busy with other things/models); see the sketch below.
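For reference, the workaround looks roughly like this (a sketch only; the model name and layer count are placeholders, and it relies on Ollama accepting the undocumented num_gpu key in a modelfile):

```
# Modelfile — sketch of the current workaround; model and layer count are examples
FROM mixtral:8x7b
PARAMETER num_gpu 33
```

followed by:

```
ollama create mixtral-gpu33 -f Modelfile
```

Every change to the layer count means editing the file and re-running ollama create, which is exactly the friction this request is about.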

Additional context
num_gpu is not specified1 in the official Ollama docs as a valid PARAMETER, but it is supported by the API (see the sketch after the footnotes).

Footnotes

  1. like use_mmap, use_mlock, and num_thread, which are already configurable in open_webui.
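As a minimal sketch of what "supported by the API" means in practice (assuming a local Ollama instance on its default port; the model name and layer count are placeholders):

```python
# Sketch: setting num_gpu per request via Ollama's /api/generate "options" field.
# Assumes Ollama is reachable at localhost:11434; model and value are examples.
import requests

response = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "mixtral:8x7b",
        "prompt": "Hello",
        "stream": False,
        "options": {"num_gpu": 33},  # number of layers to offload to the GPU
    },
    timeout=600,
)
print(response.json()["response"])
```

open-webui already makes use_mmap, use_mlock, and num_thread configurable, so exposing num_gpu would presumably just add one more key of the same kind.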

@Qualzz

Qualzz commented Jul 4, 2024

bumping this

@derpyhue

derpyhue commented Jul 4, 2024

derpyhue/openwebui_num_gpu@fff91f7
Editing the files in this commit enables changing the num_gpu layers setting.
It might need a bit of polishing.
This is my first time committing something to GitHub, but I hope it helps!
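In spirit, the change boils down to forwarding num_gpu along with the other advanced params when the Ollama options payload is built. A very rough sketch (all names here are hypothetical, not the actual open-webui code; see the commit above for the real diff):

```python
# Hypothetical sketch — names do not match the real open-webui code;
# the linked commit contains the actual change.
FORWARDED_OLLAMA_OPTIONS = {
    "use_mmap", "use_mlock", "num_thread",
    "num_gpu",  # newly forwarded: layers to offload to the GPU
}

def build_ollama_options(advanced_params: dict) -> dict:
    """Keep only the advanced params that should be passed to Ollama's 'options' field."""
    return {k: v for k, v in advanced_params.items() if k in FORWARDED_OLLAMA_OPTIONS}

# Example: build_ollama_options({"num_gpu": 33, "unrelated": 1}) -> {"num_gpu": 33}
```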

@JKratto

JKratto commented Jul 10, 2024

Thank you for this. I am looking forward to the merge. Ollama changed its memory allocation strategy (or so I think), and suddenly the whole model "does not fit in VRAM" (only 30/33 layers are offloaded to the GPU). In reality it can fit all 33/33 layers while still leaving about 25% of VRAM free. The performance penalty in the 30/33 scenario is roughly a 70% loss of throughput (Mixtral 8x7B). Being able to adjust this setting for my machine would go a long way, as I would not have to create my own model to overcome this issue. It's not a big problem; it just seems cleaner. :)
