Is your feature request related to a problem? Please describe.
num_batch can greatly improve inference performance at the cost of higher VRAM usage.
Depending on the task, it can be beneficial to trade some context size for a larger num_batch, or vice versa.
For example:
- Fast responses: num_batch: 2048, num_ctx: 8192
- Larger context, but a bit slower: num_batch: 512, num_ctx: 32768
- Middle ground: num_batch: 1024, num_ctx: 16384
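For reference, both of these are standard Ollama runtime options, so they can already be set per-request through Ollama's API. A rough sketch (the model name `llama3` is just an example):

```shell
# Pass num_batch/num_ctx as per-request options to a local Ollama instance
curl http://localhost:11434/api/generate -d '{
  "model": "llama3",
  "prompt": "Hello",
  "options": {
    "num_batch": 2048,
    "num_ctx": 8192
  }
}'
```

Exposing these fields in Open WebUI would presumably just mean forwarding them in the `options` object like this.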
Describe the solution you'd like
It would be great if you could set num_batch in Open WebUI.
It would also be really useful if basic parameters such as num_ctx, num_batch, temperature, top_k/top_p, num_keep, etc. were available directly in the chat interface, without having to go into Settings -> Advanced and tweak them there each time.
Describe alternatives you've considered
Right now I'm creating several copies of models, with the num_ctx/num_batch values in their names, in order to quickly switch between settings. It works, but it's painful.
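Concretely, the workaround looks something like this (a sketch; the base model and preset names are just examples): one Modelfile per preset, then a model created from each.

```
# Modelfile.fast — preset favoring throughput over context size
FROM llama3
PARAMETER num_batch 2048
PARAMETER num_ctx 8192
```

```shell
# Register the preset as its own model, then repeat for each variant
ollama create llama3-fast -f Modelfile.fast
```

Every new preset means another Modelfile and another entry in the model list, which is exactly the friction this request is about.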