
Add switch to disable RAG on attachments. #3556

Closed · nickovs opened this issue Jun 30, 2024 · 2 comments

nickovs (Contributor) commented Jun 30, 2024

Currently, if you drop even a single document onto Open WebUI and ask a question about it, RAG appears to always be used. It would be helpful to be able to switch this off.

Currently, for each attachment the document text is extracted, the text is chunked, embeddings are computed and indexed, an index lookup is performed, and then only some of the chunks are passed to the LLM. While this is an acceptable approach for retrieval tasks, it fails badly for summarisation tasks.
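(For context, the pipeline described above corresponds roughly to the sketch below. This is not the actual Open WebUI code: the chunk size, the toy hashing "embedding", and all function names are illustrative assumptions standing in for the real embedding model and vector store.)

```python
import hashlib
import math

CHUNK_SIZE = 500  # characters; illustrative only, not Open WebUI's setting

def chunk(text: str) -> list[str]:
    # Split the extracted document text into fixed-size chunks.
    return [text[i:i + CHUNK_SIZE] for i in range(0, len(text), CHUNK_SIZE)]

def embed(chunk_text: str, dims: int = 64) -> list[float]:
    # Toy stand-in for a real embedding model: hash words into a unit vector.
    vec = [0.0] * dims
    for word in chunk_text.lower().split():
        h = int(hashlib.md5(word.encode()).hexdigest(), 16)
        vec[h % dims] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def top_k(query: str, index: list[tuple[str, list[float]]], k: int = 5) -> list[str]:
    # Retrieve the k chunks most similar to the query (cosine similarity).
    q = embed(query)
    scored = [(sum(a * b for a, b in zip(q, v)), c) for c, v in index]
    scored.sort(reverse=True)  # ranked by score, NOT by document order
    return [c for _, c in scored[:k]]

# Index each attachment, then pass only the top-k chunks to the LLM.
document_text = "..."  # output of the text-extraction step
index = [(c, embed(c)) for c in chunk(document_text)]
context = "\n\n".join(top_k("summarise this document", index))
```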

For LLMs with small context windows summarisation is hard, but many LLMs now support very large context windows: GPT-4o has a 128K-token context window, and Anthropic Claude 3 supports 200K tokens, or about 400 pages of text. To make the best use of these longer-context models it would be helpful to be able to pass the entire extracted text of all attachments to the LLM. A switch to bypass the RAG steps would achieve this.
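(In principle such a switch could be as small as the sketch below, reusing the toy helpers from the previous sketch. `bypass_rag` is a hypothetical flag for illustration, not an existing Open WebUI setting.)

```python
def build_context(document_text: str, query: str, bypass_rag: bool) -> str:
    # Hypothetical flag: when set, skip chunking/indexing/retrieval entirely
    # and hand the full extracted text, in its original order, to the LLM.
    if bypass_rag:
        return document_text
    index = [(c, embed(c)) for c in chunk(document_text)]
    return "\n\n".join(top_k(query, index))
```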

tjbck (Contributor) commented Jul 1, 2024

This behaviour has been changed as of 0.3.6.

tjbck closed this as completed Jul 1, 2024
nickovs (Contributor, Author) commented Jul 1, 2024

Can you describe the new behaviour? I am using 0.3.7 (commit 7bc88eb), and when I attach a multi-page document only a subset of 5 pages is passed as context, not in the order of the original document. If I change the "Top K" parameter to 10, then 10 pages are passed in, again not in the original document order. Clearly something is still preventing more than the top-k hits from being used, and the re-ordering suggests that they come from some sort of lookup result.
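(The behaviour described here matches what any top-k retrieval step produces: raising k returns more chunks, but they arrive ranked by similarity score rather than by page order. Reusing the toy helpers from the first sketch, with made-up page contents:)

```python
# 20 extracted "pages"; the contents are made up purely for illustration.
pages = [f"Page {n}: notes about topic {n}" for n in range(1, 21)]
index = [(p, embed(p)) for p in pages]

print(top_k("what does page 7 say?", index, k=5))   # 5 chunks, in score order
print(top_k("what does page 7 say?", index, k=10))  # 10 chunks, still score order
```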
