feat: advanced settings refac & num_ctx added by tjbck · Pull Request #166 · open-webui/open-webui

tjbck · 2023-11-30T03:35:59Z

No description provided.

Added context length

Added num_ctx variable

fixed bad ,

Adding num_ctx to the model settings page

feat: advanced settings refac & num_ctx added

…e send (#21596) * perf: eliminate 2 redundant full chat deserialization on every message send (#162) Problem: Every message send triggered get_chat_by_id_and_user_id which loads the entire Chat row — including the potentially massive JSON blob containing the full conversation history — even when the caller only needed a simple yes/no ownership check or a single column value. Two call sites in the message-send hot path were doing this: 1. main.py ownership verification: loaded the entire chat object including all message history JSON, then checked `if chat is None`. The JSON blob was immediately discarded — only the existence of the row mattered. 2. middleware.py folder check: loaded the entire chat object including all message history JSON, then read only `chat.folder_id` — a plain column on the chat table that requires zero JSON parsing. Fix: - Added `chat_exists_by_id_and_user_id()`: uses SQL EXISTS subquery which returns a boolean without loading any row data. The database can satisfy this from the primary key index alone. - Added `get_chat_folder_id()`: queries only the `folder_id` column via `db.query(Chat.folder_id)`, which tells SQLAlchemy to SELECT only that single column instead of the entire row. Both new methods preserve the same error handling semantics (return False/None on exception) and user_id filtering (ownership check) as the original get_chat_by_id_and_user_id. Impact: - Best case (typical): eliminates deserializing 2 full chat JSON blobs per message send. For long conversations (hundreds of messages with tool calls, images, file attachments), this blob can be multiple megabytes. - Worst case: no regression — the new queries are strictly cheaper than the old ones (less data transferred, less Python object construction, no Pydantic model_validate overhead). - The 3 remaining full chat loads in process_chat_payload (load_messages_from_db, add_file_context, chat_image_generation_handler) are left untouched as they genuinely need the full history and require separate analysis. * Address maintainer feedback: rename method and inline call (#166) - Rename chat_exists_by_id_and_user_id -> is_chat_owner - Remove intermediate chat_owned variable; call is_chat_owner directly in if condition

AnthonyCucci and others added 7 commits November 29, 2023 22:02

Update SettingsModal.svelte

ea25763

Added context length

Update +page.svelte

cd22f73

Added num_ctx variable

Update +page.svelte

d180943

Update SettingsModal.svelte

467384e

fixed bad ,

Update SettingsModal.svelte

ec07dff

fix: step value update

8276385

Merge pull request #165 from AnthonyCucci/main

ffe18bc

Adding num_ctx to the model settings page

tjbck changed the title ~~feat: num_ctx added to settings~~ feat: advanced settings refac & num_ctx added Nov 30, 2023

feat: advanced settings refac

a471298

tjbck merged commit 89644d4 into main Nov 30, 2023

explorigin pushed a commit to explorigin/open-webui that referenced this pull request Feb 2, 2024

Merge pull request open-webui#166 from ollama-webui/dev

201eaf1

feat: advanced settings refac & num_ctx added

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

feat: advanced settings refac & num_ctx added#166

feat: advanced settings refac & num_ctx added#166
tjbck merged 8 commits intomainfrom
dev

tjbck commented Nov 30, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Comments

Conversation

tjbck commented Nov 30, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants