mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-05 18:38:17 -05:00
[GH-ISSUE #12988] issue: Chat title generation does not honor mmap setting #32308
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @m-schenker on GitHub (Apr 17, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/12988
Check Existing Issues
Installation Method
Docker
Open WebUI Version
v0.6.5
Ollama Version (if applicable)
No response
Operating System
Fedora Linux 42
Browser (if applicable)
No response
Confirmation
README.md.Expected Behavior
When enabling the mmap setting, with cpu inferencing, in a chat to reduce the memory footprint the title generation and other tasks open-webui might execute using the model should honor this setting and start the title generation and possible other tasks with it as well.
Actual Behavior
Open-webui generates the answer to a promt using the mmap setting enabling a low memory footprint, which is especially useful when using cpu inferencing. After the answer is provided it generates the chat title using the same model only to load the entire model into main memory. If it succeeds it generated a high memory footprint doing so if it fails it generates a high memory footprint only to fail entirely if the main memory is not sufficient to keep the entire model (which would often be the case when using mmap).
Steps to Reproduce
Logs & Screenshots
Can't include open-webui logs for privacy reasons.
Ollama container logs:
Additional Information
No response