[GH-ISSUE #12857] run qwen3:32b ctx=32K 之前启动显存32G , 现在启动占用129G (v0.12.7) #55034

Closed
opened 2026-04-29 08:13:18 -05:00 by GiteaMirror · 5 comments
Owner

Originally created by @Lshan523 on GitHub (Oct 30, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/12857

What is the issue?

run qwen3:32b ctx=32K 之前启动显存32G , 现在启动占用129G

Relevant log output


OS

No response

GPU

No response

CPU

No response

Ollama version

No response

Originally created by @Lshan523 on GitHub (Oct 30, 2025). Original GitHub issue: https://github.com/ollama/ollama/issues/12857 ### What is the issue? run qwen3:32b ctx=32K 之前启动显存32G , 现在启动占用129G ### Relevant log output ```shell ``` ### OS _No response_ ### GPU _No response_ ### CPU _No response_ ### Ollama version _No response_
GiteaMirror added the bug label 2026-04-29 08:13:18 -05:00
Author
Owner

@Lshan523 commented on GitHub (Oct 30, 2025):

补充说明: 系统 Win10 Pro 显卡 5090-32G * 2

<!-- gh-comment-id:3467657606 --> @Lshan523 commented on GitHub (Oct 30, 2025): 补充说明: 系统 Win10 Pro 显卡 5090-32G * 2
Author
Owner

@rick-github commented on GitHub (Oct 30, 2025):

Server log will help in debugging.

$ ollama -v
ollama version is 0.12.6
$ ollama run qwen3:32b '' ; ollama ps
NAME         ID              SIZE     PROCESSOR    CONTEXT    UNTIL            
qwen3:32b    030ee887880f    35 GB    100% GPU     32768      2 hours from now    
$ ollama -v
ollama version is 0.12.7
$ ollama run qwen3:32b '' ; ollama ps
NAME         ID              SIZE     PROCESSOR    CONTEXT    UNTIL            
qwen3:32b    030ee887880f    35 GB    100% GPU     32768      2 hours from now    
<!-- gh-comment-id:3467859251 --> @rick-github commented on GitHub (Oct 30, 2025): [Server log](https://github.com/ollama/ollama/blob/main/docs/troubleshooting.mdx) will help in debugging. ```console $ ollama -v ollama version is 0.12.6 $ ollama run qwen3:32b '' ; ollama ps NAME ID SIZE PROCESSOR CONTEXT UNTIL qwen3:32b 030ee887880f 35 GB 100% GPU 32768 2 hours from now ``` ```console $ ollama -v ollama version is 0.12.7 $ ollama run qwen3:32b '' ; ollama ps NAME ID SIZE PROCESSOR CONTEXT UNTIL qwen3:32b 030ee887880f 35 GB 100% GPU 32768 2 hours from now ```
Author
Owner

@Lshan523 commented on GitHub (Oct 31, 2025):

Image
<!-- gh-comment-id:3471690184 --> @Lshan523 commented on GitHub (Oct 31, 2025): <img width="918" height="163" alt="Image" src="https://github.com/user-attachments/assets/52cfedb3-7358-4c92-a18d-9f6304fc1ad8" />
Author
Owner

@rick-github commented on GitHub (Oct 31, 2025):

Server log will help in debugging.

<!-- gh-comment-id:3471794210 --> @rick-github commented on GitHub (Oct 31, 2025): [Server log](https://github.com/ollama/ollama/blob/main/docs/troubleshooting.mdx) will help in debugging.
Author
Owner

@Lshan523 commented on GitHub (Oct 31, 2025):

up to 0.12.8 已经解决

<!-- gh-comment-id:3472274081 --> @Lshan523 commented on GitHub (Oct 31, 2025): up to 0.12.8 已经解决
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#55034