[GH-ISSUE #5242] Slow performance on /api/show #3281

Closed
opened 2026-04-12 13:49:44 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @jmorganca on GitHub (Jun 23, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/5242

What is the issue?

Because we now show more model details, /api/show has gotten slower. The part that's slow specifically is reading the arrays (vocab, tensors, etc)

		case ggufTypeArray:
			v, err = readGGUFArray(llm, rs)

OS

No response

GPU

No response

CPU

No response

Ollama version

0.1.45

Originally created by @jmorganca on GitHub (Jun 23, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/5242 ### What is the issue? Because we now show more model details, `/api/show` has gotten slower. The part that's slow specifically is reading the arrays (vocab, tensors, etc) ``` case ggufTypeArray: v, err = readGGUFArray(llm, rs) ``` ### OS _No response_ ### GPU _No response_ ### CPU _No response_ ### Ollama version 0.1.45
GiteaMirror added the bug label 2026-04-12 13:49:44 -05:00
Author
Owner

@dhiltgen commented on GitHub (Jul 24, 2024):

This was fixed a while ago (0.1.45 took ~300+ ms, but 0.1.48 and later are back down to ~60ms)

<!-- gh-comment-id:2248724892 --> @dhiltgen commented on GitHub (Jul 24, 2024): This was fixed a while ago (0.1.45 took ~300+ ms, but 0.1.48 and later are back down to ~60ms)
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#3281