[GH-ISSUE #14958] Tool calls silently drop with large system prompts (~1600+ tokens) #56131

New Issue

GiteaMirror · 2026-04-29T10:18:14-05:00

GiteaMirror commented

2026-04-29 10:18:14 -05:00

Originally created by @cicoyle on GitHub (Mar 19, 2026).
Original GitHub issue: https://github.com/ollama/ollama/issues/14958

What is the issue?

Im using

$ ollama --version
ollama version is 0.17.6

OS: macOS (Apple M2 Max, 32GB)

Models tested: mistral-small3.1:24b, qwq:32b, qwen2.5:32b

When using the OpenAI-compatible /v1/chat/completions endpoint with tool_choice: "required" and a large system prompt (~1600+ tokens), Ollama generates completion tokens, but returns empty content with no tool_calls in the response. The same request with a shorter system prompt works correctly.

Repro: Works (~570 prompt tokens, short system prompt):

  curl http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d '{
      "model": "mistral-small3.1:24b",
      "messages": [{"role":"system","content":"You are a helpful assistant."},{"role":"user","content":"What is the weather?"}],
      "tools": [{"type":"function","function":{"name":"get_weather","description":"Get current weather","parameters":{"type":"object","properties":{"location":{"type":"string"}}}}}],
      "tool_choice": "required"
  }'

Note, I did generalize my prompt to weather...

Fails: Same endpoint and tool definitions, but with a system prompt expanded to ~1600 tokens containing detailed multi-step agent instructions. The response returns "content":"", "finish_reason":"stop" with NO tool_calls field, despite "completion_tokens":31 proving the model generated output.

What I'm seeing:

The model generates 31 completion tokens, but they are not captured as tool_calls
This happens across ALL tested models (mistral, qwen, qwq), meaning it's not model-specific
Works perfectly with short prompts + same tools
num_ctx=4096 (default), prompt is 1632 tokens leaving ~2464 tokens for generation, this should be sufficient

I expect:
Tool calls should be returned in the response regardless of system prompt length, as long as the prompt fits within the context window.

Relevant log output

sanitized logs:

ollama logs

  time=2026-03-19T09:43:38.139-05:00 level=DEBUG source=server.go:1536 msg="completion request" images=0 prompt=7064 format=""                                                                                                                     
  time=2026-03-19T09:43:38.157-05:00 level=DEBUG source=cache.go:151 msg="loading cache slot" id=0 cache=0 prompt=1632 used=0 remaining=1632                                                                                                       
  [GIN] 2026/03/19 - 09:43:47 | 200 | 18.676361417s | ::1 | POST "/v1/chat/completions" 


raw resp captured via a proxy:            
                                                                                                                                                                                             
  {"id":"chatcmpl-638","object":"chat.completion","created":1773931427,"model":"mistral-small3.1:24b","system_fingerprint":"fp_ollama","choices":[{"index":0,"message":{"role":"assistant","content":""},"finish_reason":"stop"}],"usage":{"prompt_
  tokens":1632,"completion_tokens":31,"total_tokens":1663}} 


Note: prompt_tokens:1632, completion_tokens:31 (tokens generated but not returned as tool_calls), content:"", no tool_calls field.

OS

macOS

GPU

Apple

CPU

Apple

Ollama version

0.17.6

Originally created by @cicoyle on GitHub (Mar 19, 2026). Original GitHub issue: https://github.com/ollama/ollama/issues/14958 ### What is the issue? Im using ``` $ ollama --version ollama version is 0.17.6 OS: macOS (Apple M2 Max, 32GB) Models tested: mistral-small3.1:24b, qwq:32b, qwen2.5:32b ``` When using the OpenAI-compatible `/v1/chat/completions` endpoint with `tool_choice: "required"` and a large system prompt (~1600+ tokens), Ollama generates completion tokens, but returns empty content with no tool_calls in the response. The same request with a shorter system prompt works correctly. Repro: Works (~570 prompt tokens, short system prompt): ``` curl http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d '{ "model": "mistral-small3.1:24b", "messages": [{"role":"system","content":"You are a helpful assistant."},{"role":"user","content":"What is the weather?"}], "tools": [{"type":"function","function":{"name":"get_weather","description":"Get current weather","parameters":{"type":"object","properties":{"location":{"type":"string"}}}}}], "tool_choice": "required" }' ``` Note, I did generalize my prompt to weather... Fails: Same endpoint and tool definitions, but with a system prompt expanded to ~1600 tokens containing detailed multi-step agent instructions. The response returns "content":"", "finish_reason":"stop" with NO tool_calls field, despite "completion_tokens":31 proving the model generated output. What I'm seeing: - The model generates 31 completion tokens, but they are not captured as tool_calls - This happens across ALL tested models (mistral, qwen, qwq), meaning it's not model-specific - Works perfectly with short prompts + same tools - `num_ctx=4096` (default), prompt is 1632 tokens leaving ~2464 tokens for generation, this should be sufficient I expect: Tool calls should be returned in the response regardless of system prompt length, as long as the prompt fits within the context window. ### Relevant log output ```shell sanitized logs: ollama logs time=2026-03-19T09:43:38.139-05:00 level=DEBUG source=server.go:1536 msg="completion request" images=0 prompt=7064 format="" time=2026-03-19T09:43:38.157-05:00 level=DEBUG source=cache.go:151 msg="loading cache slot" id=0 cache=0 prompt=1632 used=0 remaining=1632 [GIN] 2026/03/19 - 09:43:47 | 200 | 18.676361417s | ::1 | POST "/v1/chat/completions" raw resp captured via a proxy: {"id":"chatcmpl-638","object":"chat.completion","created":1773931427,"model":"mistral-small3.1:24b","system_fingerprint":"fp_ollama","choices":[{"index":0,"message":{"role":"assistant","content":""},"finish_reason":"stop"}],"usage":{"prompt_ tokens":1632,"completion_tokens":31,"total_tokens":1663}} Note: prompt_tokens:1632, completion_tokens:31 (tokens generated but not returned as tool_calls), content:"", no tool_calls field. ``` ### OS macOS ### GPU Apple ### CPU Apple ### Ollama version 0.17.6

GiteaMirror added the bug label 2026-04-29 10:18:14 -05:00

GiteaMirror closed this issue

2026-04-29 10:18:15 -05:00

GiteaMirror commented

2026-04-29 10:18:15 -05:00

@rick-github commented on GitHub (Mar 19, 2026):

What's the output of ollama ps after running the large prompt?

$ curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d '{
      "model": "mistral-small3.1:24b",
      "messages": [
        {"role":"system","content":"'"$(yes You are a helpful assistant. | head -264 | tr \\n ' ')"'"},
        {"role":"user","content":"What is the weather in Zurich?"}
      ],
      "tools": [
        {
          "type": "function",
          "function": {
            "name": "get_weather",
            "description": "Get current weather",
            "parameters": {
              "type": "object",
              "properties": {
                "location": {
                  "type": "string"
                }
              }
            }
          }
        }
      ],
      "tool_choice": "required"
  }' | jq

{
  "id": "chatcmpl-426",
  "object": "chat.completion",
  "created": 1773935171,
  "model": "mistral-small3.1:24b",
  "system_fingerprint": "fp_ollama",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "",
        "tool_calls": [
          {
            "id": "call_wzzfmfr9",
            "index": 0,
            "type": "function",
            "function": {
              "name": "get_weather",
              "arguments": "{\"location\":\"Zurich\"}"
            }
          }
        ]
      },
      "finish_reason": "tool_calls"
    }
  ],
  "usage": {
    "prompt_tokens": 1635,
    "completion_tokens": 18,
    "total_tokens": 1653
  }
}

@rick-github commented on GitHub (Mar 19, 2026): What's the output of `ollama ps` after running the large prompt? ```console $ curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d '{ "model": "mistral-small3.1:24b", "messages": [ {"role":"system","content":"'"$(yes You are a helpful assistant. | head -264 | tr \\n ' ')"'"}, {"role":"user","content":"What is the weather in Zurich?"} ], "tools": [ { "type": "function", "function": { "name": "get_weather", "description": "Get current weather", "parameters": { "type": "object", "properties": { "location": { "type": "string" } } } } } ], "tool_choice": "required" }' | jq ``` ```json { "id": "chatcmpl-426", "object": "chat.completion", "created": 1773935171, "model": "mistral-small3.1:24b", "system_fingerprint": "fp_ollama", "choices": [ { "index": 0, "message": { "role": "assistant", "content": "", "tool_calls": [ { "id": "call_wzzfmfr9", "index": 0, "type": "function", "function": { "name": "get_weather", "arguments": "{\"location\":\"Zurich\"}" } } ] }, "finish_reason": "tool_calls" } ], "usage": { "prompt_tokens": 1635, "completion_tokens": 18, "total_tokens": 1653 } } ```

GiteaMirror commented

2026-04-29 10:18:16 -05:00

@cicoyle commented on GitHub (Mar 19, 2026):

$ ollama ps
NAME                    ID              SIZE     PROCESSOR    CONTEXT    UNTIL              
mistral-small3.1:24b    b9aaf0c2586a    16 GB    100% GPU     4096       4 minutes from now

I think the bug has something to do with the tool call output parser. When the system prompt + tools exceed some threshold (~1500 tokens I think), the parser fails to capture the generated tool call tokens and returns an empty resp. The model is generating the right output (31 tokens = typical tool call), but Ollama's resp serialization drops it.

I created a repo curl that consistently fails for me:

echo "=== SHORT (should work) ===" && curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d @/tmp/ollama_req_short.json | jq . && echo "=== LONG (should fail)
   ===" && curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d @/tmp/ollama_req_1.json | jq .

=== SHORT (should work) ===
{
  "id": "chatcmpl-226",
  "object": "chat.completion",
  "created": 1773952138,
  "model": "mistral-small3.1:24b",
  "system_fingerprint": "fp_ollama",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "",
        "tool_calls": [
          {
            "id": "call_kweb0u1j",
            "index": 0,
            "type": "function",
            "function": {
              "name": "GetTicketsAtRisk",
              "arguments": "{}"
            }
          }
        ]
      },
      "finish_reason": "tool_calls"
    }
  ],
  "usage": {
    "prompt_tokens": 850,
    "completion_tokens": 16,
    "total_tokens": 866
  }
}
=== LONG (should fail)
   ===
{
  "id": "chatcmpl-116",
  "object": "chat.completion",
  "created": 1773952147,
  "model": "mistral-small3.1:24b",
  "system_fingerprint": "fp_ollama",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": ""
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 1467,
    "completion_tokens": 31,
    "total_tokens": 1498
  }
}

Where the files are defined as follows:

cat /tmp/ollama_req_short.json
{"model": "mistral-small3.1:24b", "messages": [{"role": "system", "content": "You are a helpful assistant."}, {"role": "user", "content": "Identify tickets that are at risk."}], "tools": [{"type": "function", "function": {"name": "SendEscalationEmail", "description": "\nSend an email to the Escalation Team notifying them of a delayed ticket\ndue to resource constraints. Use this when a ticket has insufficient staff\nand no available technicians are expected soon.\n", "parameters": {"description": "Input schema for sending escalation notification.", "properties": {"requester_name": {"description": "Requester name", "type": "string"}, "requester_id": {"description": "Requester ID", "type": "string"}, "ticket_id": {"description": "Ticket identifier", "type": "string"}, "skill_needed": {"description": "Skill that is needed", "type": "string"}, "deadline": {"description": "Original SLA deadline", "type": "string"}}, "required": ["requester_name", "requester_id", "ticket_id", "skill_needed", "deadline"], "type": "object"}}}, {"type": "function", "function": {"name": "SendTeamLeadAlert", "description": "\nSend an alert to the team lead about critical issues requiring attention.\nUse this for resource gap escalations, when additional staff are needed, or\nwhen task assignments need priority adjustment.\n", "parameters": {"description": "Input schema for sending alert to team lead.", "properties": {"alert_type": {"description": "Type of alert: 'RESOURCE_GAP', 'PRIORITY_ESCALATION', 'STAFF_NEEDED'", "type": "string"}, "details": {"description": "Detailed description of the issue", "type": "string"}, "ids": {"description": "Comma-separated list of affected IDs", "type": "string"}}, "required": ["alert_type", "ids", "details"], "type": "object"}}}, {"type": "function", "function": {"name": "GetOpenTicketsCount", "description": "Get the total count of all open tickets that have not yet been resolved (stage < 90).", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsAtRisk", "description": "Get all tickets at risk of missing their SLA. Returns tickets with deadline of tomorrow or older that are not yet resolved ordered by date and priority.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsAtRiskCount", "description": "Get the count of tickets at risk of missing their SLA. Returns the number of tickets with deadline of tomorrow or older that are not yet resolved.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsWithGaps", "description": "Get tickets that have resource gaps and are at risk. Returns tickets where required skills exceed available staff ordered by deadline.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsWithGapsCount", "description": "Get the count of tickets that have resource gaps and are at risk. Returns the number of tickets where required skills exceed available staff.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetAvailableStaff", "description": "Get all available staff members that have not yet been assigned and that have the needed skills.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetAvailableStaffCount", "description": "Get the count of unique tickets that have gaps and have available staff that can cover them.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetUncoverableTickets", "description": "Get tickets that have resource gaps and no available staff. These tickets cannot be resolved on time.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetUncoverableTicketsCount", "description": "Get the count of unique tickets that have resource gaps and no available staff.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetRequesterContact", "description": "Get requester contact information for a ticket.", "parameters": {"properties": {"ticket_id": {}}, "required": ["ticket_id"], "type": "object"}}}], "tool_choice": "required", "temperature": 1}%

&&

cat /tmp/ollama_req_1.json
{"model": "mistral-small3.1:24b", "messages": [{"role": "system", "content": "# Today's date is: March 19, 2026\n\nName:\n- Your name is IT_Support_Coordinator_Agent.\n\nRole:\n- You are an IT Helpdesk Operations Coordinator.\n\nGoal:\n- Your goal is to manage helpdesk operations to ensure all support tickets with resource constraints that are at risk of missing their SLA deadline are identified, and relevant actions prioritized. If tickets are unable to be resolved because there are no available technicians, the user can ask to notify the Escalation Team. If the ticket can be resolved by an available technician, the user can ask to notify the team lead to prioritize the required skills and assign them.\n\nPrimary Instructions:\n- Only perform the specific tasks requested. Do NOT send notifications unless explicitly asked.\n- WHEN ASKED to identify at-risk tickets: Call 'get-tickets-at-risk-count' and 'get-open-tickets-count'. Report counts with a chart. Do NOT call 'get-tickets-at-risk' unless asked for the full list.\n- WHEN ASKED to list at-risk tickets: Call 'get-tickets-at-risk'. Report ticket ID, requester name, and due date.\n- WHEN ASKED to identify tickets with resource gaps: Call 'get-tickets-with-gaps-count' and 'get-tickets-at-risk-count'. Report counts with a chart showing 'Resource Gap' vs 'Other At Risk'.\n- WHEN ASKED to identify which gap tickets can be covered by available staff: Call 'get-available-staff-count' and 'get-uncoverable-tickets-count'. Report counts with a chart. Do NOT notify anyone.\n- WHEN ASKED to list gap tickets with available staff: Call 'get-available-staff'.\n- WHEN ASKED to identify gap tickets that cannot be covered: Call 'get-uncoverable-tickets-count' and 'get-available-staff-count'. Report with a chart. Do NOT notify anyone.\n- WHEN ASKED to list uncoverable gap tickets: Call 'get-uncoverable-tickets'.\n- WHEN ASKED to notify the Escalation Team: Use 'get-requester-contact' then 'send-escalation-email' with requester name, requester ID, ticket ID, skill needed, and deadline.\n- WHEN ASKED to notify the Team Lead: Use 'send-team-lead-alert' with alert type 'RESOURCE_GAP', affected ticket IDs, and details.\n- WHEN ASKED to process all gap tickets end-to-end: Call 'get-available-staff'. If no staff exists, notify the Escalation Team. If staff exists, notify the Team Lead.\n- Tool names use lowercase kebab-case. Tickets and incidents are interchangeable terms.\n- When reporting counts, include a chart on its own line using this format: <<<{\"title\":\"Title\",\"data\":{\"Label1\":count1,\"Label2\":count2}}>>> Use integers from tool results only. For at-risk queries show 'At Risk' vs 'Not At Risk'."}, {"role": "user", "content": "Identify tickets that are at risk."}], "tools": [{"type": "function", "function": {"name": "SendEscalationEmail", "description": "\nSend an email to the Escalation Team notifying them of a delayed ticket\ndue to resource constraints. Use this when a ticket has insufficient staff\nand no available technicians are expected soon.\n", "parameters": {"description": "Input schema for sending escalation notification.", "properties": {"requester_name": {"description": "Requester name", "type": "string"}, "requester_id": {"description": "Requester ID", "type": "string"}, "ticket_id": {"description": "Ticket identifier", "type": "string"}, "skill_needed": {"description": "Skill that is needed", "type": "string"}, "deadline": {"description": "Original SLA deadline", "type": "string"}}, "required": ["requester_name", "requester_id", "ticket_id", "skill_needed", "deadline"], "type": "object"}}}, {"type": "function", "function": {"name": "SendTeamLeadAlert", "description": "\nSend an alert to the team lead about critical issues requiring attention.\nUse this for resource gap escalations, when additional staff are needed, or\nwhen task assignments need priority adjustment.\n", "parameters": {"description": "Input schema for sending alert to team lead.", "properties": {"alert_type": {"description": "Type of alert: 'RESOURCE_GAP', 'PRIORITY_ESCALATION', 'STAFF_NEEDED'", "type": "string"}, "details": {"description": "Detailed description of the issue", "type": "string"}, "ids": {"description": "Comma-separated list of affected IDs", "type": "string"}}, "required": ["alert_type", "ids", "details"], "type": "object"}}}, {"type": "function", "function": {"name": "GetOpenTicketsCount", "description": "Get the total count of all open tickets that have not yet been resolved (stage < 90).", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsAtRisk", "description": "Get all tickets at risk of missing their SLA. Returns tickets with deadline of tomorrow or older that are not yet resolved ordered by date and priority.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsAtRiskCount", "description": "Get the count of tickets at risk of missing their SLA. Returns the number of tickets with deadline of tomorrow or older that are not yet resolved.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsWithGaps", "description": "Get tickets that have resource gaps and are at risk. Returns tickets where required skills exceed available staff ordered by deadline.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsWithGapsCount", "description": "Get the count of tickets that have resource gaps and are at risk. Returns the number of tickets where required skills exceed available staff.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetAvailableStaff", "description": "Get all available staff members that have not yet been assigned and that have the needed skills.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetAvailableStaffCount", "description": "Get the count of unique tickets that have gaps and have available staff that can cover them.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetUncoverableTickets", "description": "Get tickets that have resource gaps and no available staff. These tickets cannot be resolved on time.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetUncoverableTicketsCount", "description": "Get the count of unique tickets that have resource gaps and no available staff.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetRequesterContact", "description": "Get requester contact information for a ticket.", "parameters": {"properties": {"ticket_id": {}}, "required": ["ticket_id"], "type": "object"}}}], "tool_choice": "required", "temperature": 1}%

I noticed, when replacing only the system prompt with "You are a helpful assistant." (same tools, same tool_choice, same user message) it returns tool_calls correctly at 926 tokens. The bug triggers with longer structured system prompts around ~1500+ tokens combined with 12 tool definitions.

@cicoyle commented on GitHub (Mar 19, 2026): ``` $ ollama ps NAME ID SIZE PROCESSOR CONTEXT UNTIL mistral-small3.1:24b b9aaf0c2586a 16 GB 100% GPU 4096 4 minutes from now ``` I think the bug has something to do with the tool call output parser. When the system prompt + tools exceed some threshold (~1500 tokens I think), the parser fails to capture the generated tool call tokens and returns an empty resp. The model is generating the right output (31 tokens = typical tool call), but Ollama's resp serialization drops it. I created a repo curl that consistently fails for me: ``` echo "=== SHORT (should work) ===" && curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d @/tmp/ollama_req_short.json | jq . && echo "=== LONG (should fail) ===" && curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d @/tmp/ollama_req_1.json | jq . === SHORT (should work) === { "id": "chatcmpl-226", "object": "chat.completion", "created": 1773952138, "model": "mistral-small3.1:24b", "system_fingerprint": "fp_ollama", "choices": [ { "index": 0, "message": { "role": "assistant", "content": "", "tool_calls": [ { "id": "call_kweb0u1j", "index": 0, "type": "function", "function": { "name": "GetTicketsAtRisk", "arguments": "{}" } } ] }, "finish_reason": "tool_calls" } ], "usage": { "prompt_tokens": 850, "completion_tokens": 16, "total_tokens": 866 } } === LONG (should fail) === { "id": "chatcmpl-116", "object": "chat.completion", "created": 1773952147, "model": "mistral-small3.1:24b", "system_fingerprint": "fp_ollama", "choices": [ { "index": 0, "message": { "role": "assistant", "content": "" }, "finish_reason": "stop" } ], "usage": { "prompt_tokens": 1467, "completion_tokens": 31, "total_tokens": 1498 } } ``` Where the files are defined as follows: ``` cat /tmp/ollama_req_short.json {"model": "mistral-small3.1:24b", "messages": [{"role": "system", "content": "You are a helpful assistant."}, {"role": "user", "content": "Identify tickets that are at risk."}], "tools": [{"type": "function", "function": {"name": "SendEscalationEmail", "description": "\nSend an email to the Escalation Team notifying them of a delayed ticket\ndue to resource constraints. Use this when a ticket has insufficient staff\nand no available technicians are expected soon.\n", "parameters": {"description": "Input schema for sending escalation notification.", "properties": {"requester_name": {"description": "Requester name", "type": "string"}, "requester_id": {"description": "Requester ID", "type": "string"}, "ticket_id": {"description": "Ticket identifier", "type": "string"}, "skill_needed": {"description": "Skill that is needed", "type": "string"}, "deadline": {"description": "Original SLA deadline", "type": "string"}}, "required": ["requester_name", "requester_id", "ticket_id", "skill_needed", "deadline"], "type": "object"}}}, {"type": "function", "function": {"name": "SendTeamLeadAlert", "description": "\nSend an alert to the team lead about critical issues requiring attention.\nUse this for resource gap escalations, when additional staff are needed, or\nwhen task assignments need priority adjustment.\n", "parameters": {"description": "Input schema for sending alert to team lead.", "properties": {"alert_type": {"description": "Type of alert: 'RESOURCE_GAP', 'PRIORITY_ESCALATION', 'STAFF_NEEDED'", "type": "string"}, "details": {"description": "Detailed description of the issue", "type": "string"}, "ids": {"description": "Comma-separated list of affected IDs", "type": "string"}}, "required": ["alert_type", "ids", "details"], "type": "object"}}}, {"type": "function", "function": {"name": "GetOpenTicketsCount", "description": "Get the total count of all open tickets that have not yet been resolved (stage < 90).", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsAtRisk", "description": "Get all tickets at risk of missing their SLA. Returns tickets with deadline of tomorrow or older that are not yet resolved ordered by date and priority.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsAtRiskCount", "description": "Get the count of tickets at risk of missing their SLA. Returns the number of tickets with deadline of tomorrow or older that are not yet resolved.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsWithGaps", "description": "Get tickets that have resource gaps and are at risk. Returns tickets where required skills exceed available staff ordered by deadline.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsWithGapsCount", "description": "Get the count of tickets that have resource gaps and are at risk. Returns the number of tickets where required skills exceed available staff.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetAvailableStaff", "description": "Get all available staff members that have not yet been assigned and that have the needed skills.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetAvailableStaffCount", "description": "Get the count of unique tickets that have gaps and have available staff that can cover them.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetUncoverableTickets", "description": "Get tickets that have resource gaps and no available staff. These tickets cannot be resolved on time.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetUncoverableTicketsCount", "description": "Get the count of unique tickets that have resource gaps and no available staff.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetRequesterContact", "description": "Get requester contact information for a ticket.", "parameters": {"properties": {"ticket_id": {}}, "required": ["ticket_id"], "type": "object"}}}], "tool_choice": "required", "temperature": 1}% ``` && ``` cat /tmp/ollama_req_1.json {"model": "mistral-small3.1:24b", "messages": [{"role": "system", "content": "# Today's date is: March 19, 2026\n\nName:\n- Your name is IT_Support_Coordinator_Agent.\n\nRole:\n- You are an IT Helpdesk Operations Coordinator.\n\nGoal:\n- Your goal is to manage helpdesk operations to ensure all support tickets with resource constraints that are at risk of missing their SLA deadline are identified, and relevant actions prioritized. If tickets are unable to be resolved because there are no available technicians, the user can ask to notify the Escalation Team. If the ticket can be resolved by an available technician, the user can ask to notify the team lead to prioritize the required skills and assign them.\n\nPrimary Instructions:\n- Only perform the specific tasks requested. Do NOT send notifications unless explicitly asked.\n- WHEN ASKED to identify at-risk tickets: Call 'get-tickets-at-risk-count' and 'get-open-tickets-count'. Report counts with a chart. Do NOT call 'get-tickets-at-risk' unless asked for the full list.\n- WHEN ASKED to list at-risk tickets: Call 'get-tickets-at-risk'. Report ticket ID, requester name, and due date.\n- WHEN ASKED to identify tickets with resource gaps: Call 'get-tickets-with-gaps-count' and 'get-tickets-at-risk-count'. Report counts with a chart showing 'Resource Gap' vs 'Other At Risk'.\n- WHEN ASKED to identify which gap tickets can be covered by available staff: Call 'get-available-staff-count' and 'get-uncoverable-tickets-count'. Report counts with a chart. Do NOT notify anyone.\n- WHEN ASKED to list gap tickets with available staff: Call 'get-available-staff'.\n- WHEN ASKED to identify gap tickets that cannot be covered: Call 'get-uncoverable-tickets-count' and 'get-available-staff-count'. Report with a chart. Do NOT notify anyone.\n- WHEN ASKED to list uncoverable gap tickets: Call 'get-uncoverable-tickets'.\n- WHEN ASKED to notify the Escalation Team: Use 'get-requester-contact' then 'send-escalation-email' with requester name, requester ID, ticket ID, skill needed, and deadline.\n- WHEN ASKED to notify the Team Lead: Use 'send-team-lead-alert' with alert type 'RESOURCE_GAP', affected ticket IDs, and details.\n- WHEN ASKED to process all gap tickets end-to-end: Call 'get-available-staff'. If no staff exists, notify the Escalation Team. If staff exists, notify the Team Lead.\n- Tool names use lowercase kebab-case. Tickets and incidents are interchangeable terms.\n- When reporting counts, include a chart on its own line using this format: <<<{\"title\":\"Title\",\"data\":{\"Label1\":count1,\"Label2\":count2}}>>> Use integers from tool results only. For at-risk queries show 'At Risk' vs 'Not At Risk'."}, {"role": "user", "content": "Identify tickets that are at risk."}], "tools": [{"type": "function", "function": {"name": "SendEscalationEmail", "description": "\nSend an email to the Escalation Team notifying them of a delayed ticket\ndue to resource constraints. Use this when a ticket has insufficient staff\nand no available technicians are expected soon.\n", "parameters": {"description": "Input schema for sending escalation notification.", "properties": {"requester_name": {"description": "Requester name", "type": "string"}, "requester_id": {"description": "Requester ID", "type": "string"}, "ticket_id": {"description": "Ticket identifier", "type": "string"}, "skill_needed": {"description": "Skill that is needed", "type": "string"}, "deadline": {"description": "Original SLA deadline", "type": "string"}}, "required": ["requester_name", "requester_id", "ticket_id", "skill_needed", "deadline"], "type": "object"}}}, {"type": "function", "function": {"name": "SendTeamLeadAlert", "description": "\nSend an alert to the team lead about critical issues requiring attention.\nUse this for resource gap escalations, when additional staff are needed, or\nwhen task assignments need priority adjustment.\n", "parameters": {"description": "Input schema for sending alert to team lead.", "properties": {"alert_type": {"description": "Type of alert: 'RESOURCE_GAP', 'PRIORITY_ESCALATION', 'STAFF_NEEDED'", "type": "string"}, "details": {"description": "Detailed description of the issue", "type": "string"}, "ids": {"description": "Comma-separated list of affected IDs", "type": "string"}}, "required": ["alert_type", "ids", "details"], "type": "object"}}}, {"type": "function", "function": {"name": "GetOpenTicketsCount", "description": "Get the total count of all open tickets that have not yet been resolved (stage < 90).", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsAtRisk", "description": "Get all tickets at risk of missing their SLA. Returns tickets with deadline of tomorrow or older that are not yet resolved ordered by date and priority.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsAtRiskCount", "description": "Get the count of tickets at risk of missing their SLA. Returns the number of tickets with deadline of tomorrow or older that are not yet resolved.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsWithGaps", "description": "Get tickets that have resource gaps and are at risk. Returns tickets where required skills exceed available staff ordered by deadline.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetTicketsWithGapsCount", "description": "Get the count of tickets that have resource gaps and are at risk. Returns the number of tickets where required skills exceed available staff.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetAvailableStaff", "description": "Get all available staff members that have not yet been assigned and that have the needed skills.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetAvailableStaffCount", "description": "Get the count of unique tickets that have gaps and have available staff that can cover them.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetUncoverableTickets", "description": "Get tickets that have resource gaps and no available staff. These tickets cannot be resolved on time.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetUncoverableTicketsCount", "description": "Get the count of unique tickets that have resource gaps and no available staff.", "parameters": {"properties": {}, "type": "object"}}}, {"type": "function", "function": {"name": "GetRequesterContact", "description": "Get requester contact information for a ticket.", "parameters": {"properties": {"ticket_id": {}}, "required": ["ticket_id"], "type": "object"}}}], "tool_choice": "required", "temperature": 1}% ``` I noticed, when replacing only the system prompt with "You are a helpful assistant." (same tools, same tool_choice, same user message) it returns tool_calls correctly at 926 tokens. The bug triggers with longer structured system prompts around ~1500+ tokens combined with 12 tool definitions.

GiteaMirror commented

2026-04-29 10:18:16 -05:00

@cicoyle commented on GitHub (Mar 19, 2026):

Plz note, I did just run the example scenario from ^ with the latest ollama version 0.18.2 and see the same result:

echo "=== SHORT (should work) ===" && curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d @/tmp/ollama_req_short.json | jq . && echo "=== LONG (should fail)
   ===" && curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d @/tmp/ollama_req_1.json | jq .

=== SHORT (should work) ===
{
  "id": "chatcmpl-493",
  "object": "chat.completion",
  "created": 1773952967,
  "model": "mistral-small3.1:24b",
  "system_fingerprint": "fp_ollama",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "",
        "tool_calls": [
          {
            "id": "call_jcw40vr4",
            "index": 0,
            "type": "function",
            "function": {
              "name": "GetTicketsAtRisk",
              "arguments": "{}"
            }
          }
        ]
      },
      "finish_reason": "tool_calls"
    }
  ],
  "usage": {
    "prompt_tokens": 850,
    "completion_tokens": 16,
    "total_tokens": 866
  }
}
=== LONG (should fail)
   ===
{
  "id": "chatcmpl-593",
  "object": "chat.completion",
  "created": 1773952976,
  "model": "mistral-small3.1:24b",
  "system_fingerprint": "fp_ollama",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": ""
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 1467,
    "completion_tokens": 31,
    "total_tokens": 1498
  }
}

ollama --version
ollama version is 0.18.2

@cicoyle commented on GitHub (Mar 19, 2026): Plz note, I did just run the example scenario from ^ with the latest ollama version `0.18.2` and see the same result: ``` echo "=== SHORT (should work) ===" && curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d @/tmp/ollama_req_short.json | jq . && echo "=== LONG (should fail) ===" && curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d @/tmp/ollama_req_1.json | jq . === SHORT (should work) === { "id": "chatcmpl-493", "object": "chat.completion", "created": 1773952967, "model": "mistral-small3.1:24b", "system_fingerprint": "fp_ollama", "choices": [ { "index": 0, "message": { "role": "assistant", "content": "", "tool_calls": [ { "id": "call_jcw40vr4", "index": 0, "type": "function", "function": { "name": "GetTicketsAtRisk", "arguments": "{}" } } ] }, "finish_reason": "tool_calls" } ], "usage": { "prompt_tokens": 850, "completion_tokens": 16, "total_tokens": 866 } } === LONG (should fail) === { "id": "chatcmpl-593", "object": "chat.completion", "created": 1773952976, "model": "mistral-small3.1:24b", "system_fingerprint": "fp_ollama", "choices": [ { "index": 0, "message": { "role": "assistant", "content": "" }, "finish_reason": "stop" } ], "usage": { "prompt_tokens": 1467, "completion_tokens": 31, "total_tokens": 1498 } } ollama --version ollama version is 0.18.2 ```

GiteaMirror commented

2026-04-29 10:18:17 -05:00

@rick-github commented on GitHub (Mar 19, 2026):

The system prompt in the long request tells the model to call 'get-tickets-at-risk-count'. There is no function by that name. If I change the get-... to the corresponding Get... functions, then the model emits tools calls.

$ curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d @/tmp/ollama_req_1.json | jq .
{
  "id": "chatcmpl-77",
  "object": "chat.completion",
  "created": 1773954230,
  "model": "mistral-small3.1:24b",
  "system_fingerprint": "fp_ollama",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "",
        "tool_calls": [
          {
            "id": "call_z7f00f4g",
            "index": 0,
            "type": "function",
            "function": {
              "name": "GetTicketsAtRiskCount",
              "arguments": "{}"
            }
          },
          {
            "id": "call_4yb4u38h",
            "index": 1,
            "type": "function",
            "function": {
              "name": "GetOpenTicketsCount",
              "arguments": "{}"
            }
          }
        ]
      },
      "finish_reason": "tool_calls"
    }
  ],
  "usage": {
    "prompt_tokens": 1462,
    "completion_tokens": 29,
    "total_tokens": 1491
  }
}

@rick-github commented on GitHub (Mar 19, 2026): The system prompt in the long request tells the model to call 'get-tickets-at-risk-count'. There is no function by that name. If I change the `get-...` to the corresponding `Get...` functions, then the model emits tools calls. ```console $ curl -s http://localhost:11434/v1/chat/completions -H "Content-Type: application/json" -d @/tmp/ollama_req_1.json | jq . { "id": "chatcmpl-77", "object": "chat.completion", "created": 1773954230, "model": "mistral-small3.1:24b", "system_fingerprint": "fp_ollama", "choices": [ { "index": 0, "message": { "role": "assistant", "content": "", "tool_calls": [ { "id": "call_z7f00f4g", "index": 0, "type": "function", "function": { "name": "GetTicketsAtRiskCount", "arguments": "{}" } }, { "id": "call_4yb4u38h", "index": 1, "type": "function", "function": { "name": "GetOpenTicketsCount", "arguments": "{}" } } ] }, "finish_reason": "tool_calls" } ], "usage": { "prompt_tokens": 1462, "completion_tokens": 29, "total_tokens": 1491 } } ```

GiteaMirror commented

2026-04-29 10:18:17 -05:00

@cicoyle commented on GitHub (Mar 19, 2026):

Hi @rick-github - Thanks for identifying this - you were right! The root cause was a tool name casing mismatch between my system prompt instructions (kebab-case like get-orders-at-risk-count) and the actual tool definitions registered as PascalCase (GetOrdersAtRiskCount). The model saw "call get-orders-at-risk-count" in the prompt, but no tool by that name existed.

This wasn't an Ollama bug after all. That said, it might be worth considering whether Ollama could

emit a warning or error when tool_choice: "required" is set, but the model references a function name that doesn't match any registered tool.
apply fuzzy/case-insensitive matching on tool names (ex: treat get-orders-at-risk-count and GetOrdersAtRiskCount as the same tool) to avoid users falling into this same trap

Either would have saved my debugging time. I can open a separate feature request for that linking to this issue. Closing this issue - thx again for the quick reply :)

@cicoyle commented on GitHub (Mar 19, 2026): Hi @rick-github - Thanks for identifying this - you were right! The root cause was a tool name casing mismatch between my system prompt instructions (kebab-case like `get-orders-at-risk-count`) and the actual tool definitions registered as PascalCase (`GetOrdersAtRiskCount`). The model saw "call `get-orders-at-risk-count`" in the prompt, but no tool by that name existed. This wasn't an Ollama bug after all. That said, it might be worth considering whether Ollama could - emit a warning or error when `tool_choice: "required"` is set, but the model references a function name that doesn't match any registered tool. - apply fuzzy/case-insensitive matching on tool names (ex: treat `get-orders-at-risk-count` and `GetOrdersAtRiskCount` as the same tool) to avoid users falling into this same trap Either would have saved my debugging time. I can open a separate feature request for that linking to this issue. Closing this issue - thx again for the quick reply :)

Sign in to join this conversation.

Branches Tags

main

dhiltgen/ci

dhiltgen/llama-runner

hoyyeva/anthropic-local-image-path

hoyyeva/anthropic-reference-images-path

parth-anthropic-reference-images-path

brucemacd/download-before-remove

hoyyeva/editor-config-repair

parth-mlx-decode-checkpoints

parth-launch-codex-app

hoyyeva/fix-codex-model-metadata-warning

hoyyeva/qwen

parth/hide-claude-desktop-till-release

hoyyeva/opencode-image-modality

parth-add-claude-code-autoinstall

release_v0.22.0

pdevine/manifest-list

codex/fix-codex-model-metadata-warning

pdevine/addressable-manifest

brucemacd/launch-fetch-reccomended

jmorganca/llama-compat

launch-copilot-cli

hoyyeva/opencode-thinking

release_v0.20.7

parth-auto-save-backup

parth-test

jmorganca/gemma4-audio-replacements

fix-manifest-digest-on-pull

hoyyeva/vscode-improve

brucemacd/install-server-wait

parth/update-claude-docs

brucemac/start-ap-install

pdevine/mlx-update

pdevine/qwen35_vision

drifkin/api-show-fallback

mintlify/image-generation-1773352582

hoyyeva/server-context-length-local-config

jmorganca/faster-reptition-penalties

jmorganca/convert-nemotron

parth-pi-thinking

pdevine/sampling-penalties

jmorganca/fix-create-quantization-memory

dongchen/resumable_transfer_fix

pdevine/sampling-cache-error

jessegross/mlx-usage

hoyyeva/openclaw-config

hoyyeva/app-html

pdevine/qwen3next

brucemacd/sign-sh-install

brucemacd/tui-update

brucemacd/usage-api

jmorganca/launch-empty

fix-app-dist-embed

mxyng/mlx-compile

mxyng/mlx-quant

mxyng/mlx-glm4.7

mxyng/mlx

brucemacd/simplify-model-picker

jmorganca/qwen3-concurrent

fix-glm-4.7-flash-mla-config

drifkin/qwen3-coder-opening-tag

brucemacd/usage-cli

fix-cuda12-fattn-shmem

ollama-imagegen-docs

parth/fix-multiline-inputs

brucemacd/config-docs

mxyng/model-files

mxyng/simple-execute

fix-imagegen-ollama-models

mxyng/async-upload

jmorganca/lazy-no-dtype-changes

imagegen-auto-detect-create

parth/decrease-concurrent-download-hf

fix-mlx-quantize-init

jmorganca/x-cleanup

usage

imagegen-readme

jmorganca/glm-image

mlx-gpu-cd

jmorganca/imagegen-modelfile

parth/agent-skills

parth/agent-allowlist

parth/signed-in-offline

parth/agents

parth/fix-context-chopping

improve-cloud-flow

parth/add-models-websearch

parth/prompt-renderer-mcp

jmorganca/native-settings

jmorganca/download-stream-hash

jmorganca/client2-rebased

brucemacd/oai-chat-req-multipart

jessegross/multi_chunk_reserve

grace/additional-omit-empty

grace/mistral-3-large

mxyng/tokenizer2

mxyng/tokenizer

jessegross/flash

hoyyeva/windows-nacked-app

mxyng/cleanup-attention

grace/deepseek-parser

hoyyeva/remember-unsent-prompt

parth/add-lfs-pointer-error-conversion

parth/olmo2-test2

hoyyeva/ollama-launchagent-plist

nicole/olmo-model

parth/olmo-test

mxyng/remove-embedded

parth/render-template

jmorganca/intellect-3

parth/remove-prealloc-linter

jmorganca/cmd-eval

nicole/nomic-embed-text-fix

mxyng/lint-2

hoyyeva/add-gemini-3-pro-preview

hoyyeva/load-model-list

mxyng/expand-path

mxyng/environ-2

hoyyeva/deeplink-json-encoding

parth/improve-tool-calling-tests

hoyyeva/conversation

hoyyeva/assistant-edit-response

hoyyeva/thinking

origin/brucemacd/invalid-char-i-err

parth/improve-tool-calling

jmorganca/required-omitempty

grace/qwen3-vl-tests

mxyng/iter-client

parth/docs-readme

nicole/embed-test

pdevine/integration-benchstat

parth/remove-generate-cmd

parth/add-toolcall-id

mxyng/server-tests

jmorganca/glm-4.6

jmorganca/gin-h-compat

drifkin/stable-tool-args

pdevine/qwen3-more-thinking

parth/add-websearch-client

nicole/websearch_local

jmorganca/qwen3-coder-updates

grace/deepseek-v3-migration-tests

mxyng/fix-create

jmorganca/cloud-errors

pdevine/parser-tidy

revert-12233-parth/simplify-entrypoints-runner

parth/enable-so-gpt-oss

brucemacd/qwen3vl

jmorganca/readme-simplify

parth/gpt-oss-structured-outputs

revert-12039-jmorganca/tools-braces

mxyng/embeddings

mxyng/gguf

mxyng/benchmark

mxyng/types-null

parth/move-parsing

mxyng/gemma2

jmorganca/docs

mxyng/16-bit

mxyng/create-stdin

pdevine/authorizedkeys

mxyng/quant

parth/opt-in-error-context-window

brucemacd/cache-models

brucemacd/runner-completion

jmorganca/llama-update-6

brucemacd/benchmark-list

brucemacd/partial-read-caps

parth/deepseek-r1-tools

mxyng/omit-array

parth/tool-prefix-temp

brucemacd/runner-test

jmorganca/qwen25vl

brucemacd/model-forward-test-ext

parth/python-function-parsing

jmorganca/cuda-compression-none

drifkin/num-parallel

drifkin/chat-truncation-fix

jmorganca/sync

parth/python-tools-calling

drifkin/array-head-count

brucemacd/create-no-loop

parth/server-enable-content-stream-with-tools

qwen25omni

mxyng/v3

brucemacd/ropeconfig

jmorganca/silence-tokenizer

parth/sample-so-test

parth/sampling-structured-outputs

brucemacd/doc-go-engine

parth/constrained-sampling-json

jmorganca/mistral-wip

brucemacd/mistral-small-convert

parth/sample-unmarshal-json-for-params

brucemacd/jomorganca/mistral

pdevine/bfloat16

jmorganca/mistral

brucemacd/mistral

pdevine/logging

parth/sample-correctness-fix

parth/sample-fix-sorting

jmorgan/sample-fix-sorting-extras

jmorganca/temp-0-images

brucemacd/parallel-embed-models

brucemacd/shim-grammar

jmorganca/fix-gguf-error

bmizerany/nameswork

jmorganca/faster-releases

bmizerany/validatenames

brucemacd/err-no-vocab

brucemacd/rope-config

brucemacd/err-hint

brucemacd/qwen2_5

brucemacd/logprobs

brucemacd/new_runner_graph_bench

progress-flicker

brucemacd/forward-test

brucemacd/go_qwen2

pdevine/gemma2

jmorganca/add-missing-symlink-eval

mxyng/next-debug

parth/set-context-size-openai

brucemacd/next-bpe-bench

brucemacd/next-bpe-test

brucemacd/new_runner_e2e

brucemacd/new_runner_qwen2

pdevine/convert-cohere2

brucemacd/convert-cli

parth/log-probs

mxyng/next-mlx

mxyng/cmd-history

parth/templating

parth/tokenize-detokenize

brucemacd/check-key-register

bmizerany/grammar

jmorganca/vendor-081b29bd

mxyng/func-checks

jmorganca/fix-null-format

parth/fix-default-to-warn-json

jmorganca/qwen2vl

jmorganca/no-concat

parth/cmd-cleanup-SO

brucemacd/check-key-register-structured-err

parth/openai-stream-usage

parth/fix-referencing-so

stream-tools-stop

jmorganca/degin-1

brucemacd/install-path-clean

brucemacd/push-name-validation

brucemacd/browser-key-register

jmorganca/openai-fix-first-message

jmorganca/fix-proxy

jessegross/sample

parth/disallow-streaming-tools

dhiltgen/remove_submodule

jmorganca/ga

jmorganca/mllama

pdevine/newlines

pdevine/geems-2b

jmorganca/llama-bump

mxyng/modelname-7

mxyng/gin-slog

mxyng/modelname-6

jyan/convert-prog

jyan/quant5

paligemma-support

pdevine/import-docs

jmorganca/openai-context

jyan/paligemma

jyan/p2

jyan/palitest

bmizerany/embedspeedup

jmorganca/llama-vit

brucemacd/allow-ollama

royh/ep-methods

royh/whisper

mxyng/api-models

mxyng/fix-memory

jyan/q4_4/8

jyan/ollama-v

royh/stream-tools

roy-embed-parallel

bmizerany/hrm

revert-5963-revert-5924-mxyng/llama3.1-rope

royh/embed-viz

jyan/local2

jyan/auth

jyan/local

jyan/parse-temp

jmorganca/template-mistral

jyan/reord-g

royh-openai-suffixdocs

royh-imgembed

royh-embed-parallel

jyan/quant4

royh-precision

jyan/progress

pdevine/fix-template

jyan/quant3

pdevine/ggla

mxyng/update-registry-domain

jmorganca/ggml-static

mxyng/create-context

jyan/v0.146

mxyng/layers-from-files

build_dist

bmizerany/noseek

royh-ls

royh-name

timeout

mxyng/server-timestamp

bmizerany/nosillyggufslurps

royh-params

jmorganca/llama-cpp-7c26775

royh-openai-delete

royh-show-rigid

jmorganca/enable-fa

jmorganca/no-error-template

jyan/format

royh-testdelete

bmizerany/fastverify

language_support

pdevine/ps-glitches

brucemacd/tokenize

bruce/iq-quants

bmizerany/filepathwithcoloninhost

mxyng/split-bin

bmizerany/client-registry

jmorganca/if-none-match

native

jmorganca/native

jmorganca/batch-embeddings

jmorganca/initcmake

jmorganca/mm

pdevine/showggmlinfo

modenameenforcealphanum

bmizerany/modenameenforcealphanum

jmorganca/done-reason

jmorganca/llama-cpp-8960fe8

ollama.com

bmizerany/filepathnobuild

bmizerany/types/model/defaultfix

rmdisplaylong

nogogen

bmizerany/x

modelfile-readme

bmizerany/replacecolon

jmorganca/limit

jmorganca/execstack

jmorganca/replace-assets

mxyng/tune-concurrency

jmorganca/testing

whitespace-detection

jmorganca/options

upgrade-all

scratch

cuda-search

mattw/airenamer

mattw/allmodelsonhuggingface

mattw/quantcontext

mattw/whatneedstorun

brucemacd/llama-mem-calc

mattw/faq-context

mattw/communitylinks

mattw/noprune

mattw/python-functioncalling

rename

mxyng/install

pulse

remove-first

editor

mattw/selfqueryingretrieval

cgo

mattw/howtoquant

api

matt/streamingapi

format-config

mxyng/extra-args

shell

update-nous-hermes

cp-model

upload-progress

fix-unknown-model

fix-model-names

delete-fix

insecure-registry

ls

deletemodels

progressbar

readme-updates

license-layers

skip-list

list-models

modelpath

matt/examplemodelfiles

distribution

go-opts

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: github-starred/ollama#56131