
[Bug]: ollama_chat/ provider does not honor timeout #8333

Open
paul-gauthier opened this issue Feb 6, 2025 · 3 comments
Labels: bug (Something isn't working)

Comments

@paul-gauthier
Contributor

What happened?

I can pass timeout to completion() and most models seem to honor it. Models from the ollama_chat/ provider do not.

Relevant log output

import litellm

def doit(model):
    messages=[{"role": "user", "content": "hi"}]
    try:
        comp = litellm.completion(model, messages, timeout=0.1)
        print(model, comp.choices[0].message.content)
    except Exception as e:
        print(model, type(e))

doit("gpt-4o") 
# outputs: gpt-4o <class 'litellm.exceptions.Timeout'>

doit("ollama/llama3.2:3b-instruct-q5_K_S")
# outputs: ollama/llama3.2:3b-instruct-q5_K_S <class 'litellm.exceptions.APIConnectionError'>

doit("ollama_chat/llama3.2:3b-instruct-q5_K_S")
# outputs: ollama_chat/llama3.2:3b-instruct-q5_K_S Hello! How can I assist you today?
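
Until the provider honors timeout, one way to bound the wait client-side is to run the call in a worker thread and cap how long we wait for the result. This is only a sketch: the underlying request still runs to completion in the background thread; it just stops blocking the caller.

import concurrent.futures

import litellm

def completion_with_hard_timeout(model, messages, timeout):
    # Run the completion in a worker thread and cap how long we wait for it.
    pool = concurrent.futures.ThreadPoolExecutor(max_workers=1)
    future = pool.submit(litellm.completion, model, messages, timeout=timeout)
    try:
        return future.result(timeout=timeout)
    finally:
        # Do not block on the worker; the request may still be in flight.
        pool.shutdown(wait=False)

try:
    comp = completion_with_hard_timeout(
        "ollama_chat/llama3.2:3b-instruct-q5_K_S",
        [{"role": "user", "content": "hi"}],
        timeout=0.1,
    )
    print(comp.choices[0].message.content)
except concurrent.futures.TimeoutError:
    print("timed out client-side")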

Are you a ML Ops Team?

No

What LiteLLM version are you on ?

v1.60.5

Twitter / LinkedIn details

No response

@vmajor

vmajor commented Feb 6, 2025

Since I was copied into this, I am guessing at least one of my reports made it through, even though I cannot see them anywhere.

I do not use ollama; I use the OG llama-server, and aider also ignores any and all --timeout settings that I tried, timing out the session mid-response.

I have yet to see any advice on how to stop this from happening. Is there a separate setting in model config that talks directly to LiteLLM? I do not have a standalone LiteLLM installation, only what was installed by Aider itself.

Ideally, in any situation where the API is on localhost, any and all timeout settings should be disabled, since we are directly in control of the API's health and behaviour. Aider/LiteLLM should not interfere with this.

@paul-gauthier
Contributor Author

@vmajor Please follow up back in the aider issue:
Aider-AI/aider#276

@krrishdholakia
Contributor

Thanks for the issue @paul-gauthier. I believe we just need to refactor ollama_chat to also use the base_llm_http_handler; that should fix this.
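
For reference, a minimal sketch of what honoring the timeout looks like once the request goes through an httpx-based handler. The endpoint URL and payload shape below are assumptions based on Ollama's /api/chat API, not LiteLLM's actual handler code.

import httpx

def ollama_chat_request(model, messages, timeout):
    # Hypothetical helper: forwards the caller's timeout to the HTTP client
    # that talks to the local Ollama server (default port 11434).
    try:
        resp = httpx.post(
            "http://localhost:11434/api/chat",
            json={"model": model, "messages": messages, "stream": False},
            timeout=timeout,
        )
        resp.raise_for_status()
        return resp.json()
    except httpx.TimeoutException as exc:
        # A handler wired this way can surface the timeout (e.g. as
        # litellm.exceptions.Timeout) instead of letting the call hang.
        raise TimeoutError(f"Ollama chat request timed out after {timeout}s") from exc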

@krrishdholakia krrishdholakia self-assigned this Feb 7, 2025