You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When I set background_health_checks to true in this configuration file and run the docker container, it enters a loop checking the models. See the relevant log output for more information.
model_list:
- model_name: os.environ/AZURE_OPENAI_MODELlitellm_params:
model: os.environ/AZURE_OPENAI_DEPLOYMENTapi_base: os.environ/AZURE_OPENAI_API_BASEapi_key: "os.environ/AZURE_OPENAI_API_KEY"api_version: "os.environ/AZURE_OPENAI_API_VERSION"# [OPTIONAL] litellm uses the latest azure api_version by defaultrpm: 50model_info:
mode: completion # This setting is used to determine how to check the health of the model. See https://docs.litellm.ai/docs/proxy/healthinput_cost_per_token: 0.000002399240output_cost_per_token: 0.000009597000max_tokens: 16384
- model_name: gpt-4o-minilitellm_params:
model: azure/gpt-4o-miniapi_base: os.environ/AZURE_OPENAI_API_BASEapi_key: "os.environ/AZURE_OPENAI_API_KEY"api_version: "os.environ/AZURE_OPENAI_API_VERSION"# [OPTIONAL] litellm uses the latest azure api_version by defaultrpm: 50model_info:
mode: completioninput_cost_per_token: 0.000000143960output_cost_per_token: 0.000000575900max_tokens: 16384litellm_settings:
ssl_verify: true # [OPTIONAL] Set to False to disable SSL verification (not recommended)request_timeout: 30# (int) llm requesttimeout in seconds. Raise Timeout error if call takes longer than 10s. Sets litellm.request_timeout etectionredactiongeneral_settings:
# Parallelismmax_parallel_requests: 5# the max parallel requests allowed per deployment global_max_parallel_requests: 100# the max parallel requests allowed on the proxy all up # Health and monitorization. https://docs.litellm.ai/docs/proxy/healthbackground_health_checks: true # Uses model_info.mode to determine how to check the health of each modelhealth_check_interval: 300# frequency of background health checkshealth_check_details: false # If false, hides health check details (e.g. remaining rate limit)
What happened?
When I set background_health_checks to true in this configuration file and run the docker container, it enters a loop checking the models. See the relevant log output for more information.
docker run -v $(pwd)/config.yaml:/app/config.yaml --env-file .env -p 4000:4000 ghcr.io/berriai/litellm:main-latest --config /app/config.yaml --detailed_debug
config.yaml
Relevant log output
Are you a ML Ops Team?
Yes
What LiteLLM version are you on ?
v1.60.0.dev4
Twitter / LinkedIn details
No response
The text was updated successfully, but these errors were encountered: