llm.get_models() returns multiple instances of the same model #667

simonw · 2024-12-05T21:37:15Z

Just noticed this:

import llm
from pprint import pprint
pprint(llm.get_models())

[<Chat 'gpt-4o'>,
 <Chat 'gpt-4o'>,
 <Chat 'gpt-4o-mini'>,
 <Chat 'gpt-4o-mini'>,
 <Chat 'gpt-4o-audio-preview'>,
 <Chat 'gpt-3.5-turbo'>,
 <Chat 'gpt-3.5-turbo'>,
 <Chat 'gpt-3.5-turbo'>,
 <Chat 'gpt-3.5-turbo-16k'>,
 <Chat 'gpt-3.5-turbo-16k'>,
 <Chat 'gpt-3.5-turbo-16k'>,
 <Chat 'gpt-4'>,
 <Chat 'gpt-4'>,
 <Chat 'gpt-4'>,
 <Chat 'gpt-4-32k'>,
 <Chat 'gpt-4-32k'>,
 <Chat 'gpt-4-1106-preview'>,
 <Chat 'gpt-4-0125-preview'>,
 <Chat 'gpt-4-turbo-2024-04-09'>,
 <Chat 'gpt-4-turbo'>,
 <Chat 'gpt-4-turbo'>,
 <Chat 'gpt-4-turbo'>,
 <Chat 'gpt-4-turbo'>,
 <Chat 'o1-preview'>,
 <Chat 'o1-mini'>,
 <Completion 'gpt-3.5-turbo-instruct'>,
 <Completion 'gpt-3.5-turbo-instruct'>,
 <Completion 'gpt-3.5-turbo-instruct'>]

The text was updated successfully, but these errors were encountered:

simonw · 2024-12-05T21:38:35Z

It's suspicious that gpt-4-turbo shows up 4 times, but o1-preview and o1-mini only once.

Code in question:

llm/llm/default_plugins/openai_models.py

Lines 30 to 77 in e78fea1

    
           def register_models(register): 
        
               # GPT-4o 
        
               register( 
        
                   Chat("gpt-4o", vision=True), AsyncChat("gpt-4o", vision=True), aliases=("4o",) 
        
               ) 
        
               register( 
        
                   Chat("gpt-4o-mini", vision=True), 
        
                   AsyncChat("gpt-4o-mini", vision=True), 
        
                   aliases=("4o-mini",), 
        
               ) 
        
               register( 
        
                   Chat("gpt-4o-audio-preview", audio=True), 
        
                   AsyncChat("gpt-4o-audio-preview", audio=True), 
        
               ) 
        
               # 3.5 and 4 
        
               register( 
        
                   Chat("gpt-3.5-turbo"), AsyncChat("gpt-3.5-turbo"), aliases=("3.5", "chatgpt") 
        
               ) 
        
               register( 
        
                   Chat("gpt-3.5-turbo-16k"), 
        
                   AsyncChat("gpt-3.5-turbo-16k"), 
        
                   aliases=("chatgpt-16k", "3.5-16k"), 
        
               ) 
        
               register(Chat("gpt-4"), AsyncChat("gpt-4"), aliases=("4", "gpt4")) 
        
               register(Chat("gpt-4-32k"), AsyncChat("gpt-4-32k"), aliases=("4-32k",)) 
        
               # GPT-4 Turbo models 
        
               register(Chat("gpt-4-1106-preview"), AsyncChat("gpt-4-1106-preview")) 
        
               register(Chat("gpt-4-0125-preview"), AsyncChat("gpt-4-0125-preview")) 
        
               register(Chat("gpt-4-turbo-2024-04-09"), AsyncChat("gpt-4-turbo-2024-04-09")) 
        
               register( 
        
                   Chat("gpt-4-turbo"), 
        
                   AsyncChat("gpt-4-turbo"), 
        
                   aliases=("gpt-4-turbo-preview", "4-turbo", "4t"), 
        
               ) 
        
               # o1 
        
               register( 
        
                   Chat("o1-preview", can_stream=False, allows_system_prompt=False), 
        
                   AsyncChat("o1-preview", can_stream=False, allows_system_prompt=False), 
        
               ) 
        
               register( 
        
                   Chat("o1-mini", can_stream=False, allows_system_prompt=False), 
        
                   AsyncChat("o1-mini", can_stream=False, allows_system_prompt=False), 
        
               ) 
        
               # The -instruct completion model 
        
               register( 
        
                   Completion("gpt-3.5-turbo-instruct", default_max_tokens=256), 
        
                   aliases=("3.5-instruct", "chatgpt-instruct"), 
        
               )

I'm suspicious of the aliases.

simonw · 2024-12-05T21:39:35Z

Here's the problem: we aren't de-duping for aliases here:

llm/llm/__init__.py

Lines 170 to 177 in e78fea1

    
           def get_models() -> List[Model]: 
        
               "Get all registered models" 
        
               return [model for model in get_model_aliases().values()] 
        
           def get_async_models() -> List[AsyncModel]: 
        
               "Get all registered async models" 
        
               return [model for model in get_async_model_aliases().values()]

Bug was introduced in:

llm.get_models() and .get_async_models() documented functions #640

Refs #667

simonw added the bug Something isn't working label Dec 5, 2024

simonw closed this as completed in b6be09a Dec 5, 2024

simonw added a commit that referenced this issue Dec 5, 2024

Release 0.19.1

b8e8052

Refs #667

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llm.get_models() returns multiple instances of the same model #667

llm.get_models() returns multiple instances of the same model #667

simonw commented Dec 5, 2024

simonw commented Dec 5, 2024

simonw commented Dec 5, 2024

llm.get_models() returns multiple instances of the same model #667

llm.get_models() returns multiple instances of the same model #667

Comments

simonw commented Dec 5, 2024

simonw commented Dec 5, 2024

simonw commented Dec 5, 2024