APIs for fetching and setting model and embedding providers #101

3coins · 2023-04-21T03:34:51Z

Summary

This PR introduces the concept of ProviderConfig, that provides the values for the llm and embeddings provider and model, along with the api keys for the chat to work.

Actors

ProvidersActor: This actor loads and keeps a list of model and embeddings providers
ChatProviderActor: This actor stores the updated model provider (LLM) and provider params, which are updated when config is updated.
EmbeddingsProviderActor: This actor stores the updated embeddings provider and params, which are updated when config is updated.
ConfigActor: This actor stores the ProviderConfig object which is shared globally with other actors above, and is responsible for loading and saving the config from/to the disk.

APIs

GET /api/ai/providers: Returns list of model providers and models they support
GET /api/ai/providers/embeddings: Returns list of embeddings providers and supported model options
GET /api/ai/config: Returns the currently set config as ProviderConfig
POST /api/ai/config: Updates the config, saves to disk and updates chat provider and embeddings

Notes

The anthropic models don't wrap the code segments in the response in a markdown code block; I have tried updating the prompt template and hasn't seen this working yet.

The current setup is not working with certain embeddings, particularly Cohere because of the way the AskActor references the vectorstore, and uses it to get the retriever. Cohere embeddings has some non-serializable parts, which causes errors when the AskActor loads it to set the retriever but is unable to de-serialize.

The last line here causes an exception

index_actor = ray.get_actor(ACTOR_TYPE.LEARN.value)
handle = index_actor.get_index.remote()
vectorstore = ray.get(handle)

Here is the exception:

Traceback (most recent call last):
File "/Users/pijain/projects/jupyter-ai-3.9/jupyter-ai/packages/jupyter-ai/jupyter_ai/actors/base.py", line 55, in process_message
    self._process_message(message)
File "/Users/pijain/Library/Application Support/hatch/env/virtual/jupyter-ai-monorepo/X-JWbDZw/jupyter-ai-monorepo/lib/python3.9/site-packages/ray/util/tracing/tracing_helper.py", line 466, in _resume_span
    return method(self, *_args, **_kwargs)
File "/Users/pijain/projects/jupyter-ai-3.9/jupyter-ai/packages/jupyter-ai/jupyter_ai/actors/ask.py", line 54, in _process_message
    vectorstore = ray.get(handle)
File "/Users/pijain/Library/Application Support/hatch/env/virtual/jupyter-ai-monorepo/X-JWbDZw/jupyter-ai-monorepo/lib/python3.9/site-packages/ray/_private/client_mode_hook.py", line 105, in wrapper
    return func(*args, **kwargs)
File "/Users/pijain/Library/Application Support/hatch/env/virtual/jupyter-ai-monorepo/X-JWbDZw/jupyter-ai-monorepo/lib/python3.9/site-packages/ray/_private/worker.py", line 2309, in get
    raise value.as_instanceof_cause()
ray.exceptions.RayTaskError(TypeError): �[36mray::LearnActor.get_index()�[39m (pid=90820, ip=127.0.0.1, repr=<jupyter_ai.actors.learn.LearnActor object at 0x138093fd0>)
File "/Users/pijain/Library/Application Support/hatch/env/virtual/jupyter-ai-monorepo/X-JWbDZw/jupyter-ai-monorepo/lib/python3.9/site-packages/ray/cloudpickle/cloudpickle_fast.py", line 73, in dumps
    cp.dump(obj)
File "/Users/pijain/Library/Application Support/hatch/env/virtual/jupyter-ai-monorepo/X-JWbDZw/jupyter-ai-monorepo/lib/python3.9/site-packages/ray/cloudpickle/cloudpickle_fast.py", line 627, in dump
    return Pickler.dump(self, obj)
TypeError: cannot pickle '_queue.SimpleQueue' object

Switching embedding providers will cause issues, since the saved index created with one provider is not portable to adding documents after switching to a new provider. In order to make this safely work for users, we have to make sure that the existing indexes are deleted when embeddings provider changes. Users should be aware of this change when they update config.

packages/jupyter-ai-magics/jupyter_ai_magics/utils.py

packages/jupyter-ai/jupyter_ai/models.py

packages/jupyter-ai/jupyter_ai/handlers.py

dlqqq

@3coins This is awesome work! Thank you very much for getting this out so quickly! 🎉 This is 95% complete; most of my suggestions are minor improvements. I had a few high-level comments in addition to the more granular ones below:

Can you create a separate branch (e.g. model-config) that's identical to main, and then set the target of this PR to that branch?
I'm tracking naming suggestions in a separate issue that should be addressed before the branch is merged into main: Model configurability naming changes #129

packages/jupyter-ai/jupyter_ai/actors/base.py

dlqqq · 2023-04-28T17:44:33Z

packages/jupyter-ai/jupyter_ai/actors/base.py

+
+        if not provider:
+            return None
+        if provider.__class__.__name__ != self.embeddings.__class__.__name__:


Can just use provider.id here 😁

Same for line 85.

Could not do this for the embedding provider. There is a workaround for this, but want to iterate on this in the following PRs.

packages/jupyter-ai/jupyter_ai/actors/chat_provider.py

packages/jupyter-ai/jupyter_ai/actors/embeddings_provider.py

packages/jupyter-ai/jupyter_ai/actors/providers.py

packages/jupyter-ai/jupyter_ai/extension.py

packages/jupyter-ai/jupyter_ai/models.py

packages/jupyter-ai/jupyter_ai/actors/generate.py

…handling

3coins · 2023-05-01T16:53:55Z

@dlqqq
Taken care of most of the comments, issue 2 from notes is fixed now, so embedding providers work without any issues.
For issue 3 in the notes, I have added some error checking around failures when embedding provider is updated and /ask command is used, which should indicate the user to delete the index. For long term, we should not only delete the index when the embedding provider changes, but also re-index directories user has indexed with the previous provider; this needs some more work and can't be handled in this PR.

3coins added enhancement New feature or request maintenance Change related to maintenance of the repository labels Apr 21, 2023

dlqqq requested changes Apr 21, 2023

View reviewed changes

packages/jupyter-ai-magics/jupyter_ai_magics/utils.py Show resolved Hide resolved

packages/jupyter-ai/jupyter_ai/models.py Outdated Show resolved Hide resolved

dlqqq requested changes Apr 21, 2023

View reviewed changes

packages/jupyter-ai/jupyter_ai/handlers.py Outdated Show resolved Hide resolved

3coins force-pushed the add-config-apis branch from 9c85dc9 to f2fd428 Compare April 25, 2023 05:16

3coins requested review from dlqqq and ellisonbg April 27, 2023 04:41

3coins changed the title ~~Refactored provider load, decompose logic, aded model provider list api~~ APIs for fetching and setting model and embedding providers Apr 27, 2023

3coins marked this pull request as ready for review April 27, 2023 04:48

3coins self-assigned this Apr 27, 2023

3coins marked this pull request as draft April 27, 2023 04:49

3coins added this to the 0.7.0 Release milestone Apr 27, 2023

dlqqq requested changes Apr 28, 2023

View reviewed changes

3coins changed the base branch from main to model-config April 28, 2023 22:04

3coins added 11 commits April 29, 2023 21:48

Refactored provider load, decompose logic, aded model provider list api

b2e3ebe

Renamed model

f29ff44

Sorted the provider names

1a08f2a

WIP: Embedding providers

cbadcbf

Added embeddings provider api

ad555fa

Added missing import

9da927d

Moved providers to ray actor, added config actor

1ddbd6b

Ability to load llm and embeddings from config

e8db497

Moved llm creation to specific actors

ae167f1

Added apis for fetching, updating config. Fixed config update, error …

0784f43

…handling

Updated as per PR feedback

71f5c5f

3coins force-pushed the add-config-apis branch from e0a96a9 to 71f5c5f Compare April 30, 2023 04:49

3coins added 2 commits April 29, 2023 22:37

Fixes issue with cohere embeddings, api keys not working

559afc9

Added an error check when embedding change causes read error

1fcf7f9

dlqqq approved these changes May 1, 2023

View reviewed changes

3coins marked this pull request as ready for review May 1, 2023 17:15

3coins merged commit 72b69ca into jupyterlab:model-config May 1, 2023

dlqqq mentioned this pull request May 5, 2023

Runtime model configurability #146

Merged

3coins deleted the add-config-apis branch May 5, 2023 18:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

APIs for fetching and setting model and embedding providers #101

APIs for fetching and setting model and embedding providers #101

3coins commented Apr 21, 2023 •

edited

Loading

dlqqq left a comment

dlqqq Apr 28, 2023

dlqqq Apr 28, 2023

3coins May 1, 2023

3coins commented May 1, 2023

APIs for fetching and setting model and embedding providers #101

APIs for fetching and setting model and embedding providers #101

Conversation

3coins commented Apr 21, 2023 • edited Loading

Summary

Actors

APIs

Notes

dlqqq left a comment

Choose a reason for hiding this comment

dlqqq Apr 28, 2023

Choose a reason for hiding this comment

dlqqq Apr 28, 2023

Choose a reason for hiding this comment

3coins May 1, 2023

Choose a reason for hiding this comment

3coins commented May 1, 2023

3coins commented Apr 21, 2023 •

edited

Loading