feat: interfaces for async embeddings, implement async openai #6111

tyree731 · 2023-06-13T20:25:42Z

This change adds two methods to the base `Embeddings` class, `aembed_query` and `aembed_documents`, which are the async equivalents of `embed_query` and `embed_documents` respectively. This rounds out async support within langchain slightly, with an initial implementation provided for openai.

Implements #6109
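
For illustration, here is a minimal sketch of how a caller might use the sync/async method pairs side by side. `ToyEmbeddings` and `embed_all` are hypothetical names invented for this example, and the "embedding" is a toy calculation, not the OpenAI implementation from this PR.

```python
import asyncio
from typing import List


class ToyEmbeddings:
    """Hypothetical provider exposing the sync and async method pairs."""

    def embed_query(self, text: str) -> List[float]:
        # Toy "embedding": text length and vowel count, not a real model.
        return [float(len(text)), float(sum(c in "aeiou" for c in text))]

    def embed_documents(self, texts: List[str]) -> List[List[float]]:
        return [self.embed_query(t) for t in texts]

    async def aembed_query(self, text: str) -> List[float]:
        # A real provider would await a network call here.
        await asyncio.sleep(0)
        return self.embed_query(text)

    async def aembed_documents(self, texts: List[str]) -> List[List[float]]:
        await asyncio.sleep(0)
        return self.embed_documents(texts)


async def embed_all(emb: ToyEmbeddings, query: str, docs: List[str]):
    # Both requests are scheduled concurrently instead of back to back.
    return await asyncio.gather(
        emb.aembed_query(query), emb.aembed_documents(docs)
    )


query_vec, doc_vecs = asyncio.run(embed_all(ToyEmbeddings(), "hi", ["a", "bcd"]))
```

The point of the async pair is that an event loop can overlap the query and document requests rather than waiting on each in turn.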

tyree731 · 2023-06-13T20:27:01Z

CC @hwchase17 @dev2049 for review. The `_aget_len_safe_embeddings` bit is a bit rough, as I couldn't think of a great way to reuse the non-embedding work from `_get_len_safe_embeddings`. Open to suggestions there.
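
One possible way to share the non-embedding work, sketched under assumptions: keep the splitting and recombining steps in pure helper functions, and let the sync and async paths differ only in the single call that embeds the chunks. All names below (`split_texts`, `combine`, `embed_safe`, `aembed_safe`) are hypothetical, whitespace splitting stands in for real tokenization, and the plain average is a simplification of whatever the real method does.

```python
import asyncio
from typing import Awaitable, Callable, List, Tuple

Vector = List[float]


def split_texts(texts: List[str], max_tokens: int) -> Tuple[List[str], List[int]]:
    """Split each text into chunks of at most max_tokens whitespace tokens.

    Returns the chunks plus, for each chunk, the index of its source text.
    """
    chunks: List[str] = []
    owners: List[int] = []
    for i, text in enumerate(texts):
        tokens = text.split()
        for start in range(0, len(tokens), max_tokens):
            chunks.append(" ".join(tokens[start : start + max_tokens]))
            owners.append(i)
    return chunks, owners


def combine(n_texts: int, owners: List[int], vectors: List[Vector]) -> List[Vector]:
    """Average the chunk vectors that belong to the same source text."""
    grouped: List[List[Vector]] = [[] for _ in range(n_texts)]
    for owner, vec in zip(owners, vectors):
        grouped[owner].append(vec)
    return [[sum(col) / len(vecs) for col in zip(*vecs)] for vecs in grouped]


def embed_safe(
    texts: List[str],
    max_tokens: int,
    embed_batch: Callable[[List[str]], List[Vector]],
) -> List[Vector]:
    chunks, owners = split_texts(texts, max_tokens)
    return combine(len(texts), owners, embed_batch(chunks))


async def aembed_safe(
    texts: List[str],
    max_tokens: int,
    aembed_batch: Callable[[List[str]], Awaitable[List[Vector]]],
) -> List[Vector]:
    # Only this line differs from the sync path: the batch call is awaited.
    chunks, owners = split_texts(texts, max_tokens)
    return combine(len(texts), owners, await aembed_batch(chunks))


def toy_batch(chunks: List[str]) -> List[Vector]:
    # Toy embedding: one dimension holding the chunk's token count.
    return [[float(len(c.split()))] for c in chunks]


async def atoy_batch(chunks: List[str]) -> List[Vector]:
    await asyncio.sleep(0)
    return toy_batch(chunks)


sync_out = embed_safe(["a b c", "d"], 2, toy_batch)
async_out = asyncio.run(aembed_safe(["a b c", "d"], 2, atoy_batch))
```

With this split, only the thin `embed_safe` / `aembed_safe` wrappers are duplicated, while the chunking and averaging logic lives in one place.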

tyree731 · 2023-06-14T20:21:14Z

Let me know if there is anything I can do to help push this along. I know you're all busy, but just want to make sure there isn't anything on my end holding this up. Also:

- I wasn't 100% sure where I should be adding more documentation, so if more is needed, kindly point me in the right direction.
- I added integration tests for openai async embeddings, but if I should also add tests for fake async embeddings or something similar, I can do so.

Thanks!

langchain/embeddings/base.py

eyurtsev · 2023-06-14T17:38:26Z

langchain/embeddings/openai.py

@@ -53,6 +54,38 @@ def _create_retry_decorator(embeddings: OpenAIEmbeddings) -> Callable[[Any], Any
    )


+def _async_retry_decorator(embeddings: OpenAIEmbeddings) -> Any:


Note: this mirrors the sync decorator, which also takes embeddings as an argument (it should take retry parameters, not embeddings), but that's OK since it mirrors existing functionality.
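
To make the note above concrete, here is a rough sketch of an async retry decorator that takes retry parameters directly instead of an embeddings object. This is a hand-rolled stdlib version for illustration only, not the decorator in this PR; `async_retry`, `max_attempts`, `retry_on`, and `base_delay` are hypothetical names.

```python
import asyncio
import functools
from typing import Any, Awaitable, Callable, Tuple, Type

AsyncFunc = Callable[..., Awaitable[Any]]


def async_retry(
    max_attempts: int,
    retry_on: Tuple[Type[BaseException], ...],
    base_delay: float = 0.0,
) -> Callable[[AsyncFunc], AsyncFunc]:
    """Build a decorator that retries a coroutine function on given errors."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")

    def decorator(func: AsyncFunc) -> AsyncFunc:
        @functools.wraps(func)
        async def wrapper(*args: Any, **kwargs: Any) -> Any:
            for attempt in range(1, max_attempts + 1):
                try:
                    return await func(*args, **kwargs)
                except retry_on:
                    if attempt == max_attempts:
                        raise  # out of attempts: surface the last error
                    # Exponential backoff: base_delay, 2x, 4x, ...
                    await asyncio.sleep(base_delay * 2 ** (attempt - 1))

        return wrapper

    return decorator


calls = {"n": 0}


@async_retry(max_attempts=3, retry_on=(ConnectionError,))
async def flaky() -> str:
    # Fails twice with a retryable error, then succeeds.
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("transient failure")
    return "ok"


result = asyncio.run(flaky())
```

Because the decorator depends only on plain parameters, it can be tested and reused without constructing an embeddings instance.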

langchain/embeddings/openai.py

eyurtsev · 2023-06-15T02:38:36Z

Hi @tyree731, code looks good. Let's remove the abstractmethod to reduce scope and avoid breaking changes, and then the code is good to merge.

vercel · 2023-06-16T01:26:48Z

Name	Status	Preview	Comments	Updated (UTC)
langchain	❌ Failed (Inspect)			Jun 16, 2023 1:26am

tyree731 · 2023-06-16T01:28:49Z

Updated based on your comments. Let me know if there's anything I need to do for the vercel failure.

hwchase17

We can add these methods to the base class, but let's just do something like

    async def aembed_documents(
        self, texts: List[str], chunk_size: Optional[int] = 0
    ) -> List[List[float]]:
        raise NotImplementedError

in the base class (rather than marking them as abstract and requiring each subclass to implement them).
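
A minimal sketch of why this avoids a breaking change, assuming a simplified stand-in for the base class (the real signatures may differ, and `SyncOnlyEmbeddings` is a hypothetical subclass): an existing subclass that implements only the sync methods can still be instantiated, and it fails only if the async methods are actually called.

```python
import asyncio
from abc import ABC, abstractmethod
from typing import List


class Embeddings(ABC):
    """Simplified stand-in for the base class under discussion."""

    @abstractmethod
    def embed_documents(self, texts: List[str]) -> List[List[float]]:
        """Embed a list of documents."""

    @abstractmethod
    def embed_query(self, text: str) -> List[float]:
        """Embed a single query."""

    # Not abstract: subclasses without async support keep working unchanged.
    async def aembed_documents(self, texts: List[str]) -> List[List[float]]:
        raise NotImplementedError

    async def aembed_query(self, text: str) -> List[float]:
        raise NotImplementedError


class SyncOnlyEmbeddings(Embeddings):
    """Hypothetical subclass written before the async methods existed."""

    def embed_documents(self, texts: List[str]) -> List[List[float]]:
        return [[float(len(t))] for t in texts]

    def embed_query(self, text: str) -> List[float]:
        return [float(len(text))]


emb = SyncOnlyEmbeddings()  # still instantiable: no new abstract methods
sync_result = emb.embed_query("abc")

try:
    asyncio.run(emb.aembed_query("abc"))
    async_supported = True
except NotImplementedError:
    async_supported = False
```

Had the async methods been marked with `@abstractmethod`, instantiating `SyncOnlyEmbeddings` would raise `TypeError`, which is the breakage the reviewers want to avoid.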

tyree731 · 2023-06-21T19:54:53Z

Sorry for the delay in responding, @hwchase17. I'm currently on vacation in Germany for the next couple of weeks, so I'll get to your comments when I get back.

@tyree731

Since it seems like #6111 will be blocked for a bit, I've forked @tyree731's fork and implemented the requested changes.

This change adds support to the base Embeddings class for two methods, aembed_query and aembed_documents, those two methods supporting async equivalents of embed_query and embed_documents respectively. This ever so slightly rounds out async support within langchain, with an initial implementation of this functionality being implemented for openai. Implements #6109

Co-authored-by: Stephen Tyree <tyree731@gmail.com>

eyurtsev reviewed Jun 15, 2023

nit: remove abstract method, raise AssertionError instead

332d273

vercel bot temporarily deployed to Preview June 16, 2023 01:26 Inactive

hwchase17 reviewed Jun 17, 2023

dev2049 added the 03 enhancement Enhancement of existing functionality label Jun 21, 2023

BrendanGraham14 mentioned this pull request Jun 21, 2023

feat: interfaces for async embeddings, implement async openai #6563

Merged

hwchase17 closed this Jul 6, 2023
