
Improvements to the Embedding functionality #776

Merged — 4 commits merged into develop on Jun 11, 2024

Conversation

dkotter (Collaborator)

@dkotter dkotter commented Jun 7, 2024

Description of the Change

While testing another plugin that uses the Embedding functionality from ClassifAI, I ran into a few issues that could be improved.

  1. The OpenAI API allows you to generate Embeddings from either a single string or an array of strings. In Updates to the OpenAI Embeddings Provider #758 we made some major improvements to Embeddings, one of which was to break content down into smaller chunks and embed each chunk. This was done with individual API requests, as I didn't realize you could send array data. This PR fixes that by checking the size of the content: if it's below the token limit, we send everything as a single request. For any bulk processing, this reduces the number of API requests significantly, helping with performance and helping to avoid rate limit errors.
  2. Also discovered a bug in our chunking code where each chunk would be larger than the last. We want an overlap between chunks, but a bug here meant the overlap grew with each iteration. This wasn't noticeable for smaller posts, but for long posts you'd end up with huge chunks that exceeded the token limit, resulting in API failures. It also meant more tokens were being used than necessary. For any bulk processing, this helps reduce the tokens used significantly.
  3. Ensure we set a higher timeout for the Azure OpenAI Embedding requests, as these can time out at the default of 5 seconds.
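The fixes in points 1 and 2 can be sketched as follows. This is a minimal Python illustration, not the plugin's actual PHP implementation; the constants, model name, and function names are all illustrative assumptions.

```python
# Illustrative sketch of the two fixes described above (the real plugin is
# PHP/WordPress; all names and limits here are assumptions):
#   - point 2: chunk with a CONSTANT overlap so chunks stay the same size
#   - point 1: batch chunks into ONE Embeddings request when they fit

TOKEN_LIMIT = 8191  # assumed per-request token limit
CHUNK_SIZE = 500    # assumed tokens per chunk
OVERLAP = 50        # assumed overlap between consecutive chunks
MODEL = "text-embedding-3-small"       # assumed model
REQUEST_TIMEOUT = 60                   # assumed raised timeout (point 3)

def chunk_content(tokens):
    """Split tokens into CHUNK_SIZE chunks with a constant OVERLAP.

    The bug in point 2 came from growing the overlap on each iteration;
    stepping by a fixed (CHUNK_SIZE - OVERLAP) keeps every chunk roughly
    the same size, so none exceed the token limit.
    """
    step = CHUNK_SIZE - OVERLAP
    return [tokens[i:i + CHUNK_SIZE] for i in range(0, len(tokens), step)]

def build_requests(chunks):
    """Batch all chunks into one request body when the combined size fits
    under the token limit (point 1: the API accepts an array of strings);
    otherwise fall back to one request per chunk."""
    total = sum(len(c) for c in chunks)
    if total <= TOKEN_LIMIT:
        return [{"model": MODEL, "input": chunks}]  # single API call
    return [{"model": MODEL, "input": c} for c in chunks]
```

For bulk processing, the batched path turns N per-chunk requests into one, which is where the rate-limit and performance gains described above come from.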

How to test the Change

  1. Set up the Classification Feature to use either OpenAI or Azure OpenAI (or try both)
  2. Preview a post and ensure results show
  3. (Optional) Test classifying a post and ensure results show

Changelog Entry

Added - Higher timeout for Azure OpenAI Embedding requests
Changed - Add the ability to send chunked content in a single request to the Embedding API
Fixed - Ensure we chunk content correctly, keeping each chunk roughly the same size

Credits

Props @dkotter

Checklist:

  • I agree to follow this project's Code of Conduct.
  • I have updated the documentation accordingly.
  • I have added tests to cover my change.
  • All new and existing tests pass.

@dkotter dkotter added this to the 3.1.0 milestone Jun 7, 2024
@dkotter dkotter self-assigned this Jun 7, 2024
@dkotter dkotter requested review from jeffpaul and a team as code owners June 7, 2024 20:19
@github-actions github-actions bot added the needs:code-review This requires code review. label Jun 7, 2024
@dkotter dkotter requested review from iamdharmesh and removed request for a team and jeffpaul June 7, 2024 20:46
@jeffpaul jeffpaul mentioned this pull request Jun 10, 2024
@iamdharmesh (Member) left a comment
Thanks for this improvement @dkotter. Code looks good and it tests well.

@dkotter dkotter merged commit fdddf1d into develop Jun 11, 2024
14 of 15 checks passed
@dkotter dkotter deleted the fix/faster-embedding-requests branch June 11, 2024 14:23