Some providers even return HTTP headers that tell the client how much of the current rate limit remains and when it will reset, so it should be possible to throttle requests to those providers automatically (sleep until the next "reset" point).
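To make the idea concrete, here is a rough sketch of the "sleep until reset" decision as a pure function over response headers. The header names `x-ratelimit-remaining` and `x-ratelimit-reset-seconds` are hypothetical placeholders, not any specific provider's names, and the reset value is assumed to be a whole number of seconds; each client would need to map its provider's real headers and formats onto this.

```rust
use std::collections::HashMap;
use std::time::Duration;

/// Decide how long to pause before the next request, based on
/// rate-limit headers from the previous response.
///
/// Assumption: both header names below are made up for illustration,
/// and the reset header holds a whole number of seconds.
fn throttle_delay(headers: &HashMap<String, String>) -> Option<Duration> {
    // Missing or unparsable headers mean "no information": do not throttle.
    let remaining: u64 = headers
        .get("x-ratelimit-remaining")?
        .trim()
        .parse()
        .ok()?;
    if remaining > 0 {
        // Budget left in the current window, so no need to wait.
        return None;
    }
    let reset_secs: u64 = headers
        .get("x-ratelimit-reset-seconds")?
        .trim()
        .parse()
        .ok()?;
    if reset_secs == 0 {
        return None;
    }
    // Budget exhausted: wait until the provider says the window resets.
    Some(Duration::from_secs(reset_secs))
}
```

A caller would check the result after each response and, if it is `Some(delay)`, call `std::thread::sleep(delay)` (or the async runtime's equivalent) before sending the next request.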
A call to the OpenAI embeddings API for example returns this:
Oooh interesting, that would be way better than trying to wrap some Rust rate limiter around every client. Will scour all these clients and see which ones support that besides OpenAI.
Different providers have different limits, so this might be hard to coordinate. We will also need settings to disable or alter the limits for self-hosted options.
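One possible shape for such a setting, purely as a sketch: a per-provider value that is either disabled (e.g. for self-hosted backends) or a requests-per-minute cap, from which a minimum spacing between requests is derived. The `RateLimit` type and its variants are hypothetical names, not existing code in this project.

```rust
use std::time::Duration;

/// Hypothetical per-provider rate-limit setting.
#[derive(Debug, Clone, PartialEq)]
enum RateLimit {
    /// No client-side limiting, e.g. for a self-hosted backend.
    Disabled,
    /// At most this many requests per minute.
    PerMinute(u32),
}

impl RateLimit {
    /// Minimum spacing between consecutive requests,
    /// or `None` when no limiting should be applied.
    fn min_interval(&self) -> Option<Duration> {
        match self {
            RateLimit::Disabled => None,
            // Assumption: a cap of 0 is treated as "not configured"
            // rather than "block everything"; this also avoids
            // dividing by zero below.
            RateLimit::PerMinute(0) => None,
            RateLimit::PerMinute(n) => Some(Duration::from_secs(60) / *n),
        }
    }
}
```

Each client could hold its own `RateLimit`, with defaults per provider and an override in user settings, so the limits do not have to be coordinated globally.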