Unfortunately, the OpenAI API does not support batched requests, so there will be one request per sample regardless.
The only possibility of speedup is to send multiple requests in parallel. Adding an async API could be nice (e.g. an a_predict method), but this is not really compliant with the scikit-learn API and might be confusing for some users. The more straightforward way would be to support synchronous parallel processing and let the user specify an n_jobs hyperparameter. This is something we have had in mind since day one, but never prioritised, since until relatively recently the rate limits would not have allowed for sufficient parallelisation anyway.
For now, you can simply split your dataset and run predict on each chunk in a thread pool, as sketched below.
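A minimal sketch of that workaround, assuming `clf` is an already-fitted scikit-llm classifier (e.g. `ZeroShotGPTClassifier`) and `X` is a list of texts; the chunk count and worker count are placeholders to tune against your own rate limits:

```python
from concurrent.futures import ThreadPoolExecutor

import numpy as np


def predict_in_parallel(clf, X, n_chunks=4, max_workers=4):
    # Split the dataset into roughly equal chunks of texts.
    chunks = np.array_split(np.asarray(X, dtype=object), n_chunks)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # map() preserves chunk order, so the concatenated output
        # lines up with the original X.
        results = pool.map(clf.predict, chunks)
    return np.concatenate(list(results))
```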
@WindChimeRan see #101.
Edit: I did some experimenting with the FewShotClassifier and quickly ran into rate limits. To be fair, it's for sentiment classification and some of the sampled reviews are VERY long (which I do not monitor), so potentially there is no real speed-up.
Hi,
The current scikit-llm is implemented in a synchronous way: the prompts are sent to the API one by one.
This is not ideal when we have a large dataset and a high-tier (high TPM/RPM) account. Is it possible to incorporate a batched async feature?
Reference:
oaib
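For illustration, a rough sketch of what concurrent async requests could look like using the official openai Python client's AsyncOpenAI interface; this is not part of scikit-llm, and the model name, prompt, and concurrency limit are placeholders:

```python
import asyncio

from openai import AsyncOpenAI


async def classify_async(texts, model="gpt-3.5-turbo", max_concurrency=8):
    client = AsyncOpenAI()
    # Cap the number of in-flight requests to stay within RPM limits.
    sem = asyncio.Semaphore(max_concurrency)

    async def one(text):
        async with sem:
            resp = await client.chat.completions.create(
                model=model,
                messages=[{"role": "user", "content": f"Classify the sentiment: {text}"}],
            )
            return resp.choices[0].message.content

    # Fire off all requests concurrently and collect results in input order.
    return await asyncio.gather(*(one(t) for t in texts))


# Example: labels = asyncio.run(classify_async(["great product", "terrible service"]))
```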