Allow the injection of TokenCountEstimator #705

andreadimaio · 2024-06-26T21:16:53Z

This PR resolves #697.

The idea is to allow the injection of the TokenCountEstimator interface, which can be used in different ways, for example to know in advance the cost of processing a certain text, or to choose one model or another based on the maximum number of tokens.

Today, when I look at the code to see which LLM providers implement this interface, I see

bam
watsonx
azure-openai

For the azure-openai I found something strange, the tokenizer variable is never set, so when creating tests I always get a NullPointerException(There are some TODOs in the code to understand what the points are).

As far as I understand, the correct way to create the tokenizer for azure-openai is to use something like:

new AzureOpenAiTokenizer(<modelId>)

but I'm not able to test this, so what I've done is just put some TODOs in the code.

Is there anyone who can look into this? Does it make sense to have a TokenCountEstimator for azure-openai or should I just ignore it for now?

geoand · 2024-06-27T06:43:53Z

Is there anyone who can look into this? Does it make sense to have a TokenCountEstimator for azure-openai or should I just ignore it for now?

Maybe @csotiriou knows :)

geoand

Very nice!

Allow the injection of TokenCountEstimator

cce6303

andreadimaio requested a review from a team as a code owner June 26, 2024 21:18

geoand approved these changes Jun 27, 2024

View reviewed changes

geoand merged commit ed9420e into quarkiverse:main Jun 27, 2024
12 checks passed

andreadimaio deleted the token_count_estimator branch June 27, 2024 06:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow the injection of TokenCountEstimator #705

Allow the injection of TokenCountEstimator #705

andreadimaio commented Jun 26, 2024 •

edited

Loading

geoand commented Jun 27, 2024

geoand left a comment

Allow the injection of TokenCountEstimator #705

Allow the injection of TokenCountEstimator #705

Conversation

andreadimaio commented Jun 26, 2024 • edited Loading

geoand commented Jun 27, 2024

geoand left a comment

Choose a reason for hiding this comment

andreadimaio commented Jun 26, 2024 •

edited

Loading