Consistent naming for counting tokens #4297
Labels
2.x
Related to Haystack v2.0
breaking change
P2
Medium priority, add to the next sprint if no P1 available
type:documentation
Improvements on the docs
type:enhancement
Milestone
I think it's a bit confusing how we use many different terms for number of tokens:
n_tokens
,max_seq_len
,max_tokens
,max_tokens_limit
,max_length
and earlierleftover_token_len
,n_full_prompt
,n_full_prompt_tokens
,n_skipped_tokens
. Maybe we could follow a convention of usingn_
when we count tokens andmax_
when we set a limit?_len
I would leave out. Soleftover_token_len
could becomen_leftover_tokens
andmax_seq_len
could becomemax_tokens
. What do you think?Originally posted by @julian-risch in #4179 (comment)
I agree that this is confusing and not consistent. Maybe we could combine this with your previous comment into one new PR to handle naming conventions?
Originally posted by @sjrl in #4179 (comment)
The text was updated successfully, but these errors were encountered: