You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Meanwhile, I have a distilled set of suggestions for how this library could be made more familiar to people who are already used to transformers and AutoModelForCausalLM.
Maybe some of these are already possible in the current version of rwkv. If so, I would be grateful if you could let me know how.
Many parameters are available for model.generate() in HF, but it seems to me that the absolutely essential ones that everyone uses are:
temperature ✅
top_p ✅
top_k
repetition_penalty
I am aware that alpha_frequency and alpha_presence are implemented, but these parameters are not usually found in presets that people have already come up with while working with other models. For this reason, having repetition_penalty would be valuable.
The text was updated successfully, but these errors were encountered:
I have expressed my interest in having RWKV officially implemented in Hugging Face in huggingface/transformers#17230.
Meanwhile, I have a distilled set of suggestions for how this library could be made more familiar to people who are already used to
transformers
andAutoModelForCausalLM
.Maybe some of these are already possible in the current version of
rwkv
. If so, I would be grateful if you could let me know how.1. Being able to load the tokenizer explicitly
with something like
and then use it with
Having the ability to count the number of tokens in a given prompt is very useful.
2. Generating text with input_ids as input rather than a string
Something like
3. Generation parameters
Many parameters are available for
model.generate()
in HF, but it seems to me that the absolutely essential ones that everyone uses are:I am aware that
alpha_frequency
andalpha_presence
are implemented, but these parameters are not usually found in presets that people have already come up with while working with other models. For this reason, havingrepetition_penalty
would be valuable.The text was updated successfully, but these errors were encountered: