You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Perhaps not so "novel" at this point. Min P has been integrated into a handful of backends at this point. Works just as one would imagine, acts as a floor for what the sampler will consider as "best response". This typically replaces K.
DRY I know less about, but I guess it comes from the "Don't Repeat Yourself" coding mantra and is a refined, clever formulation that seems to get the job done better than repetition penalty does. As some Redditor explained "it prevents within-sequence verbatim repetition (other solutions are still needed to prevent across-sequence repetition, synonym repetition, list repetition, etc.)."
These are definitely solid enhancements that aren't difficult to implement + wouldn't alter/hurt MLC-LLM's current offerings so they should be no-brainers.
🚀 Feature
Perhaps not so "novel" at this point. Min P has been integrated into a handful of backends at this point. Works just as one would imagine, acts as a floor for what the sampler will consider as "best response". This typically replaces K.
DRY I know less about, but I guess it comes from the "Don't Repeat Yourself" coding mantra and is a refined, clever formulation that seems to get the job done better than repetition penalty does. As some Redditor explained "it prevents within-sequence verbatim repetition (other solutions are still needed to prevent across-sequence repetition, synonym repetition, list repetition, etc.)."
Ah, no wonder, it is a very DIY origin it seems. oobabooga/text-generation-webui#5677
Motivation
These are definitely solid enhancements that aren't difficult to implement + wouldn't alter/hurt MLC-LLM's current offerings so they should be no-brainers.
Additional context
https://github.com/EricLBuehler/mistral.rs/pull/625/files
https://github.com/EricLBuehler/mistral.rs/pull/637/files
Mistral.rs happened to add this and DRY recently, so I'll just use those as demonstrations of the simplicity of these add-ins.
The text was updated successfully, but these errors were encountered: