v0.8.6 - support LongLLaMA
## Breaking changes
- Setting the internal `past` attribute of the cache to `None` will now cause an error to be raised if you try to use the cache again. Please use the original model instead.
## New features
- Support LongLLaMA
- `repr` for cached model
- Don't check logits from Llama CPP
## Bug fixes
None