You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have followed the sharing setting in #79 to change the cache into your customized kv_cache (MistralAttention, MistralDecoderLayer, MistralModel, MistralForCausalLM, MistralForSequenceClassification) and not used the tree_mask after causal_mask. But the outputs are garbled characters. I think there are wrong settings in our modeling_mistral_kv.py. Could you please provide me with some suggestions?
The text was updated successfully, but these errors were encountered:
I have followed the sharing setting in #79 to change the cache into your customized kv_cache (MistralAttention, MistralDecoderLayer, MistralModel, MistralForCausalLM, MistralForSequenceClassification) and not used the tree_mask after causal_mask. But the outputs are garbled characters. I think there are wrong settings in our modeling_mistral_kv.py. Could you please provide me with some suggestions?
The text was updated successfully, but these errors were encountered: