Garbled characters decoding on Mistral-7b-v0.1-instruct #163

zengxy20 · 2024-11-25T10:57:48Z

I have followed the sharing setting in #79 to change the cache into your customized kv_cache (MistralAttention, MistralDecoderLayer, MistralModel, MistralForCausalLM, MistralForSequenceClassification) and not used the tree_mask after causal_mask. But the outputs are garbled characters. I think there are wrong settings in our modeling_mistral_kv.py. Could you please provide me with some suggestions?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Garbled characters decoding on Mistral-7b-v0.1-instruct #163

Garbled characters decoding on Mistral-7b-v0.1-instruct #163

zengxy20 commented Nov 25, 2024

Garbled characters decoding on Mistral-7b-v0.1-instruct #163

Garbled characters decoding on Mistral-7b-v0.1-instruct #163

Comments

zengxy20 commented Nov 25, 2024