Hello, Qwen2 implements its attention computation with GQA. Our LLaMA implementation already supports GQA, so it can handle GQA models such as LLaMA-3. Since the Qwen2 architecture is very similar to LLaMA, you can add Qwen2 support by extending our LLaMA implementation.
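For reference, GQA simply shares each key/value head across a group of query heads, which are expanded to the full query-head count before standard attention. Below is a minimal, self-contained sketch of that idea; it is illustrative only (not this repo's code), and the tensor names and sizes are assumptions:

```python
import torch
import torch.nn.functional as F

def repeat_kv(x: torch.Tensor, n_rep: int) -> torch.Tensor:
    """Expand (batch, num_kv_heads, seq, head_dim) to (batch, num_kv_heads * n_rep, seq, head_dim)."""
    b, h_kv, s, d = x.shape
    if n_rep == 1:
        return x
    return x[:, :, None, :, :].expand(b, h_kv, n_rep, s, d).reshape(b, h_kv * n_rep, s, d)

# Illustrative sizes: 32 query heads sharing 8 KV heads (as in LLaMA-3-8B).
batch, seq, head_dim = 1, 16, 128
num_q_heads, num_kv_heads = 32, 8

q = torch.randn(batch, num_q_heads, seq, head_dim)
k = torch.randn(batch, num_kv_heads, seq, head_dim)
v = torch.randn(batch, num_kv_heads, seq, head_dim)

# Repeat each KV head so every query head has a matching K/V head, then attend as usual.
k = repeat_kv(k, num_q_heads // num_kv_heads)
v = repeat_kv(v, num_q_heads // num_kv_heads)
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
print(out.shape)  # torch.Size([1, 32, 16, 128])
```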
Hi, which transformers version is this meant for? I am using the latest (4.46.2) but get: `TypeError: LlamaRotaryEmbedding.forward() got an unexpected keyword argument 'seq_len'`. It looks like an older version such as 4.37 may be required, but I quantized LLaMA-3.1-8B-Instruct.
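For context, that error usually comes from the rotary-embedding API change in newer transformers: releases around 4.37 and earlier accepted a `seq_len` keyword in `LlamaRotaryEmbedding.forward()`, while recent ones (such as 4.46.x) expect `position_ids` instead. Pinning the older release should avoid it; alternatively, a small compatibility wrapper could paper over the difference. This is a hedged sketch under those assumptions, and `get_cos_sin` is a hypothetical helper name, not part of this repo or of transformers:

```python
# Simplest fix is to pin the older release the code was written against:
#   pip install "transformers==4.37.2"
import torch

def get_cos_sin(rotary_emb, x, seq_len):
    """Call a LlamaRotaryEmbedding with whichever signature the installed transformers expects."""
    position_ids = torch.arange(seq_len, device=x.device).unsqueeze(0)
    try:
        # Newer transformers: forward(x, position_ids)
        return rotary_emb(x, position_ids)
    except TypeError:
        # Older transformers (<= 4.37): forward(x, seq_len=...)
        return rotary_emb(x, seq_len=seq_len)
```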