Add support for the Gemma 2 model #84
Conversation
@guoqingbao the model runs but seems to give garbage output. Do you see anything which stands out?
Perhaps there is a need for `attn_logit_softcapping` if `final_logit_softcapping` is used.
I found that the recent update #83 overwrote previous bug fixes, including #80. @EricLBuehler
I have submitted PR #86 to resolve the issue: both `attn_logit_softcapping` and `final_logit_softcapping` are necessary for Gemma 2 inference. The corresponding PA kernel has also been revised. I'm curious why this isn't supported in vLLM; Google only mentioned that softcapping is beneficial for training.
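For readers following along, logit softcapping is a tanh-based squashing of logits into a bounded range. The sketch below is a minimal illustration of the general technique, not the code from PR #86; the function name `soft_cap` and the cap values shown are assumptions for illustration (Gemma 2's config reportedly uses caps of 50.0 for attention logits and 30.0 for final logits, but check the model config to confirm):

```python
import math

def soft_cap(logit: float, cap: float) -> float:
    # Squash a logit into (-cap, cap) via tanh. For |logit| << cap the
    # mapping is nearly identity; for large |logit| it saturates at ±cap.
    return cap * math.tanh(logit / cap)

# Illustrative usage with assumed Gemma 2-style caps:
attn_cap = 50.0   # applied to attention scores before softmax
final_cap = 30.0  # applied to the final logits before sampling

capped_attn = soft_cap(5.0, attn_cap)     # small logit, barely changed
capped_final = soft_cap(1000.0, final_cap)  # huge logit, saturates near 30.0
```

Skipping either cap at inference time means the model sees a different effective logit distribution than it was trained with, which is consistent with the garbage output reported above.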
@guoqingbao sounds good, I'll close this PR!
Refs #79