Implement Quick GELU #254

monatis · 2023-06-12T22:14:15Z

Closes #253

This is needed for CLIP-like models and I'm implementing clip.cpp here. It will also be a base for upcoming multimodal models that uses CLIP as an image embedder.

Green-Sky · 2023-06-12T23:03:19Z

you let some formatting happen. please only commit code changes.

monatis · 2023-06-12T23:55:30Z

Oh sorry, missed that. I'll fix it.

This reverts commit ff220cc.

monatis · 2023-06-13T06:37:14Z

This is ready for review now.

monatis · 2023-06-16T12:54:38Z

Hi @ggerganov Can ı have your attention here please? clip.cpp is almost ready, and I'm only crafting more examples and then I can announce it.

I can also add a link in the examples section in readme if you thing it deserves this.

p.s.: The next step will be using clip.cpp and llama.cpp to infer with LLaVA

ggerganov

Rename to ggml_gelu_quick, GELU_QUICK, etc, in all instances

From what I read, this is just an approximation of GELU that is supposed to be faster but inaccurate. Why not use the original ggml_gelu()? It is already doing table lookup so there is no difference in performance + it is more accurate

Also, keep in mind that in the future you can use ggml_map_unary_f32() to implement this kind of 1D mapping functions in your project without having to wait for ggml to provide them

monatis · 2023-06-16T14:02:56Z

Unfortunately that theoretically small divergence between GELU and Quick GELU lead to large differences at the end, I suppose it accumulates through 12 layers. So I couldn't get good results until implementing Quick GELU.

This causes long prompts to parse very slowly.

Implement Quick GELU

ff220cc

monatis added 5 commits June 13, 2023 03:45

Revert "Implement Quick GELU"

650b1d3

This reverts commit ff220cc.

Tidy up ggml.h

2d6bf53

Respect to the style of ggml

8973f4f

Merge branch 'quick-gelu'

3010bea

Fix: Fix minor typo

eca2e16

monatis mentioned this pull request Jun 16, 2023

Hey there monatis/clip.cpp#1

Closed

ggerganov requested changes Jun 16, 2023

View reviewed changes

Rename quick_gelu -> gelu_quick

668e734

monatis requested a review from ggerganov June 16, 2023 14:04

ggerganov approved these changes Jun 16, 2023

View reviewed changes

ggerganov merged commit 873f19f into ggerganov:master Jun 16, 2023

CCLDArjun pushed a commit to CCLDArjun/ggml that referenced this pull request Dec 18, 2023

Fix n^2 loop in tokenization (ggerganov#254)

a81d0c2

This causes long prompts to parse very slowly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Quick GELU #254

Implement Quick GELU #254

monatis commented Jun 12, 2023

Green-Sky commented Jun 12, 2023

monatis commented Jun 12, 2023

monatis commented Jun 13, 2023

monatis commented Jun 16, 2023

ggerganov left a comment

monatis commented Jun 16, 2023 •

edited

Loading

Implement Quick GELU #254

Implement Quick GELU #254

Conversation

monatis commented Jun 12, 2023

Green-Sky commented Jun 12, 2023

monatis commented Jun 12, 2023

monatis commented Jun 13, 2023

monatis commented Jun 16, 2023

ggerganov left a comment

Choose a reason for hiding this comment

monatis commented Jun 16, 2023 • edited Loading

monatis commented Jun 16, 2023 •

edited

Loading