
How to use embedding correctly #547

Open
xuzeyu91 opened this issue Feb 27, 2024 · 4 comments
@xuzeyu91

What kind of model should be used for embedding? When I use nomic-embed-text-v1.5.f32.gguf, it throws a protected-memory (access violation) error, while tinyllama-1.1b-chat.gguf runs normally. However, the returned float array does not seem correct: when I run vector matching on the same text, the similarity is only 0.42.

@martindevans
Member

I'm not familiar with nomic, but if it's based on the BERT architecture it's not supported in LLamaSharp yet. BERT support was only added to llama.cpp a couple of weeks ago (ggerganov/llama.cpp#5423), and we haven't updated our binaries yet.

However, I feel that the returned float array is not correct. When I use the same text for vector matching, the similarity is only 0.42

Do you mean you literally fed the same text in twice and the results weren't identical? If so, that's definitely a bug!
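As a sanity check independent of any particular model: the cosine similarity of an embedding with itself is always exactly 1.0, so a score of 0.42 for identical text means the embedder is returning different (or broken) vectors on each call. A minimal sketch of the check, with a made-up vector standing in for a real embedding:

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Any vector compared with itself must score 1.0 (up to float error).
v = [0.1, -0.5, 0.3, 0.8]
print(cosine_similarity(v, v))
```

Embedding the same text twice and comparing the two result vectors this way should likewise give ~1.0; anything lower indicates the embeddings themselves are unstable.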

@ladeak

ladeak commented Feb 28, 2024

I have the same issue using the phi-2 and llama models through the Semantic Kernel integration. The values returned from the 'memory' seem to be completely independent of the search value, even when the search query is an exact match.

@AshD

AshD commented Feb 28, 2024

I experienced the same poor similarity matching with Semantic Kernel.
Once LLamaSharp updates its binaries to support BERT models, this issue should go away.

@ladeak

ladeak commented Feb 28, 2024

Why will supporting BERT models help?
@AshD, could you expand on why the issue should go away?
