feat: add bert.cpp embeddings #222

mudler · 2023-05-10T12:02:35Z

Problem:

Embeddings from llama.cpp models are slow and do not really work here.

Solution:
This PR adds high-performance C++ bindings based on https://github.com/skeskinen/bert.cpp, which make embedding calculation bloody fast! You can check out the benchmarks here: https://github.com/skeskinen/bert.cpp/tree/master/benchmarks

This also allows any model to use bert behind the scenes for embeddings.

The PR also simplifies the examples.

mudler force-pushed the bert branch from d6ed43e to 19a2c08 May 10, 2023 12:05

mudler linked an issue May 10, 2023 that may be closed by this pull request

feature: embedding support #70

Closed

3 tasks

mudler mentioned this pull request May 10, 2023

feature: embedding support #70

Closed

3 tasks

mudler force-pushed the bert branch from 19a2c08 to 0e23f4e May 10, 2023 12:24

feat: add bert.cpp embeddings

426b255

mudler force-pushed the bert branch from 0e23f4e to 426b255 May 10, 2023 12:49

mudler merged commit f8ee209 into master May 10, 2023

mudler deleted the bert branch May 10, 2023 13:20
