
Generate control vector using llama.cpp #6880

Closed
ngxson opened this issue Apr 24, 2024 · 9 comments · Fixed by #7514
Labels
enhancement New feature or request


@ngxson
Collaborator

ngxson commented Apr 24, 2024

Motivation

Support for control vectors was added in #5970, but to generate a vector, users must currently run Python code (which uses the Hugging Face API instead of llama.cpp).

Possible Implementation

By looking at https://github.com/vgel/repeng/blob/main/repeng/extract.py , we can see some key steps that need to be adapted to C++:

  • batched_get_hiddens returns a list of embedding vectors from hidden layers. This can be done in llama.cpp using the eval callback (see the sketch after this list)
  • For the moment, we can't find any lightweight implementation of Principal Component Analysis (PCA) in C++. An idea would be to replace it with this UMAP C++ implementation
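
A rough sketch (untested) of what the hidden-state capture could look like. It assumes the `ggml_backend_sched_eval_callback` signature used by the eval-callback example, and that layer outputs are named `l_out-<n>` in the compute graph; both details may differ between versions:

```cpp
#include <cstring>
#include <vector>

#include "ggml.h"
#include "ggml-backend.h"

// Collects the output tensor of each transformer layer during evaluation,
// similar to what repeng's batched_get_hiddens does in Python.
struct hidden_state_collector {
    std::vector<std::vector<float>> layer_outputs; // one flat F32 buffer per layer
};

static bool collect_hiddens_cb(struct ggml_tensor * t, bool ask, void * user_data) {
    auto * collector = (hidden_state_collector *) user_data;
    // Heuristic (assumption): layer outputs are named "l_out-<n>" in the graph.
    const bool is_layer_output = strncmp(ggml_get_name(t), "l_out", 5) == 0;
    if (ask) {
        // The scheduler asks whether we want to observe this tensor's data.
        return is_layer_output;
    }
    // Assumes F32 tensors; a real implementation would convert other types.
    std::vector<float> data(ggml_nelements(t));
    ggml_backend_tensor_get(t, data.data(), 0, ggml_nbytes(t));
    collector->layer_outputs.push_back(std::move(data));
    return true; // continue graph execution
}

// Attached via llama_context_params before creating the context, e.g.:
//   cparams.cb_eval           = collect_hiddens_cb;
//   cparams.cb_eval_user_data = &collector;
```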
ngxson added the enhancement label Apr 24, 2024
@jukofyork
Contributor

* For the moment, we can't find any lightweight implementation of Principal Component Analysis (PCA) in C++. An idea would be to replace it with [this UMAP C++ implementation](https://github.com/LTLA/umappp)
pca_model = PCA(n_components=1, whiten=False).fit(train)

Parameters:

    n_components : int, float or 'mle', default=None

        Number of components to keep. If n_components is not set, all components are kept.

This looks like it's just computing the single eigenvector associated with the dominant eigenvalue? If so, it can be computed very easily in about 10 lines of C:

https://en.m.wikipedia.org/wiki/Power_iteration
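
For example, a minimal (untested) sketch of power iteration for a square matrix; the names and the fixed iteration count are just illustrative:

```cpp
#include <cmath>
#include <cstdlib>
#include <vector>

// Estimates the dominant eigenvector of a square n x n matrix W (row-major)
// by repeatedly normalizing v and multiplying it by W.
std::vector<float> power_iteration(const std::vector<float> & W, int n, int iters = 100) {
    std::vector<float> v(n), Wv(n);
    for (int i = 0; i < n; i++) {
        v[i] = (float) rand() / RAND_MAX; // random starting vector
    }
    for (int it = 0; it < iters; it++) {
        // normalize v to unit L2 norm so the values don't blow up
        float norm = 0.0f;
        for (int i = 0; i < n; i++) norm += v[i] * v[i];
        norm = std::sqrt(norm);
        for (int i = 0; i < n; i++) v[i] /= norm;
        // v = W . v_norm
        for (int i = 0; i < n; i++) {
            Wv[i] = 0.0f;
            for (int j = 0; j < n; j++) Wv[i] += W[i * n + j] * v[j];
        }
        v.swap(Wv);
    }
    return v; // converges towards the principal eigenvector for "nice" matrices
}
```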

If for any reason this doesn't work, you can also compute it via the SVD, which can easily be found using gradient descent: minimise the Frobenius norm of the difference between the matrix and the outer product of two vectors, then standardise the vectors to have an L2 norm of 1 (this method is often used in recommendation systems).
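
Something like this (again untested, just a sketch; the learning rate and iteration count are arbitrary):

```cpp
#include <cmath>
#include <cstdlib>
#include <vector>

// Finds the best rank-1 approximation W ~ u * v^T of an m x n row-major
// matrix W by stochastic gradient descent on the squared Frobenius norm of
// the residual, then rescales u and v to unit L2 norm.
void rank1_approx(const std::vector<float> & W, int m, int n,
                  std::vector<float> & u, std::vector<float> & v,
                  float lr = 0.01f, int iters = 1000) {
    u.resize(m); v.resize(n);
    for (int i = 0; i < m; i++) u[i] = (float) rand() / RAND_MAX;
    for (int j = 0; j < n; j++) v[j] = (float) rand() / RAND_MAX;
    for (int it = 0; it < iters; it++) {
        for (int i = 0; i < m; i++) {
            for (int j = 0; j < n; j++) {
                const float ui = u[i];
                const float r  = W[i * n + j] - ui * v[j]; // residual of the rank-1 fit
                u[i] += lr * r * v[j];                     // gradient step on u
                v[j] += lr * r * ui;                       // gradient step on v
            }
        }
    }
    auto normalize = [](std::vector<float> & x) {          // standardise to L2 norm of 1
        float s = 0.0f;
        for (float xi : x) s += xi * xi;
        s = std::sqrt(s);
        for (float & xi : x) xi /= s;
    };
    normalize(u);
    normalize(v);
}
```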

@Yorizuka

Yorizuka commented May 17, 2024

I'm just here to comment that this would be a really useful feature! Having a non-Python way of doing this would be ideal.
I am willing to put out a small bounty on this if that will motivate someone to do it!

I am willing to pay a minimum of 100 USD for a working solution I can apply. (Sorry if that's not much, I am just a hobbyist paying out of my own pocket; I hope it's not an insultingly small amount.)

@christianazinn
Contributor

+1 on this, would be useful. I can try my hand at implementing something in a little while, but I'm new to C++ and can't guarantee anything.

@ngxson
Collaborator Author

ngxson commented May 22, 2024

@jukofyork Thanks for the info. Unfortunately I'm not very good at math, so I struggle to understand it. It would be nice if someone could implement something equivalent to PCA in C++.

@christianazinn FYI, this example from eval-callback can be a good start if you want to give it a try.

@jukofyork
Contributor

jukofyork commented May 22, 2024

@jukofyork Thanks for the info. Unfortunately I'm not very good at math, so I struggle to understand it. It would be nice if someone could implement something equivalent to PCA in C++.

Power Iteration is one of the simplest algorithms imaginable:

  1. Pick a random vector, v.
  2. Divide all the elements of v by the square root of their sum of squared values (i.e. normalise v to unit L2 norm) to create v_norm.
  3. Multiply this vector by the target matrix, v = W . v_norm.
  4. Goto step 2.
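
Equivalently, in equation form, steps 2-4 just iterate:

$$v_{k+1} = \frac{W\, v_k}{\lVert W\, v_k \rVert_2}$$

and $v_k$ converges to the eigenvector of $W$ with the largest-magnitude eigenvalue.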

The only reason for step 2 is to stop it blowing up and overflowing the machine precision.

If the matrix is "nice" then this will converge quite quickly: you'll find the vector hardly changes at all, and you have found your principal eigenvector (or equivalently, the first principal component).

If you try this on a 2D example you'll see how it works: it looks like the needle of a compass settling towards north after you jolt it. Step 2 above makes the tip of the needle always touch the edge of the circle with radius 1 (and in 3D+, the shell of a unit (hyper-)sphere). If you don't bother with step 2, it will still end up pointing in the right direction, but will be composed of huge numbers.

The 2D example also shows how the algorithm can struggle with "non-nice" matrices: if the 1st and 2nd eigenvectors are at right angles to each other it will converge quickly, but if they point in approximately the same direction, it will take much longer. In the compass analogy, this would be like having a strong magnet nearby that is almost, but not quite, north.

If the matrix isn't square then you need to use another technique called "Singular Value Decomposition" (SVD), which is more involved, but not hard to implement if all you care about is getting a very low-rank approximation (you can just use gradient descent, as in the sketch above).

EDIT: Here's a nice video showing it in 2D: https://www.youtube.com/watch?v=wRhYfAObXzY and 3D: https://www.youtube.com/watch?v=AtmpkYYSMk4

(Those aren't "nice" matrices, which is why it struggles: the large off-diagonal values!)

@ngxson
Collaborator Author

ngxson commented May 24, 2024

@jukofyork Thanks for the direction. It's not the easiest thing for me to understand, but I'll give it a try.

@Yorizuka @christianazinn I have a draft PR just to get some direction. I'm not doing this for the bounty, so if someone can help me out, they can take the bounty if they want. Thank you.

@christianazinn
Contributor

I have a draft PR just to get some direction. I'm not doing this for the bounty, so if someone can help me out, they can take the bounty if they want. Thank you.

Thanks, will move discussion there. Not doing this for the bounty either. Currently have PCA working-ish, but I'm stuck getting vector normalization to work (ggml_norm isn't working how I expect it to, and there are no docs :/).
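
For what it's worth, ggml_norm appears to compute a layernorm-style normalization (zero mean and unit variance along each row), not an L2 normalization, which might be the mismatch. A hedged sketch of building unit-L2 row normalization out of ggml primitives, assuming ggml_div can broadcast its second argument (as in recent ggml versions):

```cpp
#include "ggml.h"

// Normalizes each row of v to unit L2 norm: v / sqrt(sum(v^2)).
// Note: ggml_norm is layernorm-style (zero mean / unit variance), so the
// L2 norm is built here from ggml_sqr / ggml_sum_rows / ggml_sqrt / ggml_div.
static struct ggml_tensor * l2_normalize_rows(struct ggml_context * ctx, struct ggml_tensor * v) {
    struct ggml_tensor * sq  = ggml_sqr(ctx, v);       // element-wise v^2
    struct ggml_tensor * ssq = ggml_sum_rows(ctx, sq); // sum of squares per row
    struct ggml_tensor * nrm = ggml_sqrt(ctx, ssq);    // L2 norm per row
    return ggml_div(ctx, v, nrm);                      // broadcast divide each row by its norm
}
```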

@jukofyork
Contributor

Did you manage to get any further with this?

@ngxson
Collaborator Author

ngxson commented Jun 11, 2024

@jukofyork FYI, I've already got a working version with good performance. Here is the last result: #7514 (comment)
