embedding: adjust `n_ubatch` value, print error on insufficient `n_batch` value #6296

mscheong01 · 2024-03-25T11:24:45Z

updates to the embedding example based on discussion from #6193

assign n_batch value to u_batch
output error on insufficient batch size

examples/embedding/embedding.cpp

ngxson · 2024-03-25T21:36:35Z

examples/embedding/embedding.cpp

@@ -114,7 +116,9 @@ int main(int argc, char ** argv) {
    for (const auto & prompt : prompts) {
        auto inp = ::llama_tokenize(ctx, prompt, true, false);
        if (inp.size() > n_batch) {
-            inp.resize(n_batch);
+            fprintf(stderr, "%s: error: number of tokens in input line (%lld) exceeds batch size (%lld), increase batch size and re-run\n",
+                    __func__, (long long int) inp.size(), (long long int) n_batch);


Here you can use %ld instead of %lld, no need to cast the type then

Suggested change

__func__, (long long int) inp.size(), (long long int) n_batch);

__func__, inp.size(), n_batch);

applied & rolled back due to build failure.
IIRC, this is why I used %lld with casting in #6193. Although I tried your suggestion just in case 😉.

We normally use PRIu64 / PRId64 to print 64-bit integers. Alternatively, in this case just %d and cast to (int) is fine

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

…tch`-overflow' of github.com:mscheong01/llama.cpp into embedding-assign-`n_ubatch`-value,-print-error-on-`n_batch`-overflow

This reverts commit ea753ed.

* embedding: assign `n_ubatch` value, print error on `n_batch` overflow * Update examples/embedding/embedding.cpp Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com> * use %ld instead of %lld * Revert "use %ld instead of %lld" This reverts commit ea753ed. --------- Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

embedding: assign n_ubatch value, print error on n_batch overflow

6e27406

ngxson requested changes Mar 25, 2024

View reviewed changes

mscheong01 and others added 4 commits March 26, 2024 09:25

Update examples/embedding/embedding.cpp

d054109

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

use %ld instead of %lld

ea753ed

Merge branch 'embedding-assign-n_ubatch-value,-print-error-on-`n_ba…

544b447

…tch`-overflow' of github.com:mscheong01/llama.cpp into embedding-assign-`n_ubatch`-value,-print-error-on-`n_batch`-overflow

Revert "use %ld instead of %lld"

2258098

This reverts commit ea753ed.

ggerganov approved these changes Mar 26, 2024

View reviewed changes

ggerganov merged commit deb7240 into ggerganov:master Mar 26, 2024
55 of 56 checks passed

cebtenzzre mentioned this pull request May 28, 2024

llamamodel: fix embedding crash for >512 tokens after #2310 nomic-ai/gpt4all#2383

Merged

thxCode mentioned this pull request Aug 6, 2024

fix: crash on bge-m3 embedding model #8883

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

embedding: adjust `n_ubatch` value, print error on insufficient `n_batch` value #6296

embedding: adjust `n_ubatch` value, print error on insufficient `n_batch` value #6296

mscheong01 commented Mar 25, 2024

ngxson Mar 25, 2024

mscheong01 Mar 26, 2024

ggerganov Mar 26, 2024

	__func__, (long long int) inp.size(), (long long int) n_batch);
	__func__, inp.size(), n_batch);

embedding: adjust n_ubatch value, print error on insufficient n_batch value #6296

embedding: adjust n_ubatch value, print error on insufficient n_batch value #6296

Conversation

mscheong01 commented Mar 25, 2024

ngxson Mar 25, 2024

Choose a reason for hiding this comment

mscheong01 Mar 26, 2024

Choose a reason for hiding this comment

ggerganov Mar 26, 2024

Choose a reason for hiding this comment

embedding: adjust `n_ubatch` value, print error on insufficient `n_batch` value #6296

embedding: adjust `n_ubatch` value, print error on insufficient `n_batch` value #6296