
Llava 1.6: server not decoding images, but works via CLI #5515

Closed
tctrautman opened this issue Feb 15, 2024 · 6 comments

Comments

@tctrautman

tctrautman commented Feb 15, 2024

First, let me say that I really appreciate all the work you guys are putting into llama.cpp -- it's really impressive.

I'm testing out yesterday's release of Llava 1.6 (thanks so much for working on that tricky PR, @cmp-nct). It works well via the CLI, but when I run it via the server, I see the following error when it receives a request:

clip_image_load_from_bytes: failed to decode image bytes
slot 0 - failed to load image [id: 12]
task 1 - error: internal_error

How I'm running via CLI (works)

./llava-cli -m ./models/llava-1-6/mistral-7b-q_5_k.gguf --mmproj ./models/llava-1-6/mmproj-mistral7b-f16.gguf --image ./media/images/ginsberg.png -p "Who is this?" --temp 0.1

How I'm running via Server (doesn't work)

To start the server:

./server -m ./models/llava-1-6/mistral-7b-q_5_k.gguf --mmproj ./models/llava-1-6/mmproj-mistral7b-f16.gguf --host 127.0.0.1 --port 8080

The request I'm sending:

curl --request POST \
  --url http://localhost:8080/completion \
  --header 'Content-Type: application/json' \
  --data '{
	"prompt": "USER:[img-12]Who is this?.\nASSISTANT:",
	"temperature": 0.1,
	"image_data": [
		{
			"data": <BASE64_IMG>,
			"id": 12
		}
	]
}'

BASE64_IMG is the base64 encoding of the image below:

[image: ginsberg]

Details

  • My system: 2021 M1 Max MBP w/ 64 GB of RAM, running Sonoma 14.3
  • Llava 1.6 Model files: both from this HF repo
    • Model: mistral-7b-q_5_k.gguf
    • Mmproj: mmproj-mistral7b-f16.gguf
  • Version of llama.cpp: I'm on the most recent commit as of this issue, commit 4524290e87b8e107cc2b56e1251751546f4b9051
@tctrautman
Author

Seems possibly related to #5514, but that's pure speculation

@cmp-nct
Contributor

cmp-nct commented Feb 15, 2024

Yes, it's the same issue; I responded in that thread with what needs to be done.

@tctrautman
Author

Thanks for the thorough explanation in that issue, @cmp-nct. I'll close this issue out since it's a duplicate of that one.

@tctrautman
Author

Opening this back up.

After watching the conversation develop in the other ticket, it looks like that issue stems from differences in the system prompt between the CLI and the server.

But the error here appears while the server is loading the base64 image into clip. Also, AFAICT this error doesn't touch process_images at all, but instead comes from launch_slot_with_data:

https://github.com/ggerganov/llama.cpp/blob/4524290e87b8e107cc2b56e1251751546f4b9051/examples/server/server.cpp#L686-L698

I haven't made much progress with debugging (unfamiliar with C++), but I'll see if I can dive in a bit deeper over the next day or two.
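
(A minimal sanity check, not from the original thread: decode the base64 payload locally and confirm it is still a valid image before suspecting the server. The img.b64 file name is just a placeholder for wherever the payload string is saved, and on older macOS the decode flag may be -D rather than --decode.)

# Decode the string that goes into "image_data" and inspect the result.
# If `file` doesn't report image data (e.g. "PNG image data"), the payload
# itself is broken and clip_image_load_from_bytes is expected to fail.
base64 --decode < img.b64 > /tmp/decoded.png
file /tmp/decoded.png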

@cjpais
Contributor

cjpais commented Feb 17, 2024

Is the base64 you're putting in valid? At first I used an online converter and got the same result as you. Then I pushed the image through the ./server UI, copied the b64 from there, and sent it from the command line without issue.

This gist is what I used: https://gist.github.com/cjpais/6b7b620d29b4a8e6ca81eb5b87371bb5

@tctrautman
Author

@cjpais that was it -- thank you! A silly mistake on my part -- I was sending the entire URL instead of just the base64 data. In case others stumble upon this issue, this might be helpful reading.
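
(For reference, a minimal sketch of producing a raw base64 string and sending it, built from the commands earlier in this thread; the key point is that "data" must contain only the base64 characters, with no data: URL prefix. The tr call and the exact base64 flags are assumptions and may differ between GNU coreutils and macOS.)

# Encode the image to raw base64 (strip any line wrapping) and embed it
# directly in the JSON body as the "data" field.
IMG_B64=$(base64 < ./media/images/ginsberg.png | tr -d '\n')

curl --request POST \
  --url http://localhost:8080/completion \
  --header 'Content-Type: application/json' \
  --data "{
    \"prompt\": \"USER:[img-12]Who is this?\\nASSISTANT:\",
    \"temperature\": 0.1,
    \"image_data\": [{ \"data\": \"$IMG_B64\", \"id\": 12 }]
  }"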
