Missing LLaVA 1.6 support for handling custom templates with respect to the chosen LLM #1301
Started here #1147 but got sidetracked.

Would love to see full support for LLaVA 1.6 in this project.

Definitely. There will be new, ground-breaking LLaVA models coming this month, fine-tuned on Llama-3. It would be great to run them quantized in GGUF using this cpp-python library.

Coming soon in #1147 , already added llava1.6, obsidian, and moondream support using the new system.
We appreciate the addition of LLaVA 1.6 34B support. It would be great to also have support for smaller 7B quants and projectors, or at least a single cjpais/llava-1.6-mistral-7b-gguf. That would be truly awesome!
@Vinventive I wasn't aware there were differences in the chat formats. Do you mind sharing a link and I'll add that right away, cheers!

Here is the link: ggml-org/llama.cpp#5267
Really struggling to run LLaVA with CUDA instead of cuBLAS, and I was wondering if it's just an isolated issue. I've seen another open issue where people are running into similar problems (#1393). Maybe we're doing something incorrectly, or the README is missing info/steps on how to run it on Windows 64-bit?
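For what it's worth, a common way to get a CUDA-enabled build of llama-cpp-python is to force a source build with the CUDA backend flag set. This is a sketch, not an official recipe: it assumes the CUDA Toolkit and a C++ compiler are installed, and the exact flag name depends on the library version (older releases used `-DLLAMA_CUBLAS=on`, newer ones use `-DGGML_CUDA=on`):

```shell
# Linux/macOS shells: set CMAKE_ARGS inline before pip.
CMAKE_ARGS="-DGGML_CUDA=on" pip install --force-reinstall --no-cache-dir llama-cpp-python

# Windows PowerShell: set the environment variable first, then install.
#   $env:CMAKE_ARGS = "-DGGML_CUDA=on"
#   pip install --force-reinstall --no-cache-dir llama-cpp-python
```

`--no-cache-dir` ensures pip does not reuse a previously built CPU-only wheel.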
I'm using the `Llava15ChatHandler`, but I don't see any `Llava16ChatHandler` when looking at the source code. Moreover, the handler uses hard-coded templating instead of supporting the custom in-model template given by the `tokenizer.chat_template` metadata property, as used e.g. by Nous Hermes 2 Yi 34B (Link), which is quite different from the hard-coded one. Any plans for that? Is LLaVA 1.6 really supported, or should I fall back to the parent project?

Update: Using the current codebase, I can get fairly okay results during inference, but I'm not sure whether there might be some regression; I need to check against the original and compare.
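To illustrate why a hard-coded template can break a fine-tune like Nous Hermes 2: the two prompt formats differ substantially. Below is a minimal sketch (not llama-cpp-python's actual code; the function names are hypothetical) contrasting the roughly LLaVA 1.5 style prompt with the ChatML style that Nous Hermes 2's `tokenizer.chat_template` metadata describes:

```python
# Hypothetical helpers contrasting two chat formats; not library code.

def format_llava15(messages):
    """Roughly the LLaVA 1.5 style: plain 'USER: ... ASSISTANT:' turns."""
    parts = []
    for m in messages:
        if m["role"] == "system":
            parts.append(m["content"])
        elif m["role"] == "user":
            parts.append(f"USER: {m['content']}")
        elif m["role"] == "assistant":
            parts.append(f"ASSISTANT: {m['content']}")
    # Trailing 'ASSISTANT:' prompts the model to generate its reply.
    return "\n".join(parts) + "\nASSISTANT:"

def format_chatml(messages):
    """ChatML, as used by Nous Hermes 2: <|im_start|>role ... <|im_end|>."""
    out = "".join(
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in messages
    )
    return out + "<|im_start|>assistant\n"

msgs = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Describe this image."},
]
print(format_llava15(msgs))
print(format_chatml(msgs))
```

A model fine-tuned on one of these formats tends to degrade noticeably when prompted with the other, which is why reading the template from the GGUF metadata rather than hard-coding it matters here.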