Model Zoo

SQ-LLaVA-v1.0

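| Version | Size | Data | Checkpoint | VQAv2 | GQA | VizWiz | SQA | TextVQA | POPE | MM-Bench | MM-Bench-CN | LLaVA-Bench-Wild | MM-Vet |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SQ-LLaVA | 7B | ShareGPT4V | ZachSun/sqllava-7B | 80.3 | 63.7 | 55.3 | 70.5 | 60.5 | 87.2 | 66.6 | 60.0 | 74.3 | 37.6 |
| SQ-LLaVA | 13B | ShareGPT4V | ZachSun/sqllava-13B | 81.3 | 65.0 | 58.2 | 71.5 | 61.9 | 87.4 | 68.5 | 62.5 | 80.7 | 39.7 |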


SQ-LLaVA achieves state-of-the-art performance on 9 out of 10 tasks compared with other open-ended models.

We provide the LoRA weights for both the LLM and the ViT. Download them first, then use the following settings to load the correct model.

`--model-path` is the checkpoint directory that includes the adapter and non-adapter weights, `--model-base` is the pre-trained LLM weights, and `--lora_pretrain` is the ViT LoRA weights:

    --model-path ./checkpoints/path/to/sqllava-lora-7b \
    --model-base Lin-Chen/ShareGPT4V-7B_Pretrained_vit-large336-l12_vicuna-7b-v1.5 \
    --lora_pretrain ./checkpoints/path/to/ckpt/Vit-lora
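
If you launch the model from Python instead of a shell script, the three flags above can be assembled as an argument list. This is a minimal sketch: `build_load_args` is a hypothetical helper and not part of this repository, and the two local paths are the placeholders from the settings above, so replace them with the location of your downloaded weights.

```python
import shlex

# Hub repo id copied from the settings above.
MODEL_BASE = "Lin-Chen/ShareGPT4V-7B_Pretrained_vit-large336-l12_vicuna-7b-v1.5"


def build_load_args(model_path, lora_pretrain, model_base=MODEL_BASE):
    """Return the three loading flags as an argv-style list (hypothetical helper)."""
    return [
        "--model-path", model_path,        # adapter and non-adapter weights
        "--model-base", model_base,        # pre-trained LLM weights
        "--lora_pretrain", lora_pretrain,  # ViT LoRA weights
    ]


# Placeholder paths taken verbatim from the settings above.
args = build_load_args(
    model_path="./checkpoints/path/to/sqllava-lora-7b",
    lora_pretrain="./checkpoints/path/to/ckpt/Vit-lora",
)

# shlex.join quotes each item safely for copy-pasting into a shell.
print(shlex.join(args))
```

The resulting list can be appended to whichever entry-point script you run, for example through `subprocess.run([sys.executable, script, *args])`, where `script` stands for that entry point and is not specified here.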