A benchmark for evaluating the capabilities of large vision-language models (LVLMs)
benchmark
multimodal
pre-training
reformulation
embodied-ai
instruction-following
gpt4
in-context-learning
large-language-models
llm
instruction-tuning
large-vision-language-models
visual-chain-of-thought
multimodal-chain-of-thought
Updated Nov 17, 2023 - Python