Visual cognition in multimodal large language models Accompanying code for arxiv.org/abs/2311.16093 This repository includes: The code used to evaluate the models (in /eval) The images the models were evaluated on (in /eval/images) Analysis notebooks for each experiment