Skip to content

PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)

Notifications You must be signed in to change notification settings

pleaseconnectwifi/DANCE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles

Shuquan Ye2,Yujia Xie1,Dongdong Chen1, Yichong Xu1, Lu Yuan1, Chenguang Zhu1, Jing Liao2

1Microsoft, 2City University of Hong Kong

This is the PyTorch code of the DANCE [paper]. The code is on PyTorch 1.11. Pre-training with ours code requires 4 nodes each with 8 A100 GPUs.

Catalog:

  • Code for DANCE-augmented Pre-training

  • Code for DANCE-augmented Fine-tuning

  • Code for Image-Text Retrieval, OK-VQA

  • Download of Pre-trained and Fine-tuned Checkpoints

BibTeX


About

PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published