M3-VQA is a novel pipeline for multilingual and multimodal biomedical VQA. It leverages translation for multilingual inputs, retrieval-augmented generation (RAG) for knowledge grounding, and in-context learning (ICL) with Chain-of-Thought prompting for accurate reasoning.
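At a high level, the pipeline translates the question, retrieves supporting knowledge, and prompts the model with in-context examples and Chain-of-Thought instructions. The sketch below illustrates these stages with hypothetical stub functions; it is not the repository's actual API.

```python
# Illustrative sketch of the M3-VQA stages; the function names and bodies are
# placeholder stubs, not the code used in this repository.

def translate_to_english(question: str) -> str:
    # Stage 1: translate a multilingual question into English (e.g., via Google Translate).
    return question  # stub

def retrieve_context(question: str, k: int = 3) -> list[str]:
    # Stage 2: RAG - fetch the k most relevant knowledge snippets (e.g., from a FAISS index).
    return []  # stub

def build_cot_prompt(question: str, context: list[str], examples: list[str]) -> str:
    # Stage 3: ICL + Chain-of-Thought - prepend in-context examples and retrieved
    # knowledge, then ask the model to reason step by step before answering.
    return "\n".join(examples + context + [question, "Let's think step by step."])

if __name__ == "__main__":
    q = translate_to_english("¿Qué anomalía muestra la radiografía?")
    prompt = build_cot_prompt(q, retrieve_context(q), examples=[])
    print(prompt)
```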
- Get a free API key for Google Translate and configure it locally; for details, refer to https://cloud.google.com/translate/docs/reference/rest/
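Once the service-account JSON is in place, you can optionally sanity-check the credentials with the google-cloud-translate client. This is only an assumed verification snippet (it requires the google-cloud-translate package and GOOGLE_APPLICATION_CREDENTIALS to be set); the pipeline's own translation code may differ.

```python
# Optional: verify Google Cloud Translation credentials.
# Assumes `pip install google-cloud-translate` and that
# GOOGLE_APPLICATION_CREDENTIALS points at your service-account JSON.
from google.cloud import translate_v2 as translate

client = translate.Client()
result = client.translate("¿Dónde está la lesión?", target_language="en")
print(result["translatedText"])
```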
- Clone the repo
git clone https://github.com/AmuroEita/M3-VQA.git && cd M3-VQA
- Use Git LFS to pull the FAISS index files
git lfs pull
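The pulled .index files can be opened directly with FAISS. Below is a minimal sketch of loading an index and running a top-k search; the file name and the random query vector are placeholders, not the repository's actual retrieval code.

```python
# Load a FAISS index and run a top-k nearest-neighbour search.
# "example.index" is a placeholder file name.
import faiss
import numpy as np

index = faiss.read_index("example.index")
query = np.random.rand(1, index.d).astype("float32")  # index.d = stored embedding dimension
distances, ids = index.search(query, 3)               # top-3 neighbours
print(ids[0], distances[0])
```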
- Install required Python packages
pip install -r requirements.txt
- Enter your GPT API key in utils/GPT-API.txt
echo "${Your GPT API Key}" > utils/GPT-API.txt
- Prepare the datasets
cd data && python download_data.py
- Download the model via Hugging Face
huggingface-cli login
huggingface-cli download --resume-download unsloth/Llama-3.2-11B-Vision-Instruct --local-dir Llama-3.2-11B-Vision-Instruct
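Once downloaded, the checkpoint can be loaded from the local directory with transformers. A minimal sketch, assuming a recent transformers release with Mllama support and accelerate installed for device_map; this is not necessarily how the repository's inference code loads the model.

```python
# Load the locally downloaded Llama-3.2-11B-Vision-Instruct checkpoint.
# Requires a transformers version with Mllama support and accelerate for device_map.
import torch
from transformers import MllamaForConditionalGeneration, AutoProcessor

model_dir = "Llama-3.2-11B-Vision-Instruct"
model = MllamaForConditionalGeneration.from_pretrained(
    model_dir, torch_dtype=torch.bfloat16, device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_dir)
print(model.config.model_type)  # expect "mllama"
```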
Use this mode to provide a specific question for Med-VQA to answer. The following example shows how to test the 11th question in the israel_local_processed.tsv dataset; the intermediate steps and the result are printed directly to the command line.
export GOOGLE_APPLICATION_CREDENTIALS="/your_path_to/google_translate.json" && python3 demo.py --dataset data/israel_local_processed.tsv --question_idx 11
Run inference on the entire dataset to compute accuracy. Results will be saved in the results folder for further analysis.
export GOOGLE_APPLICATION_CREDENTIALS="/your_path_to/google_translate.json" && python3 inference.py