# gqa

Here are 12 public repositories matching this topic...

bknyaz / sgg

Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero- and few-shot generalization.

computer-vision deep-learning graph pytorch generative-adversarial-network gan scene-graph message-passing paper-implementations visual-genome scene-graph-generation gqa augmentations wandb

Updated Jun 18, 2023
Jupyter Notebook

phiyodr / vqaloader

PyTorch DataLoader for many VQA datasets

pytorch vqa dataloader gqa textvqa vqav2

Updated Jan 10, 2023
Python

DigitalPhonetics / Intrinsic-Subgraph-Generation-for-VQA

Predicting a subgraph alongside the answer in a graph-based VQA model

vqa discrete sampling subgraph interpretability masking visual-question-answering explainable-ai graph-neural-networks gqa scene-graphs interpreatable-ai subgraph-extraction graph-based-vqa

Updated Dec 12, 2024
Python

leaderj1001 / Vision-Language

Vision-Language: solving the GQA (Visual Reasoning in the Real World) dataset.

vqa vision-language gqa

Updated May 9, 2019
Python

ExplainableML / ZS-A2T

[GCPR 2023] Zero-shot Translation of Attention Patterns in VQA Models to Natural Language

vqa vlm opt gqa llm zs-a2t gqa-rex vqa-x attention-rollout gcpr-2023

Updated Jul 28, 2023

sahasourav17 / IntelliAnswer

A RAG-based question-answering system that processes user queries using local documents. It extracts relevant information to answer questions, falling back to a large language model when local sources are insufficient, ensuring accurate and contextual responses.

openai rag gqa llm generative-qa chromadb ollama local-rag

Updated Dec 20, 2024
Python

ycchen218 / VisionQA-Llama2-OWLViT

A multimodal model designed for the Visual Question Answering (VQA) task. It integrates the Llama2 13B, OWL-ViT, and YOLOv8 models.

deep-learning vqa llama gqa yolov8 owl-vit

Updated Jun 13, 2024
Python

alexmirrington / honours-thesis

LaTeX files for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"

deep-learning pytorch vqa scene-graph visual-reasoning visual-question-answering graph-attention-networks compositional-attention-networks gqa

Updated Aug 16, 2021
TeX

eltoto1219 / vltk

A toolkit for vision-language processing to support the increasing popularity of multi-modal transformer-based models

extraction transformers vqa frcnn roipooling gqa

Updated Oct 30, 2022
HTML

alexmirrington / gat-vqa

Source code for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"

deep-learning pytorch vqa scene-graph visual-reasoning visual-question-answering graph-attention-networks compositional-attention-networks gqa

Updated Jan 29, 2021
Python

NMPoole / CS5014-MLVisualAttributes

Case study of multi-layer perceptron and random forest techniques as applied to a subset of the GQA dataset.

python machine-learning sklearn data-preprocessing multi-layer-perceptron model-evaluation model-training random-forest-classifier gqa

Updated Jun 27, 2022
Python

sushantkumar23 / baby-llama

Simple Llama architecture LLM in PyTorch

transformers pytorch llama attention-mechanism gqa llama2

Updated May 27, 2024
Jupyter Notebook
