Attention Based image captioning
Updated Dec 27, 2024 - Python
Image caption generation using a Swin Transformer encoder and a GRU attention mechanism
ImgCap is an image captioning model designed to automatically generate descriptive captions for images. It comes in two versions: a CNN + LSTM model and a CNN + LSTM + Attention model.
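The attention variant mentioned above typically scores each image region against the decoder's current hidden state at every step, so the caption generator can focus on different parts of the image per word. A minimal NumPy sketch of one such scoring scheme (Bahdanau-style additive attention; the weight names `W1`, `W2`, `v` and all dimensions are illustrative, not from any of the listed repos):

```python
import numpy as np

def additive_attention(features, hidden, W1, W2, v):
    """Bahdanau-style additive attention over image region features.

    features: (num_regions, feat_dim) CNN feature vectors, one per region
    hidden:   (hid_dim,) current decoder hidden state
    Returns (context, weights): the attended feature vector and the
    softmax weights over regions.
    """
    # score each region i as v . tanh(W1 f_i + W2 h)
    scores = np.tanh(features @ W1 + hidden @ W2) @ v   # (num_regions,)
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                            # softmax over regions
    context = weights @ features                        # (feat_dim,)
    return context, weights

# toy shapes: a 7x7 feature map flattened to 49 regions
rng = np.random.default_rng(0)
regions, feat_dim, hid_dim, att_dim = 49, 16, 8, 12
features = rng.normal(size=(regions, feat_dim))
hidden = rng.normal(size=hid_dim)
W1 = rng.normal(size=(feat_dim, att_dim))
W2 = rng.normal(size=(hid_dim, att_dim))
v = rng.normal(size=att_dim)
context, weights = additive_attention(features, hidden, W1, W2, v)
```

In a full decoder, `context` would be concatenated with the word embedding as the LSTM/GRU input at each step; the weights also make the model's focus inspectable as a heatmap over the image.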
Image captioning model with a ResNet-50 encoder and an LSTM decoder
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Karpathy Splits json files for image captioning
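The Karpathy split files referenced above are commonly distributed as a single JSON per dataset listing every image with its assigned split and its captions. A small loader sketch, assuming the usual layout (`{"images": [{"split": ..., "filename": ..., "sentences": [{"raw": ...}, ...]}, ...]}`; verify against the actual file before relying on it):

```python
import json
import tempfile

def split_captions(karpathy_json_path):
    """Group (filename, caption) pairs by split from a Karpathy-format JSON."""
    with open(karpathy_json_path) as f:
        data = json.load(f)
    splits = {"train": [], "val": [], "test": []}
    for img in data["images"]:
        split = img["split"]
        if split == "restval":   # the COCO split folds "restval" into train
            split = "train"
        for sent in img["sentences"]:
            splits[split].append((img["filename"], sent["raw"]))
    return splits

# tiny demo with the assumed schema
sample = {"images": [
    {"split": "train", "filename": "a.jpg",
     "sentences": [{"raw": "a dog runs"}, {"raw": "a dog in grass"}]},
    {"split": "test", "filename": "b.jpg",
     "sentences": [{"raw": "two kids play"}]},
]}
with tempfile.NamedTemporaryFile("w", suffix=".json", delete=False) as f:
    json.dump(sample, f)
splits = split_captions(f.name)
```

Each image carries multiple reference captions, so the per-split lists are caption-level, which is the granularity most captioning training loops expect.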
PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k
Visual Elocution Synthesis
Download flickr8k, flickr30k image caption datasets
Implementation of CLIP from OpenAI using pretrained Image and Text Encoders.
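The CLIP implementations listed above train image and text encoders jointly with a symmetric contrastive objective: in a batch of matched pairs, each image should score highest against its own caption and vice versa. A NumPy sketch of that loss (a simplified version of the objective in Radford et al., 2021; the fixed `temperature` stands in for CLIP's learned logit scale):

```python
import numpy as np

def clip_symmetric_loss(img_emb, txt_emb, temperature=0.07):
    """Symmetric InfoNCE loss over a batch of matched image/text embeddings.

    img_emb, txt_emb: (batch, dim) arrays; row i of each is a matched pair.
    """
    # L2-normalize so the dot product is a cosine similarity
    img = img_emb / np.linalg.norm(img_emb, axis=1, keepdims=True)
    txt = txt_emb / np.linalg.norm(txt_emb, axis=1, keepdims=True)
    logits = img @ txt.T / temperature          # (batch, batch) similarities

    def xent_diag(l):
        # cross-entropy where the correct class for row i is column i
        l = l - l.max(axis=1, keepdims=True)
        logp = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(logp))

    # average the image-to-text and text-to-image directions
    return 0.5 * (xent_diag(logits) + xent_diag(logits.T))

# perfectly matched orthogonal embeddings should give near-zero loss
loss_matched = clip_symmetric_loss(np.eye(4), np.eye(4))
```

Averaging both directions is what makes the objective symmetric: the same logit matrix is read row-wise for images retrieving text and column-wise for text retrieving images.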
"Flickr30k_image_captioning" is a project or repository focused on image captioning using the Flickr30k dataset. The project aims to develop and showcase algorithms and models that generate descriptive captions for images.
Processing data produced by flickr30k_entities to use as regional descriptions for the densecap model
A deep learning model that generates descriptions of an image.