GitHub - spoluan/flickr30k_image_captioning: "Flickr30k_image_captioning" is a project or repository focused on image captioning using the Flickr30k dataset. The project aims to develop and showcase algorithms and models that generate descriptive captions for images.

Project Descriptions

Predicted captions

The Flickr30k dataset has emerged as a popular benchmark for image captioning tasks. This dataset comprises over 31,000 images with a total of approximately 158,000 captions. In this repository, we will be using the Flickr30k dataset to train a model for generating captions for images. Additionally, this dataset has been augmented with the Flickr30k Entities, which includes over 244,000 coreference chains that link mentions of the same entities across different captions for the same image. This extension is intended to enable better natural language understanding of the images and is expected to lead to more informative and accurate captions. To get started, you can download the datasets from https://www.kaggle.com/datasets/hsankesara/flickr-image-dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
datasets_dict.pickle		datasets_dict.pickle
download.png		download.png
flickr30k_image_captioning__.ipynb		flickr30k_image_captioning__.ipynb
vocabulary.txt		vocabulary.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

spoluan/flickr30k_image_captioning

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages