image_captioning_flickr

In this project, we worked with both the flickr_8k and flickr_30k datasets, but we ran into storage and runtime complications with flickr_30k.

We used an encoder-decoder model to build our image caption generator, with a CNN as the encoder and an LSTM as the decoder.
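
The encoder-decoder flow can be illustrated with a minimal sketch. This is not the code from the notebooks: `greedy_caption`, `toy_encode`, `toy_decode_step`, and the `<start>`/`<end>` tokens are hypothetical names chosen for illustration, the two toy functions only stand in for a trained CNN and LSTM, and greedy decoding is an assumed decoding strategy that the project may or may not use.

```python
from typing import Callable, Dict, List, Tuple

START, END = "<start>", "<end>"  # assumed special tokens


def greedy_caption(
    image: List[float],
    encode: Callable[[List[float]], List[float]],
    decode_step: Callable[[str, List[float]], Tuple[Dict[str, float], List[float]]],
    max_len: int = 20,
) -> List[str]:
    """Generate a caption by picking the highest-scoring word at each step."""
    # Encoder: turn the image into a feature vector (a CNN in the real model).
    state = encode(image)
    token = START
    caption: List[str] = []
    for _ in range(max_len):
        # Decoder: one step maps (previous word, state) to word scores
        # and an updated state (an LSTM cell in the real model).
        scores, state = decode_step(token, state)
        token = max(scores, key=scores.get)
        if token == END:
            break
        caption.append(token)
    return caption


def toy_encode(image: List[float]) -> List[float]:
    # Stand-in for a CNN: the mean pixel value as a 1-element feature vector.
    return [sum(image) / len(image)]


def toy_decode_step(token: str, state: List[float]):
    # Stand-in for an LSTM step: a fixed transition table that ignores state.
    table = {START: "a", "a": "dog", "dog": "runs", "runs": END}
    nxt = table.get(token, END)
    scores = {w: (1.0 if w == nxt else 0.0) for w in ["a", "dog", "runs", END]}
    return scores, state


print(greedy_caption([0.1, 0.5, 0.9], toy_encode, toy_decode_step))
```

In a real model, `encode` would be a pretrained CNN with its classification head removed, and `decode_step` would run one LSTM step followed by a softmax over the vocabulary.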

Datasets can be found here:

flickr_8k: https://www.kaggle.com/datasets/waelboussbat/flickr8ksau

flickr_30k: https://www.kaggle.com/datasets/hsankesara/flickr-image-dataset

More details can be found in the report and/or presentation.

Repository files:

Final Project Report.pdf
Image Captioning.pptx
README.md
image_captioning_flickr30k.ipynb
image_captioning_flickr8k.ipynb

nouranHisham/image_captioning_flickr
