Image Captioning Project

Introduction

The repository contains a neural network, which can automatically generate captions from images.

The solution architecture consists of:

2. Decoder, which is a sequential neural network consisting of LSTM units, which translates the feature vector into a sequence of tokens:

These are some of the outputs give by the network using the COCO dataset:

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
assets		assets
0_Dataset.ipynb		0_Dataset.ipynb
1_Preliminaries.ipynb		1_Preliminaries.ipynb
2_Training.html		2_Training.html
2_Training.ipynb		2_Training.ipynb
3_Inference.html		3_Inference.html
3_Inference.ipynb		3_Inference.ipynb
4_Zip Your Project Files and Submit.ipynb		4_Zip Your Project Files and Submit.ipynb
README.md		README.md
data_loader.py		data_loader.py
filelist.txt		filelist.txt
model.py		model.py
vocab.pkl		vocab.pkl
vocabulary.py		vocabulary.py
workspace_utils.py		workspace_utils.py