Image Captioning

This is an excercise of Image Captioning, as a part of Udacity Comuputer Vision Nanodegree Program.

What is Image Captioning?

Image captioning is to attach a short descriptiong sentence to a image. This tries to automatically generate the sentence by loading images.

Dataset Used

Used COCO dataset (http://cocodataset.org)

Network Structure

The network is as below. This is an encoder-decoder structure. Encoder part is a pre-trained CNN(ResNet), and provides an embedded vector that decribes the features of images. Decoder part is RNN (LSTM) and transforms the features into word vector.

Some Results

This simple network surprisingly works well.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
images		images
0_Dataset.ipynb		0_Dataset.ipynb
1_Preliminaries.ipynb		1_Preliminaries.ipynb
2_Training.ipynb		2_Training.ipynb
3_Inference.ipynb		3_Inference.ipynb
4_Zip Your Project Files and Submit.ipynb		4_Zip Your Project Files and Submit.ipynb
README.md		README.md
data_loader.py		data_loader.py
filelist.txt		filelist.txt
model.py		model.py
vocabulary.py		vocabulary.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Captioning

What is Image Captioning?

Dataset Used

Network Structure

Some Results

About

Releases

Packages

Languages

waterwheel31/P2_Image_Captioning2

Folders and files

Latest commit

History

Repository files navigation

Image Captioning

What is Image Captioning?

Dataset Used

Network Structure

Some Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages