Image-Captioning using VGG for feature extraction

Uses the Flickr8k dataset (about 1 GB). Five descriptions are available for each photo.
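
As a rough illustration of the "five descriptions per photo" layout, here is a minimal sketch that groups captions by image. It assumes the captions come as lines of the form `<image>.jpg#<index><TAB><caption>`, which is how the Flickr8k token file is commonly distributed. The function name `load_captions` and the sample file names are hypothetical and not taken from this repository.

```python
from collections import defaultdict


def load_captions(lines):
    """Group captions by image file name.

    Assumption: each line looks like '<image>.jpg#<index>\t<caption>'.
    Lines without a tab separator are skipped.
    """
    captions = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line or "\t" not in line:
            continue
        image_id, caption = line.split("\t", 1)
        image_name = image_id.split("#", 1)[0]  # drop the '#<index>' suffix
        captions[image_name].append(caption.strip())
    return dict(captions)


# Made-up sample lines, for illustration only.
sample = [
    "img_001.jpg#0\tA dog runs on the grass .",
    "img_001.jpg#1\tA brown dog is running outside .",
    "img_002.jpg#0\tTwo children play on a beach .",
]
captions = load_captions(sample)
print(len(captions["img_001.jpg"]))  # 2
```

Because the function accepts any iterable of lines, an open file handle can be passed in directly.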

The code uses Keras with the TensorFlow backend. VGG is used to extract the image features.
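
The feature extraction step could look roughly like the sketch below. It is not the code from this repository: it assumes VGG16 (the README does not say which VGG variant is used), assumes features are taken from the `fc2` layer, and uses the current `tensorflow.keras` API rather than the pinned Keras 1.2.2. The names `build_extractor` and `extract_features` are hypothetical.

```python
FEATURE_LAYER = "fc2"  # assumption: second fully connected layer of VGG16 (4096 units)


def build_extractor():
    """Return a model mapping a 224x224 RGB image batch to FEATURE_LAYER activations."""
    # Imports are deferred so this module can be imported without TensorFlow installed.
    from tensorflow.keras.applications.vgg16 import VGG16
    from tensorflow.keras.models import Model

    base = VGG16(weights="imagenet")  # downloads the ImageNet weights on first use
    return Model(inputs=base.input, outputs=base.get_layer(FEATURE_LAYER).output)


def extract_features(extractor, image_path):
    """Return a 1-D feature vector for the image stored at image_path."""
    import numpy as np
    from tensorflow.keras.applications.vgg16 import preprocess_input
    from tensorflow.keras.utils import img_to_array, load_img

    image = load_img(image_path, target_size=(224, 224))  # VGG16 input size
    batch = np.expand_dims(img_to_array(image), axis=0)   # shape (1, 224, 224, 3)
    batch = preprocess_input(batch)                       # VGG-specific preprocessing
    return extractor.predict(batch, verbose=0)[0]


# Usage (hypothetical path):
# features = extract_features(build_extractor(), "example.jpg")
```

Extracting the features once and caching them per image avoids running VGG again for each of the five captions.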

Beam search is not yet implemented.
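
For reference, a generic beam search decoder can be sketched as follows. This is an illustration of the technique only, not part of this repository. The callable `next_log_probs` is a hypothetical stand-in for the caption model, and the toy probability table is made up. With `beam_width=1` the procedure reduces to greedy decoding.

```python
import math


def beam_search(next_log_probs, start, end, beam_width=3, max_len=20):
    """Return the highest-scoring token sequence found by beam search.

    next_log_probs(seq) is a hypothetical callable that maps a partial
    sequence (tuple of tokens) to a dict {token: log-probability}.
    Scores are summed log-probabilities, with no length normalization.
    """
    beams = [(0.0, (start,))]  # (cumulative log-probability, sequence)
    finished = []
    for _ in range(max_len):
        candidates = []
        for score, seq in beams:
            for token, logp in next_log_probs(seq).items():
                candidates.append((score + logp, seq + (token,)))
        if not candidates:
            break
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = []
        for score, seq in candidates[:beam_width]:
            if seq[-1] == end:
                finished.append((score, seq))  # complete caption
            else:
                beams.append((score, seq))     # keep expanding
        if not beams:
            break
    finished.extend(beams)  # fall back to unfinished sequences at max_len
    best = max(finished, key=lambda c: c[0])
    return list(best[1])


# Toy model (made-up probabilities) where the greedy choice is not the best one.
TABLE = {
    "<s>": {"a": math.log(0.6), "the": math.log(0.4)},
    "a": {"dog": math.log(0.55), "cat": math.log(0.45)},
    "the": {"dog": math.log(0.9), "cat": math.log(0.1)},
    "dog": {"</s>": 0.0},
    "cat": {"</s>": 0.0},
}


def toy_model(seq):
    return TABLE[seq[-1]]


print(beam_search(toy_model, "<s>", "</s>", beam_width=1))  # ['<s>', 'a', 'dog', '</s>']
print(beam_search(toy_model, "<s>", "</s>", beam_width=2))  # ['<s>', 'the', 'dog', '</s>']
```

In the toy example, greedy decoding commits to "a" (probability 0.6) and ends with a caption of probability 0.33, while a beam of width 2 keeps "the" alive and finds the caption with probability 0.36.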

You can download the weights here

Examples

Dependencies

Keras 1.2.2
TensorFlow 0.12.1
numpy
matplotlib

References

[1] Vinyals, Oriol, et al. "Show and tell: A neural image caption generator." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015.

[2] Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." arXiv preprint arXiv:1409.1556 (2014).


teodor-cotet/ImageCaptioning
