This repo was a course project for CS 598DL Fall 2018 in UIUC. It is a PyTorch implementation of the paper Show and Tell: A Neural Image Caption Generator. Please refer to Project Slides for more details. The model trained with our code (~130 hrs training!) can perform better than Google's original implementation! See MSCOCO Leaderboard for detailed performance ranking with team siqimiao, and team OriolVinyals is the Google's implementation.
If you find our repo useful, please star us or cite us. :)
@software{Miao_Show-and-tell_2018,
author = {Miao, Siqi},
doi = {10.5281/zenodo.1234},
month = {11},
title = {{Show-and-tell}},
url = {https://github.com/siqim/Show-and-tell},
version = {1.0.0},
year = {2018}
}