Generating captions without looking beyond objects

Arnold W.M. Smeulders; Christof Monz; Hendrik Heuer

arXiv: 1610.03708 · v2 · pith:UZ4DCOSEnew · submitted 2016-10-12 · cs.CV · cs.CL

Hendrik Heuer, Christof Monz, Arnold W.M. Smeulders

classification: cs.CV, cs.CL

keywords: captions, image, nouns, captioning, categories, performance, without, word

This paper explores new evaluation perspectives for image captioning and introduces a noun translation task that achieves comparable image caption generation performance by translating from a set of nouns to captions. This implies that in image captioning, all word categories other than nouns can be supplied by a powerful language model without sacrificing performance on n-gram precision. The paper also investigates lower and upper bounds on how much individual word categories in the captions contribute to the final BLEU score. Large potential improvements exist for nouns, verbs, and prepositions.
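To make the notion of a word category's contribution to n-gram precision concrete, here is a minimal sketch in Python. It is not the authors' evaluation code: it computes only the clipped (modified) n-gram precision that BLEU is built on, against a single reference, without the brevity penalty or the geometric mean over n-gram orders. The function names, the example caption, the hand-labeled noun set, and the `<unk>` placeholder are all illustrative assumptions.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, reference, n):
    """Clipped n-gram precision of a candidate against one reference.

    Each candidate n-gram is credited at most as many times as it
    occurs in the reference (the clipping step used in BLEU).
    """
    cand_counts = Counter(ngrams(candidate, n))
    ref_counts = Counter(ngrams(reference, n))
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    clipped = sum(min(count, ref_counts[gram])
                  for gram, count in cand_counts.items())
    return clipped / total


def mask_words(tokens, words_to_mask, placeholder="<unk>"):
    """Replace every token in words_to_mask with a placeholder."""
    return [placeholder if tok in words_to_mask else tok for tok in tokens]


# Hypothetical example caption; the noun set is labeled by hand here,
# where a real experiment would rely on a part-of-speech tagger.
reference = "a man rides a horse on the beach".split()
candidate = "a man rides a horse on the beach".split()
nouns = {"man", "horse", "beach"}

# A perfect caption with every noun masked out: the drop in precision
# shows how much of the score depended on getting the nouns right.
masked = mask_words(candidate, nouns)

full_unigram = modified_precision(candidate, reference, 1)
masked_unigram = modified_precision(masked, reference, 1)
masked_bigram = modified_precision(masked, reference, 2)
```

In this toy case, masking the three nouns lowers unigram precision from 8/8 to 5/8 and leaves only 2 of 7 bigrams matching, since every bigram that touches a masked noun is lost. Masking a category in an otherwise perfect caption is one simple way to probe how much that category can contribute; the paper's own bound computations may be defined differently.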
