pith. sign in

arxiv: 1807.09434 · v1 · pith:GBTLRNBTnew · submitted 2018-07-25 · 💻 cs.CV · cs.CL

Distinctive-attribute Extraction for Image Captioning

classification 💻 cs.CV cs.CL
keywords imagenetworksneuralanalyzedcaptioncaptioningcaptionsdescribing
0
0 comments X
read the original abstract

Image captioning, an open research issue, has been evolved with the progress of deep neural networks. Convolutional neural networks (CNNs) and recurrent neural networks (RNNs) are employed to compute image features and generate natural language descriptions in the research. In previous works, a caption involving semantic description can be generated by applying additional information into the RNNs. In this approach, we propose a distinctive-attribute extraction (DaE) which explicitly encourages significant meanings to generate an accurate caption describing the overall meaning of the image with their unique situation. Specifically, the captions of training images are analyzed by term frequency-inverse document frequency (TF-IDF), and the analyzed semantic information is trained to extract distinctive-attributes for inferring captions. The proposed scheme is evaluated on a challenge data, and it improves an objective performance while describing images in more detail.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.