pith. the verified trust layer for science. sign in

arxiv: 1604.03249 · v1 · pith:UNYUTF7Lnew · submitted 2016-04-12 · 💻 cs.CV · cs.CL

Attributes as Semantic Units between Natural Language and Visual Recognition

classification 💻 cs.CV cs.CL
keywords languagenaturalattributesvisualallowdiscussimagesinteraction
0
0 comments X p. Extension
Add this Pith Number to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{UNYUTF7L}

Prints a linked pith:UNYUTF7L badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Impressive progress has been made in the fields of computer vision and natural language processing. However, it remains a challenge to find the best point of interaction for these very different modalities. In this chapter we discuss how attributes allow us to exchange information between the two modalities and in this way lead to an interaction on a semantic level. Specifically we discuss how attributes allow using knowledge mined from language resources for recognizing novel visual categories, how we can generate sentence description about images and video, how we can ground natural language in visual content, and finally, how we can answer natural language questions about images.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.