
arxiv: 1904.09317 · v2 · pith:67PFI5BKnew · submitted 2019-04-19 · cs.LG · cs.CL · cs.CV · cs.NE · stat.ML

Challenges and Prospects in Vision and Language Research

classification cs.LG cs.CL cs.CV cs.NE stat.ML
keywords language tasks understanding vision achieving affairs artificial been

Language-grounded image understanding tasks have often been proposed as a method for evaluating progress in artificial intelligence. Ideally, these tasks should test a plethora of capabilities that integrate computer vision, reasoning, and natural language understanding. However, rather than behaving as visual Turing tests, recent studies have demonstrated that state-of-the-art systems achieve good performance by exploiting flaws in datasets and evaluation procedures. We review the current state of affairs and outline a path forward.

