pith. sign in

arxiv: 1811.05013 · v1 · pith:IWH5J3G3new · submitted 2018-11-12 · 💻 cs.CV · cs.AI· cs.CL· cs.LG

Blindfold Baselines for Embodied QA

classification 💻 cs.CV cs.AIcs.CLcs.LG
keywords blindfoldagentansweringbaselinebaselinesembodiedembodiedqaenvironment
0
0 comments X
read the original abstract

We explore blindfold (question-only) baselines for Embodied Question Answering. The EmbodiedQA task requires an agent to answer a question by intelligently navigating in a simulated environment, gathering necessary visual information only through first-person vision before finally answering. Consequently, a blindfold baseline which ignores the environment and visual information is a degenerate solution, yet we show through our experiments on the EQAv1 dataset that a simple question-only baseline achieves state-of-the-art results on the EmbodiedQA task in all cases except when the agent is spawned extremely close to the object.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.