pith. sign in

arxiv: 1807.09664 · v1 · pith:SIW6ER6Enew · submitted 2018-07-25 · 💻 cs.AI · cs.CV

Attend Before you Act: Leveraging human visual attention for continual learning

classification 💻 cs.AI cs.CV
keywords imagesattentionbuildenvironmentgeneratedhumanslearningleveraging
0
0 comments X
read the original abstract

When humans perform a task, such as playing a game, they selectively pay attention to certain parts of the visual input, gathering relevant information and sequentially combining it to build a representation from the sensory data. In this work, we explore leveraging where humans look in an image as an implicit indication of what is salient for decision making. We build on top of the UNREAL architecture in DeepMind Lab's 3D navigation maze environment. We train the agent both with original images and foveated images, which were generated by overlaying the original images with saliency maps generated using a real-time spectral residual technique. We investigate the effectiveness of this approach in transfer learning by measuring performance in the context of noise in the environment.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.