CAR-Net: Clairvoyant Attentive Recurrent Network

Alexandre Alahi; Amir Sadeghian; Ferdinand Legros; Maxime Voisin; Ricky Vesel; Silvio Savarese

arxiv: 1711.10061 · v3 · pith:IVOWSG42new · submitted 2017-11-28 · 💻 cs.CV

CAR-Net: Clairvoyant Attentive Recurrent Network

Amir Sadeghian , Ferdinand Legros , Maxime Voisin , Ricky Vesel , Alexandre Alahi , Silvio Savarese This is my paper

classification 💻 cs.CV

keywords car-netagentsimagenavigationpredictionscenestrajectoryagent

0 comments

read the original abstract

We present an interpretable framework for path prediction that leverages dependencies between agents' behaviors and their spatial navigation environment. We exploit two sources of information: the past motion trajectory of the agent of interest and a wide top-view image of the navigation scene. We propose a Clairvoyant Attentive Recurrent Network (CAR-Net) that learns where to look in a large image of the scene when solving the path prediction task. Our method can attend to any area, or combination of areas, within the raw image (e.g., road intersections) when predicting the trajectory of the agent. This allows us to visualize fine-grained semantic elements of navigation scenes that influence the prediction of trajectories. To study the impact of space on agents' trajectories, we build a new dataset made of top-view images of hundreds of scenes (Formula One racing tracks) where agents' behaviors are heavily influenced by known areas in the images (e.g., upcoming turns). CAR-Net successfully attends to these salient regions. Additionally, CAR-Net reaches state-of-the-art accuracy on the standard trajectory forecasting benchmark, Stanford Drone Dataset (SDD). Finally, we show CAR-Net's ability to generalize to unseen scenes.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks
cs.CV 2019-07 unverdicted novelty 5.0

Social-BiGAT is a graph-based generative adversarial network using GAT for social interaction features and Bicycle-GAN for multimodal outputs that reports state-of-the-art results on pedestrian trajectory forecasting ...