Learning to Sequence Robot Behaviors for Visual Navigation

Ali Salman; Guillaume Sartoretti; Hadi Salman; Howie Choset; Matthew Travers; Peng Yin; Puneet Singhal; Tanmay Shankar; William Paivine

arxiv: 1803.01446 · v3 · pith:WYUM2GIJnew · submitted 2018-03-05 · 💻 cs.RO

Learning to Sequence Robot Behaviors for Visual Navigation

Hadi Salman , Puneet Singhal , Tanmay Shankar , Peng Yin , Ali Salman , William Paivine , Guillaume Sartoretti , Matthew Travers

show 1 more author

Howie Choset

This is my paper

classification 💻 cs.RO

keywords robotbehaviorslow-levelcontrollearningnavigationobstaclespolicies

0 comments

read the original abstract

Recent literature in the robotics community has focused on learning robot behaviors that abstract out lower-level details of robot control. To fully leverage the efficacy of such behaviors, it is necessary to select and sequence them to achieve a given task. In this paper, we present an approach to both learn and sequence robot behaviors, applied to the problem of visual navigation of mobile robots. We construct a layered representation of control policies composed of low- level behaviors and a meta-level policy. The low-level behaviors enable the robot to locomote in a particular environment while avoiding obstacles, and the meta-level policy actively selects the low-level behavior most appropriate for the current situation based purely on visual feedback. We demonstrate the effectiveness of our method on three simulated robot navigation tasks: a legged hexapod robot which must successfully traverse varying terrain, a wheeled robot which must navigate a maze-like course while avoiding obstacles, and finally a wheeled robot navigating in the presence of dynamic obstacles. We show that by learning control policies in a layered manner, we gain the ability to successfully traverse new compound environments composed of distinct sub-environments, and outperform both the low-level behaviors in their respective sub-environments, as well as a hand-crafted selection of low-level policies on these compound environments.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A model of phase-coupled delay equations for the dynamics of word usage
physics.soc-ph 2023-04 unverdicted novelty 4.0

Transforms Volterra model near Hopf bifurcation into phase model for coupling word usage dynamics to address coherent oscillations.