arxiv: 1803.01446 · v3 · pith:WYUM2GIJnew · submitted 2018-03-05 · 💻 cs.RO

Learning to Sequence Robot Behaviors for Visual Navigation

classification 💻 cs.RO
keywords robot, behaviors, low-level, control, learning, navigation, obstacles, policies

Recent literature in the robotics community has focused on learning robot behaviors that abstract out lower-level details of robot control. To fully leverage the efficacy of such behaviors, it is necessary to select and sequence them to achieve a given task. In this paper, we present an approach to both learn and sequence robot behaviors, applied to the problem of visual navigation of mobile robots. We construct a layered representation of control policies composed of low-level behaviors and a meta-level policy. The low-level behaviors enable the robot to locomote in a particular environment while avoiding obstacles, and the meta-level policy actively selects the low-level behavior most appropriate for the current situation based purely on visual feedback. We demonstrate the effectiveness of our method on three simulated robot navigation tasks: a legged hexapod robot which must successfully traverse varying terrain, a wheeled robot which must navigate a maze-like course while avoiding obstacles, and finally a wheeled robot navigating in the presence of dynamic obstacles. We show that by learning control policies in a layered manner, we gain the ability to successfully traverse new compound environments composed of distinct sub-environments, and outperform both the low-level behaviors in their respective sub-environments, as well as a hand-crafted selection of low-level policies on these compound environments.
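The layered structure described in the abstract can be illustrated with a minimal sketch: a meta-level policy scores the available low-level behaviors from the current observation, and the highest-scoring behavior produces the action. This is not the authors' implementation. Every name here (`LayeredController`, `select_behavior`, `toy_meta_policy`, the two gait functions) is hypothetical, the observation is a single made-up "roughness" number rather than an image, and the meta-level policy is a hand-written stand-in for what the paper learns from visual feedback.

```python
from typing import Callable, Dict, Sequence, Tuple

# Assumption: an observation is a short feature vector standing in for an image.
Observation = Sequence[float]
# Assumption: an action is a label standing in for a motor command.
Action = str
Behavior = Callable[[Observation], Action]
MetaPolicy = Callable[[Observation], Dict[str, float]]


def select_behavior(scores: Dict[str, float]) -> str:
    """Return the name of the behavior with the highest score."""
    return max(scores, key=scores.get)


class LayeredController:
    """Hypothetical two-layer controller: a meta-level policy picks a low-level behavior."""

    def __init__(self, behaviors: Dict[str, Behavior], meta_policy: MetaPolicy) -> None:
        self.behaviors = behaviors
        self.meta_policy = meta_policy

    def act(self, obs: Observation) -> Tuple[str, Action]:
        # Meta level: score every low-level behavior for this observation.
        scores = self.meta_policy(obs)
        name = select_behavior(scores)
        # Low level: the selected behavior maps the observation to an action.
        return name, self.behaviors[name](obs)


def flat_gait(obs: Observation) -> Action:
    # Hypothetical low-level behavior for smooth terrain.
    return "flat_step"


def rough_gait(obs: Observation) -> Action:
    # Hypothetical low-level behavior for uneven terrain.
    return "high_step"


def toy_meta_policy(obs: Observation) -> Dict[str, float]:
    # Hand-written stand-in for a learned policy: obs[0] is an assumed
    # terrain roughness estimate in [0, 1].
    roughness = obs[0]
    return {"flat": 1.0 - roughness, "rough": roughness}


controller = LayeredController(
    behaviors={"flat": flat_gait, "rough": rough_gait},
    meta_policy=toy_meta_policy,
)
```

Calling `controller.act([0.9])` selects the rough-terrain behavior, while `controller.act([0.1])` selects the flat-terrain one. Keeping selection separate from the behaviors mirrors the abstract's point that low-level behaviors for individual sub-environments can be combined to handle a compound environment.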

