pith. sign in

arxiv: 1903.01959 · v1 · pith:KOCTRMCCnew · submitted 2019-03-05 · 💻 cs.RO · cs.AI· cs.LG

Learning Exploration Policies for Navigation

classification 💻 cs.RO cs.AIcs.LG
keywords explorationexplorepoliciesdown-streamenvironmentsfinallylearninglearning-based
0
0 comments X
read the original abstract

Numerous past works have tackled the problem of task-driven navigation. But, how to effectively explore a new environment to enable a variety of down-stream tasks has received much less attention. In this work, we study how agents can autonomously explore realistic and complex 3D environments without the context of task-rewards. We propose a learning-based approach and investigate different policy architectures, reward functions, and training paradigms. We find that the use of policies with spatial memory that are bootstrapped with imitation learning and finally finetuned with coverage rewards derived purely from on-board sensors can be effective at exploring novel environments. We show that our learned exploration policies can explore better than classical approaches based on geometry alone and generic learning-based exploration techniques. Finally, we also show how such task-agnostic exploration can be used for down-stream tasks. Code and Videos are available at: https://sites.google.com/view/exploration-for-nav.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Flying to Image-Specified Objects: 3D Quadrotor Navigation via Cross-Graph Memory and Viewpoint Planning

    cs.RO 2026-06 unverdicted novelty 4.0

    Proposes a hierarchical navigation framework with viewpoint-aware action nodes, cross-graph memory, and learning-based policy for quadrotor InstanceImageNav, claiming improvements over baselines in simulation and real...