
arxiv: 1801.02209 · v2 · pith:6OF5ROXXnew · submitted 2018-01-07 · 💻 cs.LG · cs.AI

Building Generalizable Agents with a Realistic and Rich 3D Environment

keywords house3d · environment · environments · houses · unseen · variations · agent · agents

Teaching an agent to navigate in an unseen 3D environment is a challenging task, even in simulated environments. To generalize to unseen environments, an agent needs to be robust to low-level variations (e.g. color, texture, object changes) as well as high-level variations (e.g. layout changes of the environment). To improve overall generalization, all types of variation in the environment have to be taken into consideration via different levels of data augmentation. To this end, we propose House3D, a rich, extensible and efficient environment that contains 45,622 human-designed 3D scenes of visually realistic houses, ranging from single-room studios to multi-storied houses, equipped with a diverse set of fully labeled 3D objects, textures and scene layouts, based on the SUNCG dataset (Song et al.). The diversity in House3D opens the door to scene-level augmentation, while the label-rich nature of House3D enables us to inject pixel- and task-level augmentations such as domain randomization (Tobin et al.) and multi-task training. Using a subset of houses in House3D, we show that reinforcement learning agents trained with a combination of augmentations at different levels perform much better in unseen environments than our baselines with raw RGB input, by over 8% in terms of navigation success rate. House3D is publicly available at http://github.com/facebookresearch/House3D.



Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Emergence of Exploratory Look-Around Behaviors through Active Observation Completion

    cs.CV 2019-06 unverdicted novelty 6.0

    An RL agent learns to actively explore by being rewarded for inferring unobserved scene parts after short glimpse sequences, with sidekick policy learning enabling generalization to other active perception tasks.

  2. Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction

    cs.LG 2019-06 unverdicted novelty 6.0

    The CDAN framework uses diversity exploration and adversarial self-correction for continual RL in continuous control. Evaluated on the new CAM environment with the NSD metric, it shows an 18.35% NSD improvement over the baseline.

  3. On Evaluation of Embodied Navigation Agents

    cs.AI 2018-07 accept novelty 6.0

    Consensus recommendations for standardized evaluation measures, problem statements, and benchmarking scenarios in embodied navigation research.

  4. Why Build an Assistant in Minecraft?

    cs.AI 2019-07 unverdicted novelty 4.0

    A rationale is presented for developing an assistant in Minecraft to advance natural language understanding and dialogue learning.