Natural Environment Benchmarks for Reinforcement Learning

Amy Zhang; Joelle Pineau; Yuxin Wu

arXiv 1811.06032 v1 pith:FTAU7XYS submitted 2018-11-14 cs.LG cs.AI stat.ML

Amy Zhang, Yuxin Wu, Joelle Pineau

classification cs.LG cs.AI stat.ML

keywords learning, algorithms, benchmark, data, domains, natural, reinforcement, while

Abstract

While current benchmark reinforcement learning (RL) tasks have been useful to drive progress in the field, they are in many ways poor substitutes for learning with real-world data. By testing increasingly complex RL algorithms on low-complexity simulation environments, we often end up with brittle RL policies that generalize poorly beyond the very specific domain. To combat this, we propose three new families of benchmark RL domains that contain some of the complexity of the natural world, while still supporting fast and extensive data acquisition. The proposed domains also permit a characterization of generalization through fair train/test separation, and easy comparison and replication of results. Through this work, we challenge the RL research community to develop more robust algorithms that meet high standards of evaluation.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

stable-worldmodel: A Platform for Reproducible World Modeling Research and Evaluation
cs.LG 2026-05 unverdicted novelty 5.0

The paper presents stable-worldmodel (swm), a platform with a high-performance data layer, modern world model baselines, planning solvers, and extended environments for reproducible research and generalization evaluation.

Optimal Control with Natural Images: Efficient Reinforcement Learning using Overcomplete Sparse Codes
cs.LG 2024-12 unverdicted novelty 5.0

Overcomplete sparse coding of natural images enables reinforcement learning to solve optimal control tasks orders of magnitude larger than with complete codes, via a new scalable benchmark and theoretical justification.