Continual World: A Robotic Benchmark For Continual Reinforcement Learning

{\L}ukasz Kuci\'nski; Maciej Wo{\l}czyk; Micha{\l} Zaj\k{a}c; Piotr Mi{\l}o\'s; Razvan Pascanu

arxiv: 2105.10919 · v3 · pith:QDXX764Cnew · submitted 2021-05-23 · 💻 cs.LG · cs.AI· cs.RO

Continual World: A Robotic Benchmark For Continual Reinforcement Learning

Maciej Wo{\l}czyk , Micha{\l} Zaj\k{a}c , Razvan Pascanu , {\L}ukasz Kuci\'nski , Piotr Mi{\l}o\'s This is my paper

classification 💻 cs.LG cs.AIcs.RO

keywords benchmarkcontinuallearningabilityagentsbuildingcommunitycomputationally

0 comments

read the original abstract

Continual learning (CL) -- the ability to continuously learn, building on previously acquired knowledge -- is a natural requirement for long-lived autonomous reinforcement learning (RL) agents. While building such agents, one needs to balance opposing desiderata, such as constraints on capacity and compute, the ability to not catastrophically forget, and to exhibit positive transfer on new tasks. Understanding the right trade-off is conceptually and computationally challenging, which we argue has led the community to overly focus on catastrophic forgetting. In response to these issues, we advocate for the need to prioritize forward transfer and propose Continual World, a benchmark consisting of realistic and meaningfully diverse robotic tasks built on top of Meta-World as a testbed. Following an in-depth empirical evaluation of existing CL methods, we pinpoint their limitations and highlight unique algorithmic challenges in the RL setting. Our benchmark aims to provide a meaningful and computationally inexpensive challenge for the community and thus help better understand the performance of existing and future solutions. Information about the benchmark, including the open-source code, is available at https://sites.google.com/view/continualworld.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Continual Learning Bench: Evaluating Frontier AI Systems in Real-World Stateful Environments
cs.AI 2026-06 unverdicted novelty 8.0

CL-Bench is the first expert-validated benchmark for continual learning in frontier LLMs across six real-world domains, showing limited gains and that naive in-context learning outperforms dedicated memory systems.
Fine-Tuning Regimes Define Distinct Continual Learning Problems
cs.LG 2026-04 unverdicted novelty 6.0

The relative rankings of continual learning methods are not preserved across different fine-tuning regimes defined by trainable parameter depth.
Evidence of an Emergent "Self" in Continual Robot Learning
cs.RO 2026-03 unverdicted novelty 6.0

Continual learning robots form a significantly more stable invariant subnetwork than constant-task controls, and preserving it improves adaptation while damaging it hurts performance.