pith. machine review for the scientific record. sign in

arxiv: 1610.02707 · v1 · submitted 2016-10-09 · 💻 cs.AI

Recognition: unknown

Multi-Objective Deep Reinforcement Learning

Authors on Pith no claims yet
classification 💻 cs.AI
keywords learningdeepmulti-objectivereinforcementconvexhigh-dimensionalobjectivesaddition
0
0 comments X
read the original abstract

We propose Deep Optimistic Linear Support Learning (DOL) to solve high-dimensional multi-objective decision problems where the relative importances of the objectives are not known a priori. Using features from the high-dimensional inputs, DOL computes the convex coverage set containing all potential optimal solutions of the convex combinations of the objectives. To our knowledge, this is the first time that deep reinforcement learning has succeeded in learning multi-objective policies. In addition, we provide a testbed with two experiments to be used as a benchmark for deep multi-objective reinforcement learning.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. A Reward-Free Viewpoint on Multi-Objective Reinforcement Learning

    cs.LG 2026-04 unverdicted novelty 7.0

    Adapting RFRL objectives as auxiliary tasks with preference-guided exploration outperforms prior MORL methods in performance and data efficiency on MO-Gymnasium tasks.

  2. A Single Deep Preference-Conditioned Policy for Learning Pareto Coverage Sets

    cs.LG 2026-05 unverdicted novelty 6.0

    A single preference-conditioned policy achieves unique and Lipschitz-continuous Pareto coverage in multi-objective MDPs via a new mirror-descent policy iteration algorithm with O(1/k) convergence.