Temporal Logic Guided Safe Reinforcement Learning Using Control Barrier Functions

· 2019 · cs.LG · arXiv 1903.09885

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

Using reinforcement learning to learn control policies is a challenge when the task is complex with potentially long horizons. Ensuring adequate but safe exploration is also crucial for controlling physical systems. In this paper, we use temporal logic to facilitate specification and learning of complex tasks. We combine temporal logic with control Lyapunov functions to improve exploration. We incorporate control barrier functions to safeguard the exploration and deployment process. We develop a flexible and learnable system that allows users to specify task objectives and constraints in different forms and at various levels. The framework is also able to take advantage of known system dynamics and handle unknown environmental dynamics by integrating model-free learning with model-based planning.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Model-Free Learning of Safe yet Effective Controllers

cs.RO · 2021-03-26 · unverdicted · novelty 5.0

A model-free RL algorithm learns policies by sequentially maximizing safety probability, then LTL satisfaction probability, then discounted QoC rewards in unknown MDPs.

A Review On Safe Reinforcement Learning Using Lyapunov and Barrier Functions

eess.SY · 2025-08-12 · unverdicted · novelty 2.0

A literature review of safe RL using Lyapunov and barrier functions that identifies a shift to model-free methods since 2017, well-defined open problems per approach class, and high-dimensional scalability as the main barrier.

citing papers explorer

Showing 2 of 2 citing papers.

Model-Free Learning of Safe yet Effective Controllers cs.RO · 2021-03-26 · unverdicted · none · ref 16 · internal anchor
A model-free RL algorithm learns policies by sequentially maximizing safety probability, then LTL satisfaction probability, then discounted QoC rewards in unknown MDPs.
A Review On Safe Reinforcement Learning Using Lyapunov and Barrier Functions eess.SY · 2025-08-12 · unverdicted · none · ref 117 · internal anchor
A literature review of safe RL using Lyapunov and barrier functions that identifies a shift to model-free methods since 2017, well-defined open problems per approach class, and high-dimensional scalability as the main barrier.

Temporal Logic Guided Safe Reinforcement Learning Using Control Barrier Functions

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer