and Valenzano, Richard and McIlraith, Sheila A

Icarte, R · 2022 · DOI 10.1613/jair.1.12440

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

About Time: Model-free Reinforcement Learning with Timed Reward Machines

cs.AI · 2025-12-19 · conditional · novelty 7.0

Timed reward machines extend reward machines with timing constraints, allowing model-free RL algorithms to learn policies that satisfy precise temporal requirements on standard benchmarks.

Reward Shaping and Action Masking for Compositional Tasks using Behavior Trees and LLMs

cs.LG · 2026-05-07

citing papers explorer

Showing 2 of 2 citing papers.

About Time: Model-free Reinforcement Learning with Timed Reward Machines cs.AI · 2025-12-19 · conditional · none · ref 27
Timed reward machines extend reward machines with timing constraints, allowing model-free RL algorithms to learn policies that satisfy precise temporal requirements on standard benchmarks.
Reward Shaping and Action Masking for Compositional Tasks using Behavior Trees and LLMs cs.LG · 2026-05-07 · unreviewed · ref 10

and Valenzano, Richard and McIlraith, Sheila A

fields

years

verdicts

representative citing papers

citing papers explorer