Contrastive learning as goal-conditioned reinforcement learning

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1 · method 1

fields

cs.LG 3 · cs.RO 2

years

2026: 4 · 2022: 1

verdicts

UNVERDICTED 5

representative citing papers

Bellman Value Decomposition for Task Logic in Safe Optimal Control

cs.RO · 2026-02-23 · unverdicted · novelty 7.0

Bellman values for temporal logic tasks decompose into a graph of reach-avoid, avoid, and reach-avoid-loop equations solved by embedding the graph in a two-layer neural net (VDPPO) for safe high-dimensional control.
