End-to-end training of deep visuomotor policies.Journal of Machine Learning Research, 17(39):1–40, 2016

Sergey Levine, Chelsea Finn, Trevor Darrell, Pieter Abbeel · 2016

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

browse 4 citing papers

representative citing papers

SnareNet: Flexible Repair Layers for Neural Networks with Hard Constraints

cs.LG · 2026-02-10 · unverdicted · novelty 7.0

SnareNet introduces a repair layer that navigates the range space of constraints plus adaptive relaxation training to enforce hard non-convex constraints on neural network outputs more reliably than prior methods.

Tighter Regret Bounds for Contextual Action-Set Reinforcement Learning

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

Extends MVP to contextual action-set RL and derives minimax regret bound O~(sqrt(S A H^3 K log L)) for adversarial contexts plus a gap-dependent bound.

Precise Aggressive Aerial Maneuvers with Sensorimotor Policies

cs.RO · 2026-04-07 · unverdicted · novelty 6.0

Reinforcement learning sensorimotor policies enable quadrotors to traverse narrow gaps at extreme tilts with 5 cm clearance using only vision and proprioception, including reactive traversal of moving gaps.

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

cs.LG · 2026-03-13 · unverdicted · novelty 6.0

LeWM is the first end-to-end trainable JEPA from pixels that uses only two loss terms for stable training and fast planning on 2D/3D control tasks.

citing papers explorer

Showing 4 of 4 citing papers.

SnareNet: Flexible Repair Layers for Neural Networks with Hard Constraints cs.LG · 2026-02-10 · unverdicted · none · ref 23
SnareNet introduces a repair layer that navigates the range space of constraints plus adaptive relaxation training to enforce hard non-convex constraints on neural network outputs more reliably than prior methods.
Tighter Regret Bounds for Contextual Action-Set Reinforcement Learning cs.LG · 2026-05-15 · unverdicted · none · ref 23
Extends MVP to contextual action-set RL and derives minimax regret bound O~(sqrt(S A H^3 K log L)) for adversarial contexts plus a gap-dependent bound.
Precise Aggressive Aerial Maneuvers with Sensorimotor Policies cs.RO · 2026-04-07 · unverdicted · none · ref 23
Reinforcement learning sensorimotor policies enable quadrotors to traverse narrow gaps at extreme tilts with 5 cm clearance using only vision and proprioception, including reactive traversal of moving gaps.
LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels cs.LG · 2026-03-13 · unverdicted · none · ref 1
LeWM is the first end-to-end trainable JEPA from pixels that uses only two loss terms for stable training and fast planning on 2D/3D control tasks.

End-to-end training of deep visuomotor policies.Journal of Machine Learning Research, 17(39):1–40, 2016

fields

years

verdicts

representative citing papers

citing papers explorer