SnareNet introduces a repair layer that navigates the range space of constraints plus adaptive relaxation training to enforce hard non-convex constraints on neural network outputs more reliably than prior methods.
End-to-end training of deep visuomotor policies.Journal of Machine Learning Research, 17(39):1–40, 2016
4 Pith papers cite this work. Polarity classification is still indexing.
years
2026 4verdicts
UNVERDICTED 4representative citing papers
Extends MVP to contextual action-set RL and derives minimax regret bound O~(sqrt(S A H^3 K log L)) for adversarial contexts plus a gap-dependent bound.
Reinforcement learning sensorimotor policies enable quadrotors to traverse narrow gaps at extreme tilts with 5 cm clearance using only vision and proprioception, including reactive traversal of moving gaps.
LeWM is the first end-to-end trainable JEPA from pixels that uses only two loss terms for stable training and fast planning on 2D/3D control tasks.
citing papers explorer
-
SnareNet: Flexible Repair Layers for Neural Networks with Hard Constraints
SnareNet introduces a repair layer that navigates the range space of constraints plus adaptive relaxation training to enforce hard non-convex constraints on neural network outputs more reliably than prior methods.
-
Tighter Regret Bounds for Contextual Action-Set Reinforcement Learning
Extends MVP to contextual action-set RL and derives minimax regret bound O~(sqrt(S A H^3 K log L)) for adversarial contexts plus a gap-dependent bound.
-
Precise Aggressive Aerial Maneuvers with Sensorimotor Policies
Reinforcement learning sensorimotor policies enable quadrotors to traverse narrow gaps at extreme tilts with 5 cm clearance using only vision and proprioception, including reactive traversal of moving gaps.
-
LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels
LeWM is the first end-to-end trainable JEPA from pixels that uses only two loss terms for stable training and fast planning on 2D/3D control tasks.