Specialized deep residual policy safe reinforcement learning-based controller for complex and continuous state-action spaces. arXiv preprint arXiv:2310.14788

Ammar N Abbas, Georgios C Chasparis, John D Kelleher · arXiv 2310.14788

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Behavior Uncloning: Distilling Mode Redirection into Policy Weights without Inference-Time Steering

cs.RO · 2026-06-28 · unverdicted · novelty 5.0

MoRE improves robot policy success rates by 44 percentage points by distilling mode redirection into weights, matching filtered retraining performance without inference overhead.
