pith. machine review for the scientific record. sign in

arxiv: 1206.6398 · v2 · submitted 2012-06-27 · 💻 cs.LG · stat.ML

Recognition: unknown

Learning Parameterized Skills

Authors on Pith no claims yet
classification 💻 cs.LG stat.ML
keywords methodparameterizedparameterslearningmanifolddistributionpoliciespolicy
0
0 comments X
read the original abstract

We introduce a method for constructing skills capable of solving tasks drawn from a distribution of parameterized reinforcement learning problems. The method draws example tasks from a distribution of interest and uses the corresponding learned policies to estimate the topology of the lower-dimensional piecewise-smooth manifold on which the skill policies lie. This manifold models how policy parameters change as task parameters vary. The method identifies the number of charts that compose the manifold and then applies non-linear regression in each chart to construct a parameterized skill by predicting policy parameters from task parameters. We evaluate our method on an underactuated simulated robotic arm tasked with learning to accurately throw darts at a parameterized target location.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Emergent Neural Automaton Policies: Learning Symbolic Structure from Visuomotor Trajectories

    cs.RO 2026-03 unverdicted novelty 6.0

    ENAP extracts an emergent Mealy automaton from visuomotor trajectories to act as a high-level planner for a low-level residual policy, yielding up to 27% higher success than end-to-end VLA policies in low-data regimes.