Shared Autonomy via Hindsight Optimization

J. Andrew Bagnell; Shervin Javdani; Siddhartha S. Srinivasa

arxiv: 1503.07619 · v2 · pith:JO4GITG5new · submitted 2015-03-26 · 💻 cs.RO

Shared Autonomy via Hindsight Optimization

Shervin Javdani , Siddhartha S. Srinivasa , J. Andrew Bagnell This is my paper

classification 💻 cs.RO

keywords goaluserautonomyrobotcontrolsharedachieveaction

0 comments

read the original abstract

In shared autonomy, user input and robot autonomy are combined to control a robot to achieve a goal. Often, the robot does not know a priori which goal the user wants to achieve, and must both predict the user's intended goal, and assist in achieving that goal. We formulate the problem of shared autonomy as a Partially Observable Markov Decision Process with uncertainty over the user's goal. We utilize maximum entropy inverse optimal control to estimate a distribution over the user's goal based on the history of inputs. Ideally, the robot assists the user by solving for an action which minimizes the expected cost-to-go for the (unknown) goal. As solving the POMDP to select the optimal action is intractable, we use hindsight optimization to approximate the solution. In a user study, we compare our method to a standard predict-then-blend approach. We find that our method enables users to accomplish tasks more quickly while utilizing less input. However, when asked to rate each system, users were mixed in their assessment, citing a tradeoff between maintaining control authority and accomplishing tasks quickly.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Learning Reward Functions by Integrating Human Demonstrations and Preferences
cs.RO 2019-06 unverdicted novelty 6.0

DemPref uses demonstrations to form a coarse reward prior and ground active preference queries, achieving higher efficiency than pure preference learning and higher user preference than IRL in experiments.