Correcting robot plans with natural language feedback

Pratyusha Sharma, Balakumar Sundaralingam, Valts Blukis, Chris Paxton, Tucker Hermans, Antonio Torralba, Jacob Andreas, Dieter Fox · 2022 · DOI 10.15607/rss.2022.xviii.065

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

Robots That Know What to Ask: Recovering Misaligned Rewards through Targeted Explanations

cs.RO · 2026-05-21 · unverdicted · novelty 6.0

Robots detect underspecified reward features via demonstration variation and query targeted natural language explanations to improve reward recovery from imperfect demos.

QuickLAP: Quick Language-Action Preference Learning for Semi-Autonomous Agents

cs.AI · 2025-11-22 · unverdicted · novelty 6.0 · 2 refs

QuickLAP fuses LLM-extracted language observations with physical feedback in a closed-form Bayesian update to cut reward learning error by over 70% in a driving simulator and improve user preference in a 15-person study.

citing papers explorer

Showing 2 of 2 citing papers.

Robots That Know What to Ask: Recovering Misaligned Rewards through Targeted Explanations cs.RO · 2026-05-21 · unverdicted · none · ref 42
Robots detect underspecified reward features via demonstration variation and query targeted natural language explanations to improve reward recovery from imperfect demos.
QuickLAP: Quick Language-Action Preference Learning for Semi-Autonomous Agents cs.AI · 2025-11-22 · unverdicted · none · ref 53 · 2 links
QuickLAP fuses LLM-extracted language observations with physical feedback in a closed-form Bayesian update to cut reward learning error by over 70% in a driving simulator and improve user preference in a 15-person study.

Correcting robot plans with natural language feedback

fields

years

verdicts

representative citing papers

citing papers explorer