MAVIC corrects Bellman backups at instruction boundaries by adjusting the incoming objective and restoring continuation value, enabling consistent estimation under stochastic instruction switching in cooperative MARL.
Proceedings of the 2024
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
Distill refines user task specifications for robots by pruning unnecessary steps, generalizing meanings, and relaxing order constraints, as demonstrated in a crowdsourcing study on a web interface.
Mixed-methods research shows collective care practices are constrained by personal, relational, technological, and structural factors in existing PHI systems, leading to the CC-Proact operational map with three design levers and ten recommendations for collective health informatics.
citing papers explorer
-
Robust Instruction Compliance in Cooperative Multi-Agent Reinforcement Learning
MAVIC corrects Bellman backups at instruction boundaries by adjusting the incoming objective and restoring continuation value, enabling consistent estimation under stochastic instruction switching in cooperative MARL.
-
Distill: Uncovering the True Intent behind Human-Robot Communication
Distill refines user task specifications for robots by pruning unnecessary steps, generalizing meanings, and relaxing order constraints, as demonstrated in a crowdsourcing study on a web interface.
-
Unpacking "Personal" Health Informatics for Proactive Collective Care
Mixed-methods research shows collective care practices are constrained by personal, relational, technological, and structural factors in existing PHI systems, leading to the CC-Proact operational map with three design levers and ten recommendations for collective health informatics.