SPLC uses social preference feedback to auto-generate preference data for offline RL, improving socially compliant crowd robot navigation over baselines.
Conservative Q- learning for offline reinforcement learning,
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.RO 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
SPLC: Social Preference Learning for Crowd Robot Navigation
SPLC uses social preference feedback to auto-generate preference data for offline RL, improving socially compliant crowd robot navigation over baselines.