EBM-RL applies a GRPO-based RL method with decomposed rewards for scene alignment, perceptual utility, faithfulness, and format to improve video-grounded role-playing dialogue over text-only baselines.
The man stands very close in front of the woman, facing her directly
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.AI 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Reward-Decomposed Reinforcement Learning for Immersive Video Role-Playing
EBM-RL applies a GRPO-based RL method with decomposed rewards for scene alignment, perceptual utility, faithfulness, and format to improve video-grounded role-playing dialogue over text-only baselines.