ImagineAgent uses generative world modeling and tool-augmented reinforcement learning to reach state-of-the-art open-vocabulary HOI performance on SWIG-HOI and HICO-DET while using only 36.7% of the training data required by prior methods.
person" as an OBJECT (not as subject) if there are clear interactions where a person is the target/recipient of an action. For example: * CORRECT: Detect
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CV (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
What if Agents Could Imagine? Reinforcing Open-Vocabulary HOI Comprehension through Generation