A method estimates mass from single RGB images by fusing depth-based volume cues with vision-language model density semantics via adaptive gating and separate regression heads trained on mass labels only.
Yago Vicente, Thomas Dideriksen, Himanshu Arora, Matthieu Guillaumin, and Jitendra Malik
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Physically Guided Visual Mass Estimation from a Single RGB Image
A method estimates mass from single RGB images by fusing depth-based volume cues with vision-language model density semantics via adaptive gating and separate regression heads trained on mass labels only.