Qi, Hao Su, Kaichun Mo, and Leonidas J
1 Pith paper cites this work. Polarity classification is still indexing.
fields: cs.CV (1)
years: 2026 (1)
verdicts: UNVERDICTED (1)
representative citing papers
Physically Guided Visual Mass Estimation from a Single RGB Image
A method estimates mass from single RGB images by fusing depth-based volume cues with vision-language model density semantics via adaptive gating and separate regression heads trained on mass labels only.