Fine-tuning Gemma 3 27B on modest human-labeled street-view data yields building condition scores that align with and sometimes exceed individual human raters on correlation metrics, with knowledge distillation producing comparable smaller LLM, CNN, and transformer models.
arXiv preprint arXiv:2409.19527 (2024)
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Leveraging Multimodal LLMs for Built Environment and Housing Attribute Assessment from Street-View Imagery
Fine-tuning Gemma 3 27B on modest human-labeled street-view data yields building condition scores that align with and sometimes exceed individual human raters on correlation metrics, with knowledge distillation producing comparable smaller LLM, CNN, and transformer models.