ScVLM: Enhancing vision-language model for safety-critical event understanding
1 Pith paper cites this work. Polarity classification is still being indexed.
Fields: cs.CV (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
Enhancing Multimodal Large Language Models for Safety-Critical Driving Video Analysis
A data-fusion pipeline generates pseudo-labels from video, telematics, and CV models to fine-tune QwenVL-2.5 with DoRA adapters, yielding reported gains in detecting and explaining safety-critical driving events.