Holistic autonomous driving understanding by bird’s-eye-view injected multi-modal large models
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CV (1)
Years: 2025 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
Descriptor: Distance-Annotated Traffic Perception Question Answering (DTPQA)
DTPQA is a new VQA benchmark consisting of synthetic and real-world traffic images with distance annotations to isolate and measure VLM perception capabilities for driving decisions.