CrashSight: A Phase-Aware, Infrastructure-Centric Video Benchmark for Traffic Crash Scene Understanding and Reasoning

· 2026 · cs.CV · arXiv 2604.08457

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Cooperative autonomous driving requires traffic scene understanding from both vehicle and infrastructure perspectives. While vision-language models (VLMs) show strong general reasoning capabilities, their performance in safety-critical traffic scenarios remains insufficiently evaluated due to the ego-vehicle focus of existing benchmarks. To bridge this gap, we present \textbf{CrashSight}, a large-scale vision-language benchmark for roadway crash understanding using real-world roadside camera data. The dataset comprises 250 crash videos, annotated with 13K multiple-choice question-answer pairs organized under a two-tier taxonomy. Tier 1 evaluates the visual grounding of scene context and involved parties, while Tier 2 probes higher-level reasoning, including crash mechanics, causal attribution, temporal progression, and post-crash outcomes. We benchmark 8 state-of-the-art VLMs and show that, despite strong scene description capabilities, current models struggle with temporal and causal reasoning in safety-critical scenarios. We provide a detailed analysis of failure scenarios and discuss directions for improving VLM crash understanding. The benchmark provides a standardized evaluation framework for infrastructure-assisted perception in cooperative autonomous driving. The CrashSight benchmark, including the full dataset and code, is accessible at https://mcgrche.github.io/crashsight.

representative citing papers

An Agentic Workflow for Detecting Personally Identifiable Information in Crash Narratives

cs.CR · 2026-04-15 · unverdicted · novelty 6.0

A hybrid agentic workflow using Presidio for structured PII and fine-tuned LLMs plus verification for names, addresses, and identifiers detects PII in crash narratives at 0.82 precision and 0.94 recall.

citing papers explorer

Showing 1 of 1 citing paper.

An Agentic Workflow for Detecting Personally Identifiable Information in Crash Narratives cs.CR · 2026-04-15 · unverdicted · none · ref 6 · internal anchor
A hybrid agentic workflow using Presidio for structured PII and fine-tuned LLMs plus verification for names, addresses, and identifiers detects PII in crash narratives at 0.82 precision and 0.94 recall.

CrashSight: A Phase-Aware, Infrastructure-Centric Video Benchmark for Traffic Crash Scene Understanding and Reasoning

fields

years

verdicts

representative citing papers

citing papers explorer