CAN-QA: A Question-Answering Benchmark for Reasoning over In-Vehicle CAN Traffic

· 2026 · cs.CR · arXiv 2604.24935

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

The Controller Area Network (CAN) is a safety-critical in-vehicle communication protocol that lacks built-in security mechanisms, making intrusion detection essential. Existing approaches predominantly formulate CAN intrusion detection as a classification task, mapping complex traffic patterns to attack labels. However, this formulation abstracts away the temporal and relational structure of CAN traffic and misaligns with real-world forensic workflows, which require systematic reasoning about traffic behavior. To address this gap, we introduce CAN-QA, the first benchmark that reformulates CAN traffic analysis as a question-answering (QA) task. CAN-QA converts raw CAN logs into temporally segmented windows and applies deterministic rule-based templates to generate natural-language questions paired with automatically derived ground-truth answers. The resulting dataset comprises 33,128 QA pairs across 10 categories, each targeting distinct semantic and temporal properties of CAN traffic. Using CAN-QA, we evaluate large language models across both True/False and multiple-choice formats. Our results indicate that, although these models capture superficial statistical regularities, they struggle with temporal reasoning, multi-condition inference, and higher-level behavioral interpretation. Our code is available at https://github.com/Kriiiiss/CAN-QA.

representative citing papers

CyberMaskQA: A Privacy-Aware Benchmark for Evaluating Large Language Models in Cybersecurity Question Answering

cs.CR · 2026-05-23 · unverdicted · novelty 7.0

CyberMaskQA is a new privacy-aware QA benchmark for cybersecurity that annotates private entities in realistic organizational scenarios with causal dependencies to jointly evaluate reasoning accuracy and masking performance.

citing papers explorer

Showing 1 of 1 citing paper after filters.

CyberMaskQA: A Privacy-Aware Benchmark for Evaluating Large Language Models in Cybersecurity Question Answering cs.CR · 2026-05-23 · unverdicted · none · ref 2 · internal anchor
CyberMaskQA is a new privacy-aware QA benchmark for cybersecurity that annotates private entities in realistic organizational scenarios with causal dependencies to jointly evaluate reasoning accuracy and masking performance.

CAN-QA: A Question-Answering Benchmark for Reasoning over In-Vehicle CAN Traffic

fields

years

verdicts

representative citing papers

citing papers explorer