Four tools competed to expose cases where an LLM car manual assistant fails to mention warnings, scored on effectiveness and test diversity.
Ahmed, Ludwig Otto Baader, Firas Bayram, Siri Jagstedt, and Peter Magnusson
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.AI 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
DeepTest Tool Competition 2026: Benchmarking an LLM-Based Automotive Assistant
Four tools competed to expose cases where an LLM car manual assistant fails to mention warnings, scored on effectiveness and test diversity.