Early Comparative Evaluation of Transformer Models for Multilingual Software Vulnerability Detection

Alexios Mylonas; Fiza Naseer; Javad Khan; Muhammad Yaqoob

arxiv: 2606.10925 · v1 · pith:VO4KC4NBnew · submitted 2026-06-09 · 💻 cs.SE


classification 💻 cs.SE

keywords: detection, vulnerability, across, comparative, early, evaluation, languages, multilingual



Software vulnerability detection is increasingly important as modern applications combine multiple programming languages. This paper presents an early comparative evaluation of BERT, RoBERTa, and CodeBERT for binary vulnerability detection across HTML, Python, JavaScript, and PHP using the CVEFixes dataset and language-wise three-fold stratified cross-validation. The results show clear performance differences across languages, indicating that multilingual vulnerability detection requires more language-aware and robust transformer-based modelling strategies.
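
The language-wise three-fold stratified cross-validation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the `(language, label)` sample layout, and the convention that label 1 means "vulnerable" are assumptions made for illustration, and model fine-tuning is omitted entirely. The idea shown is that folds are built separately for each language, and within a language each class is spread evenly over the folds.

```python
import random
from collections import defaultdict


def stratified_folds(labels, k=3, seed=0):
    """Assign sample positions to k folds so each fold keeps a similar class ratio."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for pos, y in enumerate(labels):
        by_class[y].append(pos)
    folds = [[] for _ in range(k)]
    for y in sorted(by_class):
        positions = by_class[y]
        rng.shuffle(positions)
        # Deal the shuffled members of this class round-robin across the folds.
        for i, pos in enumerate(positions):
            folds[i % k].append(pos)
    return folds


def language_wise_splits(samples, k=3, seed=0):
    """Yield (language, fold_no, train_idx, test_idx), with folds built per language.

    `samples` is a list of (language, label) pairs; this layout is hypothetical.
    """
    by_lang = defaultdict(list)
    for idx, (lang, _) in enumerate(samples):
        by_lang[lang].append(idx)
    for lang in sorted(by_lang):
        idxs = by_lang[lang]
        labels = [samples[i][1] for i in idxs]
        folds = stratified_folds(labels, k, seed)
        for f in range(k):
            test = [idxs[j] for j in folds[f]]
            train = [idxs[j] for g in range(k) if g != f for j in folds[g]]
            yield lang, f, train, test


if __name__ == "__main__":
    # Toy data: label 1 = vulnerable, 0 = not vulnerable (assumed convention).
    toy = [("Python", 1)] * 6 + [("Python", 0)] * 9 + [("PHP", 1)] * 3 + [("PHP", 0)] * 6
    for lang, fold, train, test in language_wise_splits(toy):
        positives = sum(toy[i][1] for i in test)
        print(lang, fold, len(train), len(test), positives)
```

In a real evaluation, each model under comparison would be trained on `train_idx` and scored on `test_idx` for every language and fold, so that per-language scores can be compared directly.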


