GPT-5 leads English CS certifications, Qwen-Plus leads Chinese ones, DeepSeek-R1 is most balanced, and Llama-3.3 lags in higher reasoning and robustness, with performance dropping on complex questions.
1 Pith paper cites this work. Polarity classification is still indexing.
fields: cs.CY (1)
years: 2026 (1)
verdicts: UNVERDICTED (1)
representative citing papers
Are LLMs Ready for Computer Science Education? A Cross-Domain, Cross-Lingual and Cognitive-Level Evaluation Using Professional Certification Exams