Evaluating the Performance of Large Language Models for Spanish Language in Undergraduate Admissions Exams

Bella Mart\'inez-Seis; Obdulia Pichardo-Lagunas; Pierre Baldi; Sabino Miranda

arxiv: 2312.16845 · v1 · pith:KGFG2HLLnew · submitted 2023-12-28 · 💻 cs.CL · cs.AI

Evaluating the Performance of Large Language Models for Spanish Language in Undergraduate Admissions Exams

Sabino Miranda , Obdulia Pichardo-Lagunas , Bella Mart\'inez-Seis , Pierre Baldi This is my paper

classification 💻 cs.CL cs.AI

keywords bardexamsgpt-3languagemodelssciencesacademicadmissions

0 comments

read the original abstract

This study evaluates the performance of large language models, specifically GPT-3.5 and BARD (supported by Gemini Pro model), in undergraduate admissions exams proposed by the National Polytechnic Institute in Mexico. The exams cover Engineering/Mathematical and Physical Sciences, Biological and Medical Sciences, and Social and Administrative Sciences. Both models demonstrated proficiency, exceeding the minimum acceptance scores for respective academic programs to up to 75% for some academic programs. GPT-3.5 outperformed BARD in Mathematics and Physics, while BARD performed better in History and questions related to factual information. Overall, GPT-3.5 marginally surpassed BARD with scores of 60.94% and 60.42%, respectively.

This paper has not been read by Pith yet.

Evaluating the Performance of Large Language Models for Spanish Language in Undergraduate Admissions Exams

discussion (0)