Evaluating and explaining training strategies for zero-shot cross-lingual news sentiment analysis

Andra\v{z} Pelicon; Boshko Koloski; Luka Andren\v{s}ek; Matthew Purver; Nada Lavra\v{c}; Senja Pollak

arxiv: 2409.20054 · v1 · pith:W6GXSYPSnew · submitted 2024-09-30 · 💻 cs.CL · cs.AI

Evaluating and explaining training strategies for zero-shot cross-lingual news sentiment analysis

Luka Andren\v{s}ek , Boshko Koloski , Andra\v{z} Pelicon , Nada Lavra\v{c} , Senja Pollak , Matthew Purver This is my paper

classification 💻 cs.CL cs.AI

keywords cross-lingualnovelsentimenttraininggivingin-contextincludinglanguage

0 comments

read the original abstract

We investigate zero-shot cross-lingual news sentiment detection, aiming to develop robust sentiment classifiers that can be deployed across multiple languages without target-language training data. We introduce novel evaluation datasets in several less-resourced languages, and experiment with a range of approaches including the use of machine translation; in-context learning with large language models; and various intermediate training regimes including a novel task objective, POA, that leverages paragraph-level information. Our results demonstrate significant improvements over the state of the art, with in-context learning generally giving the best performance, but with the novel POA approach giving a competitive alternative with much lower computational overhead. We also show that language similarity is not in itself sufficient for predicting the success of cross-lingual transfer, but that similarity in semantic content and structure can be equally important.

This paper has not been read by Pith yet.

Evaluating and explaining training strategies for zero-shot cross-lingual news sentiment analysis

discussion (0)