Dissecting In-Context Learning of Translations in GPTs

Arul Menezes; Hany Hassan Awadalla; Vikas Raunak

arxiv: 2310.15987 · v1 · pith:ASVSGGC7new · submitted 2023-10-24 · 💻 cs.CL · cs.AI

Dissecting In-Context Learning of Translations in GPTs

Vikas Raunak , Hany Hassan Awadalla , Arul Menezes This is my paper

classification 💻 cs.CL cs.AI

keywords learningtranslationsin-contextperturbationtranslationfew-shotgpt-3prompting

0 comments

read the original abstract

Most of the recent work in leveraging Large Language Models (LLMs) such as GPT-3 for Machine Translation (MT) has focused on selecting the few-shot samples for prompting. In this work, we try to better understand the role of demonstration attributes for the in-context learning of translations through perturbations of high-quality, in-domain demonstrations. We find that asymmetric perturbation of the source-target mappings yield vastly different results. We show that the perturbation of the source side has surprisingly little impact, while target perturbation can drastically reduce translation quality, suggesting that it is the output text distribution that provides the most important learning signal during in-context learning of translations. We propose a method named Zero-Shot-Context to add this signal automatically in Zero-Shot prompting. We demonstrate that it improves upon the zero-shot translation performance of GPT-3, even making it competitive with few-shot prompted translations.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Semantic Needles in Document Haystacks: Sensitivity Testing of LLM-as-a-Judge Similarity Scoring
cs.CL 2026-04 unverdicted novelty 7.0

LLMs exhibit positional bias and context-dependent scoring patterns when judging document similarity, with each model showing a stable scoring fingerprint but a shared hierarchy of sensitivity to different semantic pe...