Language models show slight sensitivity to gender perturbations in fairytale QA but gain robustness after fine-tuning on counterfactual anti-stereotypical examples.
Association for Computing Machinery
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2023 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Will the Prince Get True Love's Kiss? On the Model Sensitivity to Gender Perturbation over Fairytale Texts
Language models show slight sensitivity to gender perturbations in fairytale QA but gain robustness after fine-tuning on counterfactual anti-stereotypical examples.