Language models show slight sensitivity to gender perturbations in fairytale QA but gain robustness after fine-tuning on counterfactual anti-stereotypical examples.
In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing , pages 9496–9521, Abu Dhabi, United Arab Emirates
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2023 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Will the Prince Get True Love's Kiss? On the Model Sensitivity to Gender Perturbation over Fairytale Texts
Language models show slight sensitivity to gender perturbations in fairytale QA but gain robustness after fine-tuning on counterfactual anti-stereotypical examples.