People Make Better Edits: Measuring the Efficacy of LLM -Generated Counterfactually Augmented Data for Harmful Language Detection

Sen, Indira, Assenmacher, Dennis, Samory, Mattia, Augenstein, Isabelle, Aalst, Wil, Wagner, Claudia · 2023 · DOI 10.18653/v1/2023.emnlp-main.649

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

Want Better Synthetic Data? Steer It: Activation Steering for Low-Resource Language Generation

cs.CL · 2026-06-16 · unverdicted · novelty 6.0

Activation steering on early layers improves diversity of synthetic data for low-resource languages and often boosts downstream classifier performance compared to non-steered prompting.

citing papers explorer

Showing 1 of 1 citing paper.

Want Better Synthetic Data? Steer It: Activation Steering for Low-Resource Language Generation cs.CL · 2026-06-16 · unverdicted · none · ref 47
Activation steering on early layers improves diversity of synthetic data for low-resource languages and often boosts downstream classifier performance compared to non-steered prompting.

People Make Better Edits: Measuring the Efficacy of LLM -Generated Counterfactually Augmented Data for Harmful Language Detection

fields

years

verdicts

representative citing papers

citing papers explorer