Ontheexploitability of instruction tuning

ManliShu,JiongxiaoWang,ChenZhu,JonasGeiping,ChaoweiXiao,andTomGoldstein · 2023 · arXiv 2306.17194

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Self-Study Reconsidered: The Hidden Fragility of Learning from Self-Generated QA

cs.AI · 2026-06-30 · unverdicted · novelty 7.0

Self-generated QA supervision for language models is fragile due to non-uniform question selection and instruction compliance during answering, with mitigations that reduce compliance from 88% to 13%.

citing papers explorer

Showing 1 of 1 citing paper.

Self-Study Reconsidered: The Hidden Fragility of Learning from Self-Generated QA cs.AI · 2026-06-30 · unverdicted · none · ref 28
Self-generated QA supervision for language models is fragile due to non-uniform question selection and instruction compliance during answering, with mitigations that reduce compliance from 88% to 13%.

Ontheexploitability of instruction tuning

fields

years

verdicts

representative citing papers

citing papers explorer