What’s in my big data? In Proceedings of the 12th International Conference on Learning Representations (ICLR 2024), 2024

Yanai Elazar, Akshita Bhagia, Ian Helgi Magnusson, Abhilasha Ravichander, Dustin Schwenk, Alane Suhr, Pete Walsh, Dirk Groeneveld, Luca Soldaini, Sameer Singh, Hannaneh Hajishirzi, Noah Smith, Jesse Dodge · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

What Is The Political Content in LLMs' Pre- and Post-Training Data?

cs.CL · 2025-09-26 · unverdicted · novelty 5.0

Training data for open LLMs is systematically left-leaning, with pre-training corpora containing more political material than post-training data and model stances aligning with data distributions.

citing papers explorer

Showing 1 of 1 citing paper.

What Is The Political Content in LLMs' Pre- and Post-Training Data? cs.CL · 2025-09-26 · unverdicted · none · ref 9
Training data for open LLMs is systematically left-leaning, with pre-training corpora containing more political material than post-training data and model stances aligning with data distributions.

What’s in my big data? In Proceedings of the 12th International Conference on Learning Representations (ICLR 2024), 2024

fields

years

verdicts

representative citing papers

citing papers explorer