Distribution Shift Alignment Helps LLMs Simulate Survey Response Distributions

· 2025 · cs.AI · arXiv 2510.21977

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Large language models (LLMs) offer a promising way to simulate human survey responses, potentially reducing the cost of large-scale data collection. However, existing zero-shot methods suffer from prompt sensitivity and low accuracy, while conventional fine-tuning approaches mostly fit the training set distributions and struggle to produce results more accurate than the training set itself, which deviates from the original goal of using LLMs to simulate survey responses. Building on this observation, we introduce Distribution Shift Alignment (DSA), a two-stage fine-tuning method that aligns both the output distributions and the distribution shifts across different backgrounds. By learning how these distributions change rather than fitting training data, DSA can provide results substantially closer to the true distribution than the training data. Empirically, DSA consistently outperforms other methods on five public survey datasets. We further conduct a comprehensive comparison covering accuracy, robustness, and data savings. DSA reduces the required real data by 53.48-69.12%, demonstrating its effectiveness and efficiency in survey simulation.

representative citing papers

Beyond the Mean: Three-Axis Fidelity for Aligning LLM-Based Survey Simulators from Small Pilot Data

cs.CL · 2026-06-27 · unverdicted · novelty 5.0

Fine-tuning LLMs on small pilot survey data balances structural, marginal, and individual fidelity better than prompting or rectification, but fidelity levels vary across subsamples in a COVID-19 misinformation case study.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond the Mean: Three-Axis Fidelity for Aligning LLM-Based Survey Simulators from Small Pilot Data cs.CL · 2026-06-27 · unverdicted · none · ref 5 · internal anchor
Fine-tuning LLMs on small pilot survey data balances structural, marginal, and individual fidelity better than prompting or rectification, but fidelity levels vary across subsamples in a COVID-19 misinformation case study.

Distribution Shift Alignment Helps LLMs Simulate Survey Response Distributions

fields

years

verdicts

representative citing papers

citing papers explorer