Valid sur- vey simulations with limited human data: The roles of prompting, fine-tuning, and rectification

Stefan Krsteski, Giuseppe Russo, Serina Chang, Robert West, Kristina Gligorić · 2025 · arXiv 2510.11408

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

representative citing papers

Rectification Difficulty and Optimal Sample Allocation in LLM-Augmented Surveys

cs.AI · 2026-04-19 · unverdicted · novelty 7.0

A method using predicted rectification difficulty for optimal human sample allocation in LLM-augmented surveys captures 61-79% of theoretical efficiency gains and reduces MSE by 11% on two datasets without pilot data.

Adaptive Budget Allocation in LLM-Augmented Surveys

cs.LG · 2026-04-14 · unverdicted · novelty 7.0

An adaptive budget allocation algorithm for LLM-augmented surveys learns question-level LLM reliability on the fly from human labels and reduces labeling waste from 10-12% to 2-6% compared to uniform allocation.

Text-Based Personas for Simulating User Privacy Decisions

cs.CR · 2026-03-20 · unverdicted · novelty 7.0

Narriva generates behavior-grounded text personas from survey data that achieve up to 87% accuracy in predicting privacy decisions, improve 6-17 points over baselines, cut tokens by 80-95%, and reproduce aggregate distributions across different studies.

Graph-Based Alternatives to LLMs for Human Simulation

cs.CL · 2025-11-03 · conditional · novelty 6.0

GEMS formulates close-ended human-behavior simulation as link prediction on a heterogeneous graph and matches or exceeds LLM performance with three orders of magnitude fewer parameters across three datasets and three evaluation settings.

Benchmarking LLMs for Community Governance Simulation with Life-history Narratives

cs.CY · 2026-05-22 · unverdicted · novelty 5.0

Introduces a 1.2-million-character narrative dataset from 92 residents, benchmarks 18 LLMs on fidelity with life-history profiles, and presents curriculum-LoRA as a low-cost personalization method that matches high-fidelity baselines at 10x lower token cost.

citing papers explorer

Showing 5 of 5 citing papers.

Rectification Difficulty and Optimal Sample Allocation in LLM-Augmented Surveys cs.AI · 2026-04-19 · unverdicted · none · ref 4
A method using predicted rectification difficulty for optimal human sample allocation in LLM-augmented surveys captures 61-79% of theoretical efficiency gains and reduces MSE by 11% on two datasets without pilot data.
Adaptive Budget Allocation in LLM-Augmented Surveys cs.LG · 2026-04-14 · unverdicted · none · ref 22
An adaptive budget allocation algorithm for LLM-augmented surveys learns question-level LLM reliability on the fly from human labels and reduces labeling waste from 10-12% to 2-6% compared to uniform allocation.
Text-Based Personas for Simulating User Privacy Decisions cs.CR · 2026-03-20 · unverdicted · none · ref 18
Narriva generates behavior-grounded text personas from survey data that achieve up to 87% accuracy in predicting privacy decisions, improve 6-17 points over baselines, cut tokens by 80-95%, and reproduce aggregate distributions across different studies.
Graph-Based Alternatives to LLMs for Human Simulation cs.CL · 2025-11-03 · conditional · none · ref 45
GEMS formulates close-ended human-behavior simulation as link prediction on a heterogeneous graph and matches or exceeds LLM performance with three orders of magnitude fewer parameters across three datasets and three evaluation settings.
Benchmarking LLMs for Community Governance Simulation with Life-history Narratives cs.CY · 2026-05-22 · unverdicted · none · ref 33
Introduces a 1.2-million-character narrative dataset from 92 residents, benchmarks 18 LLMs on fidelity with life-history profiles, and presents curriculum-LoRA as a low-cost personalization method that matches high-fidelity baselines at 10x lower token cost.

Valid sur- vey simulations with limited human data: The roles of prompting, fine-tuning, and rectification

fields

years

verdicts

representative citing papers

citing papers explorer