Do Large Language Model Voters Strategize? An Oracle-Based Benchmark for Manipulation under Voting Rules
Pith reviewed 2026-06-26 13:11 UTC · model grok-4.3
The pith
An oracle benchmark supplies exact ground truth on whether LLM voters can manipulate election outcomes under five voting rules.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper establishes an oracle-based benchmark in which an exact oracle enumerates every feasible report by the LLM voter, computes the sincere outcome, identifies all profitable reports, and records the best achievable outcome. This benchmark covers plurality, Borda, approval, instant-runoff voting, and Copeland-style pairwise majority voting. Prompt conditions separate sincere, strategic, civic, and expert framings. The registered core design fixes a single electorate size, uses 600 balanced election instances, and produces 9,600 model-prompt responses across four model configurations and four prompt conditions. It reports exact oracle-calibration baselines rather than imputing performanc
What carries the argument
The exact oracle that enumerates every feasible ballot report by the LLM voter and evaluates its outcome under each deterministic voting rule to identify profitable manipulations.
If this is right
- The benchmark enables measurement of manipulation discovery, optimal manipulation, false manipulation, near-miss, and invalid-ballot rates for LLMs.
- It produces oracle-calibration baselines that bound and contextualize model results on the exact task.
- It separates performance across sincere, strategic, civic, and expert prompt conditions.
- It reduces strategic-voting behavior to exact counterfactual evaluation, turning the question of sincere versus strategic voting into a reproducible experiment.
- The fixed design with 600 instances supports direct comparisons across model configurations without reliance on external studies.
Where Pith is reading between the lines
- If models frequently reach the oracle's best outcome, it would indicate they can reason about how their reported ballot affects the collective result given others' votes.
- The oracle method could be extended to measure strategic behavior by LLMs in other multi-agent decision settings that have well-defined outcome functions.
- Persistent gaps between model performance and oracle optimum might highlight limits in LLMs' ability to simulate full counterfactual election results.
- The benchmark's ground-truth approach could apply to testing whether other AI systems can identify profitable deviations in mechanism-design problems.
Load-bearing premise
The assumption that an exact oracle can enumerate every feasible report and correctly compute the outcome under each of the five deterministic voting rules for any ballot the LLM might submit.
What would settle it
An election instance in which the oracle reports no profitable deviation exists, yet a hand calculation shows that one of the LLM's possible ballots would produce a strictly better outcome for the voter under the rule.
read the original abstract
Strategic voting is a canonical failure mode for collective choice: a voter may obtain a more preferred outcome by reporting a ballot that differs from its true preferences. This paper introduces an oracle-based benchmark for testing whether large language model (LLM) voters can discover and execute such manipulations. Each instance gives an LLM voter a true preference ranking, the other voters' ballots, a deterministic voting rule, and a prompt condition. An exact oracle enumerates every feasible report by the LLM voter, computes the sincere outcome, identifies all profitable reports, and records the best achievable outcome. The benchmark therefore supplies ground truth for strategic success without human labels or subjective grading of explanations. The benchmark covers plurality, Borda, approval, instant-runoff voting, and Copeland-style pairwise majority voting; prompt conditions separate sincere, strategic, civic, and expert framings. To keep the primary study defensible while preserving the main comparisons, the registered core design fixes a single electorate size, uses 600 balanced election instances, and produces 9,600 model--prompt responses when run with four model configurations and four prompt conditions. Because existing peer-reviewed work does not report manipulation discovery, optimal manipulation, false manipulation, near-miss, or invalid-ballot rates for this exact task, we do not impute LLM performance from unrelated studies. Instead, we report exact oracle-calibration baselines that bound and contextualize subsequent model results. By reducing strategic-voting behavior to exact counterfactual evaluation, the benchmark turns the question ``Do LLM voters vote sincerely or strategically?'' into a reproducible social-choice experiment.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces an oracle-based benchmark to test whether large language models engage in strategic (manipulative) voting. Each instance supplies an LLM with true preferences, other voters' ballots, one of five deterministic rules (plurality, Borda, approval, IRV, Copeland), and a prompt condition (sincere/strategic/civic/expert). An exact oracle enumerates every feasible report, computes the sincere outcome, flags all profitable deviations, and records the best achievable outcome, thereby supplying objective ground truth for 600 balanced instances without human labels or subjective grading. The registered core design fixes electorate size and produces 9,600 model-prompt responses across four models; oracle-calibration baselines are reported to bound subsequent results.
Significance. If implemented as described, the benchmark supplies a reproducible, label-free method for measuring manipulation discovery, optimal manipulation, false manipulation, and invalid-ballot rates under standard voting rules. The exact-oracle construction and provision of calibration baselines are concrete strengths that allow direct, falsifiable comparison of LLM behavior against known ground truth, addressing a gap where prior work does not report these metrics.
minor comments (2)
- [Abstract] Abstract: the claim that the oracle 'enumerates every feasible report' would be strengthened by an explicit statement of the number of candidates (or preference-space size) assumed in the 600 instances, as this directly determines computational feasibility of exhaustive enumeration.
- The manuscript should clarify in the methods section how the oracle treats ballots that violate the voting rule's validity constraints (e.g., non-ranked candidates under IRV) and whether such ballots are simply discarded or trigger a separate error category.
Simulated Author's Rebuttal
We thank the referee for the accurate summary of the oracle benchmark, its emphasis on the label-free ground-truth construction, and the recommendation for minor revision. No major comments were raised in the report.
Circularity Check
No significant circularity
full rationale
The paper defines a benchmark whose ground truth is supplied by an external oracle that exhaustively enumerates ballots and applies standard deterministic voting rules (plurality, Borda, approval, IRV, Copeland) to compute sincere and manipulated outcomes. No equations, fitted parameters, or derivations are present that reduce the claimed ground truth to the benchmark's own inputs by construction. The design explicitly avoids imputing results from prior studies and instead reports oracle-calibration baselines. No self-citation is load-bearing for the central claim, and the construction is self-contained against the external voting rules.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The oracle can enumerate every feasible report and compute the sincere and manipulated outcomes exactly for plurality, Borda, approval, instant-runoff, and Copeland rules.
Reference graph
Works this paper leans on
-
[1]
arXiv preprint arXiv:2602.02932 (2026)
Amiri-Margavi, A., Gharagozlou, A., Davodi, A.G., Davoudi, S.P.M., Balyani, H.H.: Equal ac- cess, unequal interaction: A counterfactual audit of llm fairness. arXiv preprint arXiv:2602.02932 (2026)
arXiv 2026
-
[2]
In: IFIP International Conference on Artificial Intelligence Applications and Innovations
Amiri-Margavi, A., Jebellat, I., Jebellat, E., Davoudi, S.P.M.: Enhancing answer reliability through inter-model consensus of large language models. In: IFIP International Conference on Artificial Intelligence Applications and Innovations. pp. 299–316. Springer (2025)
2025
-
[3]
arXiv preprint arXiv:2606.01456 (2026)
Balyani, H.H., Davoudi, S.P.M., Amiri-Margavi, A., Davodi, A.G., Gharagozlou, A.: Truthful ai advisors: A pre-specified benchmark for large language model honesty under preference misalignment. arXiv preprint arXiv:2606.01456 (2026)
Pith/arXiv arXiv 2026
-
[4]
Social Choice and Welfare8(4), 341–354 (1991) 18
Bartholdi III, J.J., Orlin, J.B.: Single transferable vote resists strategic voting. Social Choice and Welfare8(4), 341–354 (1991) 18
1991
-
[5]
Social choice and welfare6(3), 227–241 (1989)
Bartholdi III, J.J., Tovey, C.A., Trick, M.A.: The computational difficulty of manipulating an election. Social choice and welfare6(3), 227–241 (1989)
1989
-
[6]
Cambridge University Press (2016)
Brandt, F., Conitzer, V., Endriss, U., Lang, J., Procaccia, A.D.: Handbook of computational social choice. Cambridge University Press (2016)
2016
-
[7]
arXiv preprint arXiv:2404.10271 (2024)
Conitzer, V., Freedman, R., Heitzig, J., Holliday, W.H., Jacobs, B.M., Lambert, N., Mossé, M., Pacuit, E., Russell, S., Schoelkopf, H., et al.: Social choice should guide ai alignment in dealing with diverse human feedback. arXiv preprint arXiv:2404.10271 (2024)
arXiv 2024
-
[8]
Conitzer, V., Sandholm, T., Lang, J.: When are elections with few candidates hard to manipulate? Journal of the ACM (JACM)54(3), 14–es (2007)
2007
-
[9]
In: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Davoodi, A.G., Davoudi, S.P.M., Pezeshkpour, P.: Llms are not intelligent thinkers: Introducing mathematical topic tree benchmark for comprehensive evaluation of llms. In: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). pp. 3127...
2025
-
[10]
In: 2025 3rd International Conference on Foundation and Large Language Models (FLLM)
Davoudi, S.P.M., Davodi, A.G., Margavi, A.A., Jafari, M.: Collective reasoning among llms: A framework for answer validation without ground truth. In: 2025 3rd International Conference on Foundation and Large Language Models (FLLM). pp. 343–349. IEEE (2025)
2025
-
[11]
Econometrica: journal of the Econometric Society pp
Gibbard, A.: Manipulation of voting schemes: a general result. Econometrica: journal of the Econometric Society pp. 587–601 (1973)
1973
-
[12]
In: Proceedings of the AAAI Conference on Artificial Intelligence
Holliday, W.H., Kristoffersen, A., Pacuit, E.: Learning to manipulate under limited information. In: Proceedings of the AAAI Conference on Artificial Intelligence. vol. 39, pp. 13915–13925 (2025)
2025
-
[13]
arXiv preprint arXiv:2605.12991 (2026)
Kumarappan, A., Mujoo, A.: Not just rlhf: Why alignment alone won’t fix multi-agent sycophancy. arXiv preprint arXiv:2605.12991 (2026)
Pith/arXiv arXiv 2026
-
[14]
Journal of economic theory10(2), 187–217 (1975)
Satterthwaite, M.A.: Strategy-proofness and arrow’s conditions: Existence and correspondence theorems for voting procedures and social welfare functions. Journal of economic theory10(2), 187–217 (1975)
1975
-
[15]
Cambridge University Press (2005)
Taylor, A.D.: Social choice and the mathematics of manipulation. Cambridge University Press (2005)
2005
-
[16]
Walsh, T.: Where are the hard manipulation problems? Journal of Artificial Intelligence Research42, 1–29 (2011)
2011
-
[17]
In: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society
Yang, J.C., Dailisan, D., Korecki, M., Hausladen, C.I., Helbing, D.: Llm voting: Human choices and ai collective decision-making. In: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society. vol. 7, pp. 1696–1708 (2024) 19
2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.