Title resolution pending

Gao, Y · 2025 · DOI 10.1073/pnas.2501660122

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

open at publisher browse 4 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Large language models converge on competitive rationality but diverge on cooperation across providers and generations

physics.soc-ph · 2026-04-01 · unverdicted · novelty 6.0

LLMs converge on competitive rationality and coordination but diverge 48-fold on cooperation, with provider identity and generational shifts as dominant factors across 38 games.

How Many Human Survey Respondents is a Large Language Model Worth? An Uncertainty Quantification Perspective

stat.ME · 2025-02-25 · unverdicted · novelty 6.0

A data-driven method adaptively selects the number of LLM-simulated responses to form confidence sets with nominal coverage for human survey parameters and equates that number to the LLM's effective human-equivalent sample size.

What Would GPT Click: Practical Effects of Human-AI Behavioral Misalignment and the Cost of Synthetic Participants in User Experience

cs.HC · 2026-05-18 · unverdicted · novelty 5.0

GPT produces click distributions significantly different from real humans in 53% of UX first-click tasks, with prompting techniques like personas and chain-of-thought failing to improve alignment.

Model-Free Assessment of Simulator Fidelity via Quantile Curves

stat.ME · 2025-12-04 · unverdicted · novelty 5.0

A model-free method builds confidence sets for latent parameters to proxy sim-to-real discrepancies and estimates the quantile function of that proxy to produce a distribution-level fidelity profile for simulators.

citing papers explorer

Showing 4 of 4 citing papers.

Large language models converge on competitive rationality but diverge on cooperation across providers and generations physics.soc-ph · 2026-04-01 · unverdicted · none · ref 30
LLMs converge on competitive rationality and coordination but diverge 48-fold on cooperation, with provider identity and generational shifts as dominant factors across 38 games.
How Many Human Survey Respondents is a Large Language Model Worth? An Uncertainty Quantification Perspective stat.ME · 2025-02-25 · unverdicted · none · ref 24
A data-driven method adaptively selects the number of LLM-simulated responses to form confidence sets with nominal coverage for human survey parameters and equates that number to the LLM's effective human-equivalent sample size.
What Would GPT Click: Practical Effects of Human-AI Behavioral Misalignment and the Cost of Synthetic Participants in User Experience cs.HC · 2026-05-18 · unverdicted · none · ref 5
GPT produces click distributions significantly different from real humans in 53% of UX first-click tasks, with prompting techniques like personas and chain-of-thought failing to improve alignment.
Model-Free Assessment of Simulator Fidelity via Quantile Curves stat.ME · 2025-12-04 · unverdicted · none · ref 10
A model-free method builds confidence sets for latent parameters to proxy sim-to-real discrepancies and estimates the quantile function of that proxy to produce a distribution-level fidelity profile for simulators.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer