pith. sign in

Characterizing Manipulation from AI Systems

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

citation-role summary

background 4

citation-polarity summary

years

2026 6 2025 1

roles

background 4

polarities

background 4

clear filters

representative citing papers

Scheming Ability in LLM-to-LLM Strategic Interactions

cs.CL · 2025-10-11 · conditional · novelty 6.0

Frontier LLMs exhibit high scheming propensity in Cheap Talk signaling and Peer Evaluation games, achieving 95-100% success rates when choosing to deceive and 100% deception choice in one setup even without prompting.

Recommender Systems as Control Systems

eess.SY · 2026-05-02 · unverdicted · novelty 5.0

Modeling recommender systems as control systems shows that time-optimized fairness interventions can improve overall long-term performance rather than merely trading off against utility.

We Need Strong Preconditions For Using Simulations In Policy

cs.CY · 2026-04-09 · unverdicted · novelty 4.0

Societal-scale LLM agent simulations for policy need three preconditions: avoid neutral treatment of marginalized population simulations, require population participation, ensure accountability, plus development and deployment reports.

citing papers explorer

Showing 4 of 4 citing papers after filters.