When ``A Helpful Assistant'' Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models

Zheng, Mingqian, Pei, Jiaxin, Logeswaran, Lajanugen, Lee, Moontae, Jurgens, David · 2024 · DOI 10.18653/v1/2024.findings-emnlp.888

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

cs.CL · 2026-04-21 · unverdicted · novelty 6.0

Each tested LLM shows its own characteristic unreliability when engaging in repair during extended math-question dialogues.

Showing 1 of 1 citing paper.

Talking to a Know-It-All GPT or a Second-Guesser Claude? How Repair reveals unreliable Multi-Turn Behavior in LLMs cs.CL · 2026-04-21 · unverdicted · none · ref 185
Each tested LLM shows its own characteristic unreliability when engaging in repair during extended math-question dialogues.