Mind your tone: Investigating how prompt politeness affects llm accuracy (short paper)

Dobariya, Om, Kumar, Akhil , year = · 2025 · arXiv 2510.04950

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

cs.SE · 2026-06-18 · unverdicted · novelty 5.0

A taxonomy-guided RAG system with LLMs reduces hallucinations and improves migration suggestions for Qiskit code compared to unconstrained retrieval.

Legal Reasoning Is Not Lawyering: Rethinking Legal Benchmarks for Pro Se Access to Justice

cs.CY · 2026-06-16 · unverdicted · novelty 5.0

Legal AI benchmarks must evaluate robustness to pro se litigant inputs rather than expert-preprocessed ones to support access-to-justice claims.

Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Circuits

cs.CL · 2026-05-29 · unverdicted · novelty 5.0

Toxic prompt perturbations reduce LLM factual accuracy on three benchmarks and selectively amplify perturbation-sensitive nodes in attribution graphs.

From GPT-3 to GPT-5: Mapping their capabilities, scope, limitations, and consequences

cs.AI · 2026-04-11 · unverdicted · novelty 2.0

The GPT family has shifted from scaled text predictors to aligned multimodal tool-oriented systems, with persistent limitations like hallucination and prompt sensitivity remaining unchanged.

citing papers explorer

Showing 4 of 4 citing papers.

Qiskit Code Migration with LLMs cs.SE · 2026-06-18 · unverdicted · none · ref 144
A taxonomy-guided RAG system with LLMs reduces hallucinations and improves migration suggestions for Qiskit code compared to unconstrained retrieval.
Legal Reasoning Is Not Lawyering: Rethinking Legal Benchmarks for Pro Se Access to Justice cs.CY · 2026-06-16 · unverdicted · none · ref 18
Legal AI benchmarks must evaluate robustness to pro se litigant inputs rather than expert-preprocessed ones to support access-to-justice claims.
Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Circuits cs.CL · 2026-05-29 · unverdicted · none · ref 11
Toxic prompt perturbations reduce LLM factual accuracy on three benchmarks and selectively amplify perturbation-sensitive nodes in attribution graphs.
From GPT-3 to GPT-5: Mapping their capabilities, scope, limitations, and consequences cs.AI · 2026-04-11 · unverdicted · none · ref 20
The GPT family has shifted from scaled text predictors to aligned multimodal tool-oriented systems, with persistent limitations like hallucination and prompt sensitivity remaining unchanged.

Mind your tone: Investigating how prompt politeness affects llm accuracy (short paper)

fields

years

verdicts

representative citing papers

citing papers explorer