BadScientist: Can a Research Agent Write Convincing but Unsound Papers that Fool LLM Reviewers?

Basel Alomair; Fengqing Jiang; Luyao Niu; Radha Poovendran; Yichen Feng; Yuetai Li

arxiv: 2510.18003 · v2 · pith:5P4ZJZAEnew · submitted 2025-10-20 · 💻 cs.CR · cs.AI · cs.CY

Fengqing Jiang, Yichen Feng, Yuetai Li, Luyao Niu, Basel Alomair, Radha Poovendran

classification 💻 cs.CR cs.AI cs.CY

keywords research, review, reviewers, systems, badscientist, framework, integrity, real

The convergence of LLM-powered research assistants and AI-based peer review systems creates a critical vulnerability: fully automated publication loops where AI-generated research is evaluated by AI reviewers without human oversight. We investigate this through BadScientist, a framework that evaluates whether fabrication-oriented paper generation agents can deceive multi-model LLM review systems. Our generator employs presentation-manipulation strategies requiring no real experiments. We develop a rigorous evaluation framework with formal error guarantees (concentration bounds and calibration analysis), calibrated on real data. Our results reveal systematic vulnerabilities: fabricated papers achieve acceptance rates up to . Critically, we identify concern-acceptance conflict: reviewers frequently flag integrity issues yet assign acceptance-level scores. Our mitigation strategies show only marginal improvements, with detection accuracy barely exceeding random chance. Despite provably sound aggregation mathematics, integrity checking systematically fails, exposing fundamental limitations in current AI-driven review systems and underscoring the urgent need for defense-in-depth safeguards in scientific publishing.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

No Hidden Prompts Needed! You Can Game AI Peer Review with Presentation-Only Revisions
cs.CL 2026-06 conditional novelty 7.0

Presentation-only revisions guided by AI feedback can boost AI reviewer scores by over 1 point on average, with a 75% success rate across tested systems.

The Red Queen Gödel Machine: Co-Evolving Agents and Their Evaluators
cs.LG 2026-06 unverdicted novelty 6.0

The Red Queen Gödel Machine organizes recursive self-improvement into epochs with fixed intra-epoch evaluation while allowing utility evolution at boundaries, yielding reported gains on coding, paper writing, and proo...

Agon: An Autonomous Large-Scale Omnidisciplinary Research System Built on Prompt Economy
cs.SE 2026-06 unverdicted novelty 5.0

Agon is a new autonomous research system using prompt economy loops across 444 iterations to demonstrate scalable omnidisciplinary research and a taxonomy separating machine-fixable failures from those needing human judgment.