likely−invalid

Generate diverse, challenging edge−case candidates

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

VeriScale: Adversarial Test-Suite Scaling for Verifiable Code Generation

cs.LG · 2026-05-21 · unverdicted · novelty 6.0

VeriScale adversarially scales test suites for the Verina benchmark into VerinaPlus (83x larger) and VerinaLite (14x variant) that expose hidden LLM weaknesses on SpecGen and CodeGen tasks.

citing papers explorer

Showing 1 of 1 citing paper.

VeriScale: Adversarial Test-Suite Scaling for Verifiable Code Generation cs.LG · 2026-05-21 · unverdicted · none · ref 9
VeriScale adversarially scales test suites for the Verina benchmark into VerinaPlus (83x larger) and VerinaLite (14x variant) that expose hidden LLM weaknesses on SpecGen and CodeGen tasks.

likely−invalid

fields

years

verdicts

representative citing papers

citing papers explorer