PrecisionDiff is a differential testing framework that uncovers widespread precision-induced behavioral disagreements in aligned LLMs, including safety-critical jailbreak divergences across precision formats.
Your fix is my exploit: Enabling comprehensive dl library api fuzzing with large language models
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
citation-role summary
method 1
citation-polarity summary
years
2026 2verdicts
UNVERDICTED 2roles
method 1polarities
use method 1representative citing papers
ParityFuzz finds 64 new inconsistencies across six Solidity compilers by combining fine-grained mutation rules with reinforcement learning for differential testing.
citing papers explorer
-
Hidden Reliability Risks in Large Language Models: Systematic Identification of Precision-Induced Output Disagreements
PrecisionDiff is a differential testing framework that uncovers widespread precision-induced behavioral disagreements in aligned LLMs, including safety-critical jailbreak divergences across precision formats.
-
ParityFuzz: Finding Inconsistencies across Solidity Compilers via Fine-Grained Mutation and Differential Analysis
ParityFuzz finds 64 new inconsistencies across six Solidity compilers by combining fine-grained mutation rules with reinforcement learning for differential testing.