LLMs match or exceed state-of-the-art traditional methods for stabilizing numerical expressions in scientific software, succeeding on 97.9% of expressions where baselines fail to improve accuracy, but struggle with control flow and high-precision literals.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.SE 2years
2026 2roles
baseline 1polarities
baseline 1representative citing papers
TARIPlay detects and tracks viable interactive areas in AR playback videos using stability and visibility criteria, achieving 55.8% branch coverage on AR-related code versus 41.98% for Monkey across four apps and nine videos.
citing papers explorer
-
Assessing Large Language Models for Stabilizing Numerical Expressions in Scientific Software
LLMs match or exceed state-of-the-art traditional methods for stabilizing numerical expressions in scientific software, succeeding on 97.9% of expressions where baselines fail to improve accuracy, but struggle with control flow and high-precision literals.
-
TARIPlay: A Test Framework for AR Applications based on Interactive Area Tracking in Playback Videos
TARIPlay detects and tracks viable interactive areas in AR playback videos using stability and visibility criteria, achieving 55.8% branch coverage on AR-related code versus 41.98% for Monkey across four apps and nine videos.