A hitchhiker’s guide to statistical tests for assessing randomized algorithms in software engineering.Software Testing, Verification and Reliability, 24(3):219–250

Andrea Arcuri, Lionel Briand · 2014 · DOI 10.1002/stvr.1486

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

open at publisher browse 6 citing papers

citation-role summary

background 1 method 1

citation-polarity summary

background 1 use method 1

representative citing papers

The Alignment Problem in Constrained Code Generation

cs.SE · 2026-06-19 · unverdicted · novelty 7.0

Incomplete constrainers in constrained decoding push LLMs into low-probability program regions, making unconstrained decoding outperform constrained decoding on functional correctness across seven models and three benchmarks.

FIESTA: Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms

cs.LG · 2019-06-28 · unverdicted · novelty 7.0

FIESTA uses bandit algorithms to adaptively decide how many seeds and splits to run for each candidate model, focusing effort on promising ones while providing guarantees on selecting the optimal model.

Rethinking Software Empirical Studies with Structural Causal Models

cs.SE · 2026-05-27 · unverdicted · novelty 6.0

CausalSE applies SCMs and propensity score matching to reveal that causal analysis of prompt engineering on GPT-3 code generation often finds no significant effect where associational analysis suggests improvement.

Robust Mutation Analysis of Quantum Programs Under Noise

cs.SE · 2026-05-13 · conditional · novelty 6.0

Noise from quantum hardware simulators significantly alters mutant detection distances, making equivalent mutants harder to separate from faults, with output-distribution metrics reaching 73.03% accuracy and 74.89% F1-score under device-specific thresholds.

MR-Scout: Automated Synthesis of Metamorphic Relations from Existing Test Cases

cs.SE · 2023-04-15 · unverdicted · novelty 6.0

MR-Scout extracts over 11,000 metamorphic-relation-encoded test cases from 701 OSS projects, codifies 97% of them as high-quality generators, and shows they raise line coverage by 13.52% and mutation score by 9.42% on programs that already have developer tests.

Search-Based Software Engineering and AI Foundation Models: Current Landscape and Future Roadmap

cs.SE · 2025-05-26 · unverdicted · novelty 4.0

A research roadmap analyzing the current state of search-based software engineering with foundation models, outlining challenges and directions across three integration aspects.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Search-Based Software Engineering and AI Foundation Models: Current Landscape and Future Roadmap cs.SE · 2025-05-26 · unverdicted · none · ref 10
A research roadmap analyzing the current state of search-based software engineering with foundation models, outlining challenges and directions across three integration aspects.

A hitchhiker’s guide to statistical tests for assessing randomized algorithms in software engineering.Software Testing, Verification and Reliability, 24(3):219–250

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer