Title resolution pending

Amal Akli, Guillaume Haben, Sarra Habchi, Mike Papadakis, Yves Le Traon

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Exploring Code Analysis: Zero-Shot Insights on Syntax and Semantics with LLMs

cs.SE · 2023-05-20 · unverdicted · novelty 7.0

LLMs achieve strong results on syntax parsing tasks but show limited and variable performance on dynamic reasoning, with a clear performance hierarchy across model scales.

NeuroFlake: A Neuro-Symbolic LLM Framework for Flaky Test Classification

cs.SE · 2026-05-12 · unverdicted · novelty 6.0

NeuroFlake integrates discriminative token mining into LLMs to classify flaky tests, raising F1-score to 69.34% on FlakeBench while showing greater robustness to semantic-preserving perturbations than prior methods.

Predicting Intermittent Job Failure Categories for Diagnosis Using Few-Shot Fine-Tuned Language Models

cs.SE · 2026-01-29 · unverdicted · novelty 5.0

FlaXifyer applies few-shot learning on pre-trained language models to categorize intermittent CI job failures from logs at 84.3% Macro F1 and 92.0% Top-2 accuracy using 12 examples per category, with LogSift reducing log review effort by 74.4%.

citing papers explorer

Showing 3 of 3 citing papers.

Exploring Code Analysis: Zero-Shot Insights on Syntax and Semantics with LLMs cs.SE · 2023-05-20 · unverdicted · none · ref 17
LLMs achieve strong results on syntax parsing tasks but show limited and variable performance on dynamic reasoning, with a clear performance hierarchy across model scales.
NeuroFlake: A Neuro-Symbolic LLM Framework for Flaky Test Classification cs.SE · 2026-05-12 · unverdicted · none · ref 1
NeuroFlake integrates discriminative token mining into LLMs to classify flaky tests, raising F1-score to 69.34% on FlakeBench while showing greater robustness to semantic-preserving perturbations than prior methods.
Predicting Intermittent Job Failure Categories for Diagnosis Using Few-Shot Fine-Tuned Language Models cs.SE · 2026-01-29 · unverdicted · none · ref 2
FlaXifyer applies few-shot learning on pre-trained language models to categorize intermittent CI job failures from logs at 84.3% Macro F1 and 92.0% Top-2 accuracy using 12 examples per category, with LogSift reducing log review effort by 74.4%.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer