hub Canonical reference

Sánchez, Pedro Delgado-Pérez, Inmaculada Medina-Bulo, and Sergio Segura

Luca Traini, Vittorio Cortellessa, Daniele Di Pompeo, Michele Tucci · 2022 · DOI 10.1007/s10664-

Canonical reference. 100% of citing Pith papers cite this work as background.

15 Pith papers citing it

Background 100% of classified citations

open at publisher browse 15 citing papers

hub tools

JSON dossier citing papers JSON publisher DOI

citation-role summary

background 5

citation-polarity summary

background 5

representative citing papers

Understanding Bugs in Template Engine-Based Applications: Symptoms, Root Causes, and Fix Patterns

cs.SE · 2026-04-30 · unverdicted · novelty 7.0

An empirical study of 1,004 bugs in template engine-based applications finds abnormal rendering results as the most common symptom (48.61%) and documents 17 root causes with fix patterns that often involve host-side logic changes.

Gleaner: A Semantically-Rich and Efficient Online Sampler for Microservice Diagnostics

cs.SE · 2026-04-18 · unverdicted · novelty 7.0

Gleaner replaces slow graph-based trace analysis with bag-of-edges set operations plus log semantics and alarm-driven diversity to deliver faster, higher-fidelity sampling that improves RCA accuracy even at 1% rates.

AgenticFlict: A Large-Scale Dataset of Merge Conflicts in AI Coding Agent Pull Requests on GitHub

cs.SE · 2026-04-04 · accept · novelty 7.0

AgenticFlict is a public dataset of 29K+ textual merge conflicts from AI agent PRs, collected via merge simulation on 107K processed PRs and showing a 27.67% conflict rate with variation across agents.

How AI Coding Agents Modify Code: A Large-Scale Study of GitHub Pull Requests

cs.SE · 2026-01-24 · unverdicted · novelty 7.0

AI coding agents produce pull requests with substantially more commits and slightly higher description-to-diff similarity than human developers, based on analysis of 29,095 merged PRs.

JunoBench: A Benchmark Dataset of Crashes in Python Machine Learning Jupyter Notebooks

cs.SE · 2025-10-20 · unverdicted · novelty 7.0

JunoBench is the first benchmark of 111 reproducible crashes in Python ML Jupyter notebooks from Kaggle, with verified fixes and rich annotations for bug research.

Do AI Models Dream of Faster Code? An Empirical Study on LLM-Proposed Performance Improvements in Real-World Software

cs.SE · 2025-10-17 · unverdicted · novelty 7.0

LLMs propose volatile performance improvements on real-world Java tasks that lag human developers on average, showing algorithmic benchmarks overestimate capabilities.

Efficiency for Experts, Visibility for Newcomers: A Case Study of Label-Code Alignment in Kubernetes

cs.SE · 2026-03-25 · unverdicted · novelty 6.0 · 2 refs

Case study of 18,020 Kubernetes PRs shows label-diff congruence is prevalent and stable, with higher congruence linked to fewer review participants among core developers and more among one-time contributors.

MutDafny: A Mutation-Based Approach to Assess Dafny Specifications

cs.SE · 2025-11-19 · conditional · novelty 6.0

MutDafny uses 40 mutation operators on 794 real-world Dafny programs to detect weak specifications, manually confirming five such cases at a rate of one per 241 lines.

Understanding the Challenges and Opportunities of Generative AI Apps: An Empirical Study

cs.SE · 2025-06-19 · unverdicted · novelty 6.0

Large-scale review mining of 1M+ comments from 171 Gen-AI apps using an LLM framework reveals top topics plus three opportunities and three challenges for developers.

Hidden Dependencies and Component Variants in SBOM-Based Software Composition Analysis

cs.SE · 2026-04-23 · unverdicted · novelty 5.0

Hidden dependencies and component variants in SBOMs cause inconsistent vulnerability reporting and VEX handling across scanners.

Reliability of AI Bots Footprints in GitHub Actions CI/CD Workflows

cs.SE · 2026-04-20 · unverdicted · novelty 5.0

Large-scale analysis of AI bot PRs shows Copilot and Codex achieve the highest CI/CD success rates but more frequent AI contributions correlate with reduced workflow reliability.

Misleading Microbenchmarks on the Java Virtual Machines

cs.PL · 2026-05-22 · unverdicted · novelty 4.0

Microbenchmarks on the JVM can produce misleading results due to unrealistic profiles collected during isolated execution despite following JMH guidelines.

StartFlow: From Method Conception to Multi-Perspective Evaluation in UX Prototyping for Software Startups

cs.HC · 2026-05-11 · conditional · novelty 4.0

StartFlow is a new structured method that helps startup teams without UX expertise produce clearer wireflow prototypes with fewer usability problems.

Human-Machine Co-Boosted Bug Report Identification with Mutualistic Neural Active Learning

cs.SE · 2026-04-20 · unverdicted · novelty 4.0

MNAL reduces human effort in bug report labeling by up to 95.8% for readability and 196% for identifiability while improving identification performance and working with various neural models.

Classport: Designing Runtime Dependency Introspection for Java

cs.SE · 2025-10-23

citing papers explorer

Showing 1 of 1 citing paper after filters.

Misleading Microbenchmarks on the Java Virtual Machines cs.PL · 2026-05-22 · unverdicted · none · ref 50
Microbenchmarks on the JVM can produce misleading results due to unrealistic profiles collected during isolated execution despite following JMH guidelines.

Sánchez, Pedro Delgado-Pérez, Inmaculada Medina-Bulo, and Sergio Segura

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer