Title resolution pending

Mary L McHugh · 2012

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

browse 6 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

method 2

citation-polarity summary

use method 2

representative citing papers

FLARE: Agentic Coverage-Guided Fuzzing for LLM-Based Multi-Agent Systems

cs.SE · 2026-04-07 · unverdicted · novelty 7.0

FLARE extracts specifications from multi-agent LLM code and applies coverage-guided fuzzing to achieve 96.9% inter-agent and 91.1% intra-agent coverage while uncovering 56 new failures across 16 applications.

Characterizing and Mitigating False-Positive Bug Reports in the Linux Kernel

cs.SE · 2026-05-08 · conditional · novelty 6.0

False-positive bug reports in the Linux kernel consume effort comparable to real bugs and can be filtered by LLMs using retrieval-augmented generation at 88% F1.

ComPASS: Towards Personalized Agentic Social Support via Tool-Augmented Companionship

cs.CL · 2026-04-20 · unverdicted · novelty 6.0

ComPASS creates tool-augmented LLM agents for substantive social support, releases the first personalized benchmark ComPASS-Bench, and fine-tunes ComPASS-Qwen to outperform its base model while matching larger LLMs.

PrivacyAkinator: Articulating Key Privacy Design Decisions by Answering LLM-Generated Multiple-choice Questions

cs.HC · 2026-04-08 · unverdicted · novelty 6.0

PrivacyAkinator uses LLM-generated questions grounded in data-flow representations and a news-mined design space to help developers surface privacy decisions, yielding 47% more decisions identified in 73% less time than PRAM in a 24-person study.

Characterizing Faults in Agentic AI: A Taxonomy of Types, Symptoms, and Root Causes

cs.SE · 2026-03-06 · unverdicted · novelty 6.0

An empirical study of real-world issues yields a taxonomy of 34 fault types, symptoms, and root causes in agentic AI systems, validated by 145 practitioners.

Automated Classification of Human Code Review Comments with Large Language Models

cs.SE · 2026-04-26 · unverdicted · novelty 4.0

LLMs reach moderate macro-F1 scores of 0.36-0.37 when classifying code review comments into six smells and three useful intents, with one-shot examples helping some models on intent labels.

citing papers explorer

Showing 6 of 6 citing papers.

FLARE: Agentic Coverage-Guided Fuzzing for LLM-Based Multi-Agent Systems cs.SE · 2026-04-07 · unverdicted · none · ref 31
FLARE extracts specifications from multi-agent LLM code and applies coverage-guided fuzzing to achieve 96.9% inter-agent and 91.1% intra-agent coverage while uncovering 56 new failures across 16 applications.
Characterizing and Mitigating False-Positive Bug Reports in the Linux Kernel cs.SE · 2026-05-08 · conditional · none · ref 36
False-positive bug reports in the Linux kernel consume effort comparable to real bugs and can be filtered by LLMs using retrieval-augmented generation at 88% F1.
ComPASS: Towards Personalized Agentic Social Support via Tool-Augmented Companionship cs.CL · 2026-04-20 · unverdicted · none · ref 18
ComPASS creates tool-augmented LLM agents for substantive social support, releases the first personalized benchmark ComPASS-Bench, and fine-tunes ComPASS-Qwen to outperform its base model while matching larger LLMs.
PrivacyAkinator: Articulating Key Privacy Design Decisions by Answering LLM-Generated Multiple-choice Questions cs.HC · 2026-04-08 · unverdicted · none · ref 71
PrivacyAkinator uses LLM-generated questions grounded in data-flow representations and a news-mined design space to help developers surface privacy decisions, yielding 47% more decisions identified in 73% less time than PRAM in a 24-person study.
Characterizing Faults in Agentic AI: A Taxonomy of Types, Symptoms, and Root Causes cs.SE · 2026-03-06 · unverdicted · none · ref 27
An empirical study of real-world issues yields a taxonomy of 34 fault types, symptoms, and root causes in agentic AI systems, validated by 145 practitioners.
Automated Classification of Human Code Review Comments with Large Language Models cs.SE · 2026-04-26 · unverdicted · none · ref 22
LLMs reach moderate macro-F1 scores of 0.36-0.37 when classifying code review comments into six smells and three useful intents, with one-shot examples helping some models on intent labels.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer