Large Language Models are Few-Shot Summarizers: Multi-Intent Comment Generation via In-Context Learning , url=

Lianglu Pan, Shaanan Cohney, Toby Murray, Van-Thuan Pham · 2024

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

representative citing papers

Evaluating Non-English Developer Support in Machine Learning for Software Engineering

cs.SE · 2026-05-07 · unverdicted · novelty 7.0

Code LLMs generate substantially worse comments outside English, and no tested automatic metric or LLM judge reliably matches human assessment of those outputs.

POSTCONDBENCH: Benchmarking Correctness and Completeness in Formal Postcondition Inference

cs.SE · 2026-05-05 · unverdicted · novelty 7.0

POSTCONDBENCH is a new multilingual benchmark that evaluates LLM postcondition generation on real code using defect discrimination to assess completeness beyond surface matching.

Beyond Code Reasoning: Specification-Anchored Auditing of Multi-Implementation Distributed Protocols

cs.CR · 2026-04-29 · conditional · novelty 7.0

SPECA derives categorized security properties from specifications to enable cross-implementation auditing of distributed protocols, recovering all 15 expert-augmented vulnerabilities on an Ethereum contest and achieving 88.9% precision at 100% recall on a C/C++ benchmark.

citing papers explorer

Showing 3 of 3 citing papers.

Evaluating Non-English Developer Support in Machine Learning for Software Engineering cs.SE · 2026-05-07 · unverdicted · none · ref 47
Code LLMs generate substantially worse comments outside English, and no tested automatic metric or LLM judge reliably matches human assessment of those outputs.
POSTCONDBENCH: Benchmarking Correctness and Completeness in Formal Postcondition Inference cs.SE · 2026-05-05 · unverdicted · none · ref 89
POSTCONDBENCH is a new multilingual benchmark that evaluates LLM postcondition generation on real code using defect discrimination to assess completeness beyond surface matching.
Beyond Code Reasoning: Specification-Anchored Auditing of Multi-Implementation Distributed Protocols cs.CR · 2026-04-29 · conditional · none · ref 18
SPECA derives categorized security properties from specifications to enable cross-implementation auditing of distributed protocols, recovering all 15 expert-augmented vulnerabilities on an Ethereum contest and achieving 88.9% precision at 100% recall on a C/C++ benchmark.

Large Language Models are Few-Shot Summarizers: Multi-Intent Comment Generation via In-Context Learning , url=

fields

years

verdicts

representative citing papers

citing papers explorer