Y., Liu, C., Gao, H., Thongtanunam, P., and Treude, C.CodeReviewQA: The code review comprehension assessment for large language models

Lin, H · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Beyond Code Snippets: Benchmarking LLMs on Repository-Level Question Answering

cs.SE · 2026-03-27 · unverdicted · novelty 7.0

StackRepoQA shows LLMs reach only moderate accuracy on multi-file Java QA tasks, with gains from graph-based retrieval but frequent reliance on verbatim answer reproduction.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond Code Snippets: Benchmarking LLMs on Repository-Level Question Answering cs.SE · 2026-03-27 · unverdicted · none · ref 31
StackRepoQA shows LLMs reach only moderate accuracy on multi-file Java QA tasks, with gains from graph-based retrieval but frequent reliance on verbatim answer reproduction.

Y., Liu, C., Gao, H., Thongtanunam, P., and Treude, C.CodeReviewQA: The code review comprehension assessment for large language models

fields

years

verdicts

representative citing papers

citing papers explorer