pith. sign in

Evaluating large language models trained on code,

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

fields

cs.CR 2 cs.SE 2

verdicts

UNVERDICTED 4

representative citing papers

CodeMind: Evaluating Large Language Models for Code Reasoning

cs.SE · 2024-02-15 · unverdicted · novelty 7.0

CodeMind evaluates ten LLMs on four benchmarks using three new code reasoning tasks, finding performance varies by model size and drops with complexity while showing no correlation with bug repair ability.

citing papers explorer

Showing 4 of 4 citing papers.