main . MyClass . func

** I M P O R T A N T **: Always use fully q u a l i f i e d names with the module prefix

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

An Execution-Verified Multi-Language Benchmark for Code Semantic Reasoning

cs.SE · 2026-05-10 · unverdicted · novelty 8.0

TraceEval is the first execution-verified multi-language benchmark for recovering runtime call structures from source code, containing 10,583 programs and showing top LLMs reach 72.9% average F1.

citing papers explorer

Showing 1 of 1 citing paper.

An Execution-Verified Multi-Language Benchmark for Code Semantic Reasoning cs.SE · 2026-05-10 · unverdicted · none · ref 7
TraceEval is the first execution-verified multi-language benchmark for recovering runtime call structures from source code, containing 10,583 programs and showing top LLMs reach 72.9% average F1.

main . MyClass . func

fields

years

verdicts

representative citing papers

citing papers explorer