Rethinking Code Complexity Through the Lens of Large Language Models

Beijun Shen; Chen Xie; Xiaodong Gu; Yuling Shi

arxiv: 2602.07882 · v2 · pith:LBTS4EPEnew · submitted 2026-02-08 · 💻 cs.SE

Rethinking Code Complexity Through the Lens of Large Language Models

Chen Xie , Xiaodong Gu , Yuling Shi , Beijun Shen This is my paper

classification 💻 cs.SE

keywords complexitycodelm-ccllmsdifficultymetricsperformancecoding

0 comments

read the original abstract

Code complexity metrics such as cyclomatic complexity have long been used to assess software quality and maintainability. With the rapid advancement of large language models (LLMs) on coding tasks, an important yet underexplored question arises: do traditional complexity metrics meaningfully characterize the coding difficulty that LLMs perceive? In this work, we empirically demonstrate that classical complexity metrics exhibit no consistent correlation with LLM performance, revealing a fundamental mismatch with model-perceived difficulty. To address this gap, we propose LM-CC, a novel code complexity metric tailored for LLMs, grounded in the hypothesis that model-perceived code difficulty is fundamentally driven by semantic nonlinearity. LM-CC quantifies complexity through an entropy-guided semantic compositional hierarchy, capturing the cumulative uncertainty encountered by LLMs during code understanding. Our experimental results demonstrate that LM-CC exhibits strong and consistent partial correlations with LLM performance, while semantics-preserving reductions in LM-CC consistently lead to improved downstream task performance. The source code is available at: https://github.com/xchen121/lm-cc.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Zero-Shot Vulnerability Detection in Low-Resource Smart Contracts Through Solidity-Only Training
cs.CR 2026-03 unverdicted novelty 5.0

Sol2Vy transfers vulnerability detection from Solidity to Vyper in zero-shot fashion, outperforming prior methods on reentrancy, weak randomness, and unchecked transfers.