Adaptive layer-skipping in pre-trained LLMs. arXiv preprint arXiv:2503.23798

Xuan Luo, Weizhi Wang, Xifeng Yan · arXiv 2503.23798

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

cs.CL · 2025-10-14 · unverdicted · novelty 6.0

Dr. LLM retrofits frozen LLMs with MCTS-supervised per-layer routers for skip/execute/repeat decisions, delivering up to +3.4% accuracy and 5-layer savings on reasoning tasks with strong out-of-domain generalization.

citing papers explorer

Showing 1 of 1 citing paper.

Dr.LLM: Dynamic Layer Routing in LLMs · cs.CL · 2025-10-14 · unverdicted · none · ref 13
