Evo-L2S uses multi-objective evolutionary model merging to produce reasoning models that cut generated chain-of-thought length by over 50% while preserving or improving accuracy on math benchmarks.
From system 1 to system 2: A survey of reasoning large language models.IEEE Trans
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
VPG-EA applies variational posterior guidance and efficiency-aware distillation to compress LLM reasoning chains while preserving performance.
WebUncertainty improves web agent performance on benchmarks by adaptively selecting planning modes based on task uncertainty and using confidence-induced action uncertainty in MCTS to quantify aleatoric and epistemic uncertainty for better decisions.
citing papers explorer
-
Multi-objective Evolutionary Merging Enables Efficient Reasoning Models
Evo-L2S uses multi-objective evolutionary model merging to produce reasoning models that cut generated chain-of-thought length by over 50% while preserving or improving accuracy on math benchmarks.
-
Efficient LLM Reasoning via Variational Posterior Guidance with Efficiency Awareness
VPG-EA applies variational posterior guidance and efficiency-aware distillation to compress LLM reasoning chains while preserving performance.
-
WebUncertainty: Dual-Level Uncertainty Driven Planning and Reasoning For Autonomous Web Agent
WebUncertainty improves web agent performance on benchmarks by adaptively selecting planning modes based on task uncertainty and using confidence-induced action uncertainty in MCTS to quantify aleatoric and epistemic uncertainty for better decisions.