Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs

Aggarwal, Pranjal, Madaan, Aman, Yang, Yiming, Mausam · 2023 · DOI 10.18653/v1/2023.emnlp-main.761

9 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

baseline 1

citation-polarity summary

baseline 1

representative citing papers

Fork-Think with Confidence

cs.LG · 2026-06-30 · unverdicted · novelty 7.0

Fork-Think with Confidence identifies forking points via model confidence in a single path before sampling continuations, cutting tokens by up to 30% and runtime by up to 57% on reasoning benchmarks while matching or exceeding parallel thinking performance.

DART: Draft-Agreement Routing for Training-Free Adaptive Thinking Budgets in Hybrid Reasoning Models

cs.AI · 2026-06-22 · unverdicted · novelty 7.0

DART is a training-free router that accepts direct answers on draft agreement and allocates thinking budgets via draft entropy on disagreement, reporting accuracy gains and token reductions on math and code benchmarks across model scales.

Skill-CMIB: Multimodal Agent Skill for Consistent Action via Conditional Multimodal Information Bottleneck

cs.LG · 2026-05-08 · unverdicted · novelty 7.0

CMIB uses a conditional multimodal information bottleneck to create reusable agent skills that separate verbalizable text content from predictive perceptual residuals, improving execution stability.

Two Calls, Two Moments, and the Vote-Accuracy Curve of Repeated LLM Inference

cs.LG · 2026-05-05 · unverdicted · novelty 7.0 · 2 refs

Two calls per example identify the first two moments of latent correctness probability, enabling exact bounds on the vote-accuracy curve for any majority-vote budget under conditional i.i.d. assumptions.

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

cs.CL · 2025-03-06 · unverdicted · novelty 7.0

LCPO trains L1 reasoning models to adhere to prompt-specified CoT lengths, supporting accuracy-compute trade-offs and yielding short reasoning models that outperform larger baselines at matched lengths.

ZAS-SQL: Distilling Rules from Failures for Zero-Shot Text-to-SQL

cs.CL · 2026-06-06 · unverdicted · novelty 6.0

ZAS-SQL distills rules from zero-shot Text-to-SQL failures to reach 87.2-88.6% execution accuracy on Spider, a new zero-shot state of the art that surpasses some GPT-4 few-shot and fine-tuned baselines.

A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration

cs.CL · 2023-10-03 · conditional · novelty 6.0

DyLAN automatically selects and dynamically organizes LLM agents for collaboration, outperforming fixed-agent baselines on code generation, reasoning, and decision tasks with up to 25% accuracy gains on some MMLU subjects.

Resource-Aware Neuro-Symbolic Reasoning for Local Small Language Models

cs.LO · 2026-06-25 · unverdicted · novelty 3.0

VFR-LLM combines small LLMs with symbolic verification and solving to reach 0.983 and 0.933 accuracy on precedence and logical deduction tasks with a single model call, outperforming self-consistency baselines.

Self-Consistency from Only Two Samples: CoT-PoT Ensembling for Efficient LLM Reasoning

cs.CL · 2026-04-19

citing papers explorer

Showing 7 of 7 citing papers after filters.

Fork-Think with Confidence

cs.LG · 2026-06-30 · unverdicted · none · ref 96

Fork-Think with Confidence identifies forking points via model confidence in a single path before sampling continuations, cutting tokens by up to 30% and runtime by up to 57% on reasoning benchmarks while matching or exceeding parallel thinking performance.

DART: Draft-Agreement Routing for Training-Free Adaptive Thinking Budgets in Hybrid Reasoning Models

cs.AI · 2026-06-22 · unverdicted · none · ref 39

DART is a training-free router that accepts direct answers on draft agreement and allocates thinking budgets via draft entropy on disagreement, reporting accuracy gains and token reductions on math and code benchmarks across model scales.

Skill-CMIB: Multimodal Agent Skill for Consistent Action via Conditional Multimodal Information Bottleneck

cs.LG · 2026-05-08 · unverdicted · none · ref 6

CMIB uses a conditional multimodal information bottleneck to create reusable agent skills that separate verbalizable text content from predictive perceptual residuals, improving execution stability.

Two Calls, Two Moments, and the Vote-Accuracy Curve of Repeated LLM Inference

cs.LG · 2026-05-05 · unverdicted · none · ref 1 · 2 links

Two calls per example identify the first two moments of latent correctness probability, enabling exact bounds on the vote-accuracy curve for any majority-vote budget under conditional i.i.d. assumptions.

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

cs.CL · 2025-03-06 · unverdicted · none · ref 2

LCPO trains L1 reasoning models to adhere to prompt-specified CoT lengths, supporting accuracy-compute trade-offs and yielding short reasoning models that outperform larger baselines at matched lengths.

ZAS-SQL: Distilling Rules from Failures for Zero-Shot Text-to-SQL

cs.CL · 2026-06-06 · unverdicted · none · ref 24

ZAS-SQL distills rules from zero-shot Text-to-SQL failures to reach 87.2-88.6% execution accuracy on Spider, a new zero-shot state of the art that surpasses some GPT-4 few-shot and fine-tuned baselines.

Resource-Aware Neuro-Symbolic Reasoning for Local Small Language Models

cs.LO · 2026-06-25 · unverdicted · none · ref 8

VFR-LLM combines small LLMs with symbolic verification and solving to reach 0.983 and 0.933 accuracy on precedence and logical deduction tasks with a single model call, outperforming self-consistency baselines.
