Title resolution pending

· 1991

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Adaptive Test-Time Compute Allocation for Reasoning LLMs via Constrained Policy Optimization

cs.LG · 2026-04-16 · unverdicted · novelty 6.0

A Lagrangian-relaxation plus imitation-learning pipeline adaptively allocates test-time compute to LLMs, outperforming uniform baselines by up to 12.8% relative accuracy on MATH while staying within a fixed average budget.

ORTHOBO: Orthogonal Bayesian Hyperparameter Optimization

cs.LG · 2026-05-07 · unverdicted · novelty 5.0

OrthoBO introduces an orthogonal acquisition estimator subtracting an optimally weighted score-function control variate to reduce Monte Carlo variance, preserve the acquisition target, and improve ranking stability in Bayesian hyperparameter optimization.

citing papers explorer

Showing 2 of 2 citing papers.

Adaptive Test-Time Compute Allocation for Reasoning LLMs via Constrained Policy Optimization cs.LG · 2026-04-16 · unverdicted · none · ref 27
A Lagrangian-relaxation plus imitation-learning pipeline adaptively allocates test-time compute to LLMs, outperforming uniform baselines by up to 12.8% relative accuracy on MATH while staying within a fixed average budget.
ORTHOBO: Orthogonal Bayesian Hyperparameter Optimization cs.LG · 2026-05-07 · unverdicted · none · ref 32
OrthoBO introduces an orthogonal acquisition estimator subtracting an optimally weighted score-function control variate to reduce Monte Carlo variance, preserve the acquisition target, and improve ranking stability in Bayesian hyperparameter optimization.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer