Mixed citations

A Stochastic Approximation Method

doi: 10 · 1951 · arXiv aoms/1177729

Mixed citation behavior. Most common role is background (62%).

94 Pith papers citing it

Background 62% of classified citations


citation-role summary

background: 10 · method: 3

citation-polarity summary

background: 8 · use method: 3 · unclear: 2

representative citing papers

Random Reshuffling Dominates Stochastic Gradient Descent

math.OC · 2026-06-30 · unverdicted · novelty 8.0

RR dominates SGD in smooth convex optimization under any reasonable stepsize after any finite number of epochs.

Optimal Deterministic Multicalibration and Omniprediction

cs.LG · 2026-06-18 · unverdicted · novelty 8.0

Presents a deterministic minimax-optimal multicalibration algorithm and its generalization to outcome indistinguishability and omniprediction, resolving open questions on randomization necessity.

The Geometric Wall: Manifold Structure Predicts Layerwise Sparse Autoencoder Scaling Laws

cs.LG · 2026-05-11 · unverdicted · novelty 8.0

Manifold curvature and intrinsic dimension predict layerwise SAE width exponents and asymptotic floors across Gemma models, with cross-model transfer of the geometric regression, establishing a transferable geometric law instead of a universal scaling law.

Steered LLM Activations are Non-Surjective

cs.AI · 2026-04-10 · unverdicted · novelty 8.0 · 2 refs

Steered LLM activations are non-surjective: under practical assumptions, they lie outside the set of states reachable from any discrete prompt.

Quantum-optimal coronagraphy with spatial mode sorting for direct exoplanet observations

astro-ph.IM · 2026-07-02 · unverdicted · novelty 7.0

The paper derives quantum-optimal spatial modes for mode-sorting coronagraphy that account for finite star size and complex apertures, improving detection performance at close working angles.

Fast Computation of Free-Support Wasserstein Medians

stat.CO · 2026-06-17 · unverdicted · novelty 7.0

Direct fixed-weight solver for free-support Wasserstein medians relocates atoms using OT barycentric projections and inverse-distance weights, achieving monotone descent on smoothed objectives with fewer subproblems than nested Weiszfeld baselines.

Expected Free Energy-based Planning as Variational Inference

cs.AI · 2026-06-09 · unverdicted · novelty 7.0

EFE-based planning is formulated as variational free energy minimization with epistemic priors, decomposing into expected plan costs plus a complexity term.

Arbitrage-free Data Pricing

cs.GT · 2026-06-09 · unverdicted · novelty 7.0

The paper shows that arbitrage-free information pricing is computationally hard in general, provides a branch-and-bound algorithm, and proves that for threshold utilities arbitrage-freeness reduces to Blackwell dominance, unifying prior query and model pricing results.

What Type of Inference is Active Inference?

cs.AI · 2026-06-03 · unverdicted · novelty 7.0

EFE-based active inference planning is characterized as VFE on an augmented model plus entropy and planning corrections, with a derived message-passing implementation and grid-world validation.

Experimental Collapse in Virophysics: Protocol-Resolved Observation, Inference, and Plaque-Assay Blindness

physics.bio-ph · 2026-05-27 · unverdicted · novelty 7.0

The paper introduces a protocol-resolved framework for virological measurements, defining an observation operator that maps latent ensembles to observed data and recasting plaque assays as estimates of protocol-conditioned infectious concentration.

Scale-Calibrated Median-of-Means for Robust Distributed Principal Component Analysis

stat.ME · 2026-05-20 · unverdicted · novelty 7.0

Proposes a scale-calibrated median-of-means estimator for robust aggregation of distributed PCA estimates on the product of Euclidean space and Grassmann manifold.

The Spatial Cramér-von Mises Test of Independence under β-Mixing: Asymptotic Theory and Python Implementation

stat.ME · 2026-05-18 · unverdicted · novelty 7.0

Derives the asymptotic distribution of the spatial Cramér-von Mises independence statistic under β-mixing on R² and implements it in Python with eigenvalue-based critical values.

Quantum enhanced identification of boosted jets with quantum graph neural networks

hep-ph · 2026-05-18 · unverdicted · novelty 7.0

A 10-qubit convolutional quantum graph neural network fed by autoencoder-compressed jet data achieves performance comparable to classical graph networks in distinguishing boosted Z jets from gluon jets.

Generative reconstruction of 2D and 3D polycrystalline microstructures using symmetrized hyperspherical harmonics

cond-mat.mtrl-sci · 2026-05-14 · unverdicted · novelty 7.0

A new differentiable reconstruction method uses symmetrized hyperspherical harmonics on quaternions plus two- and three-point descriptors to generate 3D microstructures from 2D data, demonstrated on aluminum alloy with L-BFGS-B optimization.

Characterizing the Generalization Error of Random Feature Regression with Arbitrary Data-Augmentation

stat.ML · 2026-05-11 · conditional · novelty 7.0

The test error of random-feature ridge regression with arbitrary data augmentation admits a closed-form asymptotic characterization in the proportional regime that depends only on population covariances and augmentation statistics.

Entropic Reciprocity in Time-Reversed Young Interferometry

quant-ph · 2026-05-01 · unverdicted · novelty 7.0

Time-reversed Young interferometry acts as a source-space information processor where mutual information is the reciprocal invariant and source-label entropy can decrease near destructive interference while Fisher information rises.

Fast and Exact: Asymptotically Linear KL-Optimal Frequency Normalization

cs.IT · 2026-05-01 · unverdicted · novelty 7.0

Three new provably KL-optimal frequency normalization algorithms are presented, one running in linear time in the number of symbols.

Profile Likelihood Inference for Anisotropic Hyperbolic Wrapped Normal Models on Hyperbolic Space

math.ST · 2026-05-01 · unverdicted · novelty 7.0

The profile maximum likelihood estimator for the location in anisotropic hyperbolic wrapped normal models is strongly consistent, asymptotically normal, and attains the Hájek-Le Cam minimax lower bound under squared geodesic loss.

Complexity Guarantees for Zeroth-order Methods via Exponentially-shifted Gaussian Smoothing: Mitigating Dimension-dependence and Incorporating Decision-dependence

math.OC · 2026-04-16 · unverdicted · novelty 7.0

Exponentially-shifted Gaussian smoothing yields zeroth-order gradient estimators with linear dimension dependence, enabling improved complexity bounds for stochastic optimization including decision-dependent regimes.

Reinforcement Learning via Value Gradient Flow

cs.LG · 2026-04-15 · unverdicted · novelty 7.0

VGF solves behavior-regularized RL by transporting particles from a reference distribution to the value-induced optimal policy via discrete value-guided gradient flow.

Stability of the Shannon-McMillan-Breiman Theorem under Sublinear Parsings

cs.IT · 2026-04-15 · unverdicted · novelty 7.0

The normalized sum of negative log-likelihoods under sublinear parsings converges almost surely and in L1 to the entropy rate h_P for any shift-invariant measure on a finite shift space.

Obtaining Partition Crossover masks using Statistical Linkage Learning for solving noised optimization problems with hidden variable dependency structure

stat.ML · 2026-04-13 · unverdicted · novelty 7.0

Statistical Linkage Learning enables a new mask construction algorithm for Partition Crossover that maintains effectiveness on noisy problems with hidden dependencies and matches noise-free performance when decomposition quality is high.

Many-Tier Instruction Hierarchy in LLM Agents

cs.CL · 2026-04-10 · unverdicted · novelty 7.0

ManyIH and ManyIH-Bench address instruction conflicts in LLM agents with up to 12 privilege levels across 853 tasks, revealing frontier models achieve only ~40% accuracy.

Causal Multi-Task Demand Learning

cs.LG · 2026-02-10 · unverdicted · novelty 7.0

A meta-learning method identifies the conditional mean of task-specific causal demand parameters by conditioning on all prices while masking two demand outcomes, assuming at least two locally exogenous prices per task.

citing papers explorer

Showing 6 of 6 citing papers after filters.

Steered LLM Activations are Non-Surjective

cs.AI · 2026-04-10 · unverdicted · none · ref 4 · 2 links

Steered LLM activations are non-surjective: under practical assumptions, they lie outside the set of states reachable from any discrete prompt.

Expected Free Energy-based Planning as Variational Inference

cs.AI · 2026-06-09 · unverdicted · none · ref 166

EFE-based planning is formulated as variational free energy minimization with epistemic priors, decomposing into expected plan costs plus a complexity term.

What Type of Inference is Active Inference?

cs.AI · 2026-06-03 · unverdicted · none · ref 181

EFE-based active inference planning is characterized as VFE on an augmented model plus entropy and planning corrections, with a derived message-passing implementation and grid-world validation.

When Outcome Looks Right But Discipline Fails: Trace-Based Evaluation Under Hidden Competitor State

cs.AI · 2026-05-18 · unverdicted · none · ref 7

The paper introduces discipline stability, a trace-based evaluation paradigm for checking if RL agents maintain behavioral discipline like rule-based competitors in hidden-state competitive settings such as hotel pricing and bidding.

Scaling Laws for Task-Specific LLM Distillation

cs.AI · 2026-06-23 · unverdicted · none · ref 39

Empirical scaling laws for task-specific LLM distillation in quantitative finance indicate that chain-of-thought supervision recovers general knowledge lost during iterative pruning while in-domain performance degrades predictably.

PRISMat: Policy-Driven, Permutation-Invariant Autoregressive Material Generation

cs.AI · 2026-05-15 · unverdicted · none · ref 23

PRISMat generates crystal slabs with mean absolute errors of 0.188 eV/Å² for cleavage energy and 2.79 eV for work function, reducing error by 4× versus the next best model while using less inference time.
