Probability inequalities for sums of bounded random variables.Journal of the American statistical association, 58(301):13–30

Wassily Hoeffding · 1963

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

browse 5 citing papers

representative citing papers

TabQL: In-Context Q-Learning with Tabular Foundation Models

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

TabQL is a reinforcement learning framework that substitutes a tabular foundation model with in-context capabilities for the parametric Q-network in DQN, with a warm-up phase and theoretical analysis claiming improved sample efficiency.

Data Reuse and the Long Shadow of Error: Splitting, Subsampling, and Prospectively Managing Inferential Errors

math.ST · 2026-04-08 · unverdicted · novelty 7.0

Subsampling techniques enable uncoordinated data reuse while keeping the expected variance of Type I error counts close to the independent case, outperforming data splitting in scaling with the number of tests.

LiBaGS: Lightweight Boundary Gap Synthesis for Targeted Synthetic Data Selection

cs.LG · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

LiBaGS scores and selects synthetic data near decision boundaries using proximity, uncertainty, density, and validity, with boundary-gap allocation and marginal stopping to improve training accuracy.

Quantum embedding of graphs for subgraph counting

quant-ph · 2026-04-20 · unverdicted · novelty 5.0

A quantum adjacency state on 2 log N qubits plus ancilla enables subgraph count estimation via m-fold tensor product measurements, producing quantum logspace algorithms for motif counting.

Value Mirror Descent for Reinforcement Learning

math.OC · 2026-04-07 · unverdicted · novelty 5.0

Value mirror descent integrates mirror descent into value iteration for discounted MDPs, delivering near-optimal sample complexity of order |S||A|(1-γ)^{-3}ε^{-2} for general convex regularizers and bounded Bregman divergence between generated and optimal policies.

citing papers explorer

Showing 5 of 5 citing papers.

TabQL: In-Context Q-Learning with Tabular Foundation Models cs.LG · 2026-05-18 · unverdicted · none · ref 45
TabQL is a reinforcement learning framework that substitutes a tabular foundation model with in-context capabilities for the parametric Q-network in DQN, with a warm-up phase and theoretical analysis claiming improved sample efficiency.
Data Reuse and the Long Shadow of Error: Splitting, Subsampling, and Prospectively Managing Inferential Errors math.ST · 2026-04-08 · unverdicted · none · ref 21
Subsampling techniques enable uncoordinated data reuse while keeping the expected variance of Type I error counts close to the independent case, outperforming data splitting in scaling with the number of tests.
LiBaGS: Lightweight Boundary Gap Synthesis for Targeted Synthetic Data Selection cs.LG · 2026-05-11 · unverdicted · none · ref 20 · 2 links
LiBaGS scores and selects synthetic data near decision boundaries using proximity, uncertainty, density, and validity, with boundary-gap allocation and marginal stopping to improve training accuracy.
Quantum embedding of graphs for subgraph counting quant-ph · 2026-04-20 · unverdicted · none · ref 21
A quantum adjacency state on 2 log N qubits plus ancilla enables subgraph count estimation via m-fold tensor product measurements, producing quantum logspace algorithms for motif counting.
Value Mirror Descent for Reinforcement Learning math.OC · 2026-04-07 · unverdicted · none · ref 8
Value mirror descent integrates mirror descent into value iteration for discounted MDPs, delivering near-optimal sample complexity of order |S||A|(1-γ)^{-3}ε^{-2} for general convex regularizers and bounded Bregman divergence between generated and optimal policies.

Probability inequalities for sums of bounded random variables.Journal of the American statistical association, 58(301):13–30

fields

years

verdicts

representative citing papers

citing papers explorer