arXiv preprint arXiv:2504.00461

Efficient near-optimal algorithm for online shortest paths in directed acyclic graphs with bandit feedback against adaptive adversaries · 2025 · arXiv 2504.00461

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

On the Power of Adaptivity for $\varepsilon$-Best Arm Identification in Linear Bandits

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

Adaptivity in linear bandits for ε-best arm identification gives only logarithmic improvements on hypercube, ℓ2 ball, m-sets and multi-task settings but polynomial-factor gains on a specially constructed action set, enabled by an adaptive O(d log(1/δ)/ε²) ℓ2-norm estimator.

Differential Privacy in the Extensive-Form Bandit Problem

cs.CR · 2026-05-06 · unverdicted · novelty 7.0

An algorithm achieves Õ(√(A ln(S) T)/ε) regret for extensive-form bandits under ε-local differential privacy, claimed as the first such result.

citing papers explorer

On the Power of Adaptivity for $\varepsilon$-Best Arm Identification in Linear Bandits cs.LG · 2026-05-15 · unverdicted · none · ref 4
Differential Privacy in the Extensive-Form Bandit Problem cs.CR · 2026-05-06 · unverdicted · none · ref 13
