Bernstein-type dimension-free concentration for self-normalised martingales
classification
🧮 math.PR
math.STstat.TH
keywords
bernstein-typedimension-freelogisticmartingalesself-normalisedadaptivelyapplicationsbandits
read the original abstract
We introduce a dimension-free Bernstein-type tail inequality for self-normalised martingales, where the normalisation uses the predictable quadratic variation and the radius depends on the information gain of the observed covariance. As applications, we provide ellipsoidal confidence sequences for logistic regression with adaptively chosen Hilbert-valued covariates, and give instance-adaptive regret bounds for Hilbert-armed logistic bandits.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Logistic Bandits with $\tilde{O}(\sqrt{dT})$ Regret without Context Diversity Assumptions
SupSplitLog achieves Õ(√(dT)) regret for logistic bandits without context diversity assumptions by splitting samples for an initial estimator and Newton correction, and can adapt to data-dependent bounds.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.