Contextual Bandits with Random Projection

· 2019 · cs.LG · arXiv 1903.08600

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Contextual bandits with linear payoffs, which are also known as linear bandits, provide a powerful alternative for solving practical problems of sequential decisions, e.g., online advertisements. In the era of big data, contextual data usually tend to be high-dimensional, which leads to new challenges for traditional linear bandits mostly designed for the setting of low-dimensional contextual data. Due to the curse of dimensionality, there are two challenges in most of the current bandit algorithms: the first is high time-complexity; and the second is extreme large upper regret bounds with high-dimensional data. In this paper, in order to attack the above two challenges effectively, we develop an algorithm of Contextual Bandits via RAndom Projection (\texttt{CBRAP}) in the setting of linear payoffs, which works especially for high-dimensional contextual data. The proposed \texttt{CBRAP} algorithm is time-efficient and flexible, because it enables players to choose an arm in a low-dimensional space, and relaxes the sparsity assumption of constant number of non-zero components in previous work. Besides, we provide a linear upper regret bound for the proposed algorithm, which is associated with reduced dimensions.

representative citing papers

Graph Dimensionality Reduction for Contextual Bandits: Structure-Specific Regret Bounds under Approximate Smoothness and Noisy Eigenspaces

cs.LG · 2026-06-26 · unverdicted · novelty 7.0

GraphDR-LinUCB projects contextual bandit arms onto a graph's low-frequency eigenspace to obtain the first Õ(k√T) regret bound under approximate smoothness, with a spectral predictor Γ_k that matches outcomes on five of six real datasets.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Graph Dimensionality Reduction for Contextual Bandits: Structure-Specific Regret Bounds under Approximate Smoothness and Noisy Eigenspaces cs.LG · 2026-06-26 · unverdicted · none · ref 43 · internal anchor
GraphDR-LinUCB projects contextual bandit arms onto a graph's low-frequency eigenspace to obtain the first Õ(k√T) regret bound under approximate smoothness, with a spectral predictor Γ_k that matches outcomes on five of six real datasets.

Contextual Bandits with Random Projection

fields

years

verdicts

representative citing papers

citing papers explorer