Simon S Du, Sham M Kakade, Ruosong Wang, and Lin F Yang

Du, S · 1910 · arXiv 1910.03016

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Provably Efficient Offline-to-Online Value Adaptation with General Function Approximation

cs.LG · 2026-04-15 · unverdicted · novelty 7.0

Offline-to-online value adaptation in RL has a minimax lower bound matching pure online learning in hard cases, yet O2O-LSVI improves sample complexity under a novel structural condition on pretrained Q-functions.

On the Power of Foundation Models

cs.AI · 2022-11-29 · unverdicted · novelty 5.0

Category theory proves prompt-based learning on perfect foundation models works only for representable tasks, fine-tuning solves tasks in the pretext category, and models can represent unseen target-category objects using source-category structure.

citing papers explorer

Showing 2 of 2 citing papers.

Provably Efficient Offline-to-Online Value Adaptation with General Function Approximation cs.LG · 2026-04-15 · unverdicted · none · ref 2
Offline-to-online value adaptation in RL has a minimax lower bound matching pure online learning in hard cases, yet O2O-LSVI improves sample complexity under a novel structural condition on pretrained Q-functions.
On the Power of Foundation Models cs.AI · 2022-11-29 · unverdicted · none · ref 25
Category theory proves prompt-based learning on perfect foundation models works only for representable tasks, fine-tuning solves tasks in the pretext category, and models can represent unseen target-category objects using source-category structure.

Simon S Du, Sham M Kakade, Ruosong Wang, and Lin F Yang

fields

years

verdicts

representative citing papers

citing papers explorer