pith. sign in

Andrei Polubarov

Identifiers

  • name variant Andrei Polubarov 0.60 · backfill

Papers (2)

  1. Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner cs.LG · 2026 · author #1
  2. Yes, Q-learning Helps Offline In-Context RL cs.LG · 2025 · author #5

Mentions

  • 2502.17666 #5 · arxiv_oai · confidence 0.70 Andrei Polubarov

Frequent Coauthors