Aviral Kumar — Pith Author Registry

Identifiers

name variant · Aviral Kumar · 0.60 · backfill

Papers (21)

AsyncWebRL: Efficient Multi-Step RL for Visual Web Agents · cs.LG · 2026 · author #5
Recursive Agent Optimization · cs.LG · 2026 · author #4
QED-Nano: Teaching a Tiny Model to Prove Hard Theorems · cs.AI · 2026 · author #9
What Does Flow Matching Bring To TD Learning? · cs.LG · 2026 · author #3
TRIM: Hybrid Inference via Targeted Stepwise Routing in Multi-Step Reasoning Tasks · cs.AI · 2026 · author #6
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks · cs.LG · 2026 · author #4
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities · cs.CL · 2025 · author #1615
Grounded Reinforcement Learning for Visual Reasoning · cs.CV · 2025 · author #6
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning · cs.LG · 2024 · author #9
Training Language Models to Self-Correct via Reinforcement Learning · cs.LG · 2024 · author #1
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters · cs.LG · 2024 · author #4
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context · cs.CL · 2024 · author #282
Gemini: A Family of Highly Capable Multimodal Models · cs.CL · 2023 · author #842
Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models · cs.RO · 2023 · author #6
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems · cs.LG · 2020 · author #2
D4RL: Datasets for Deep Data-Driven Reinforcement Learning · cs.LG · 2020 · author #2
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning · cs.LG · 2019 · author #2
Graph Normalizing Flows · cs.LG · 2019 · author #2
Calibration of Encoder Decoder Models for Neural Machine Translation · cs.LG · 2019 · author #1
Diagnosing Bottlenecks in Deep Q-learning Algorithms · cs.LG · 2019 · author #2
The Reach-Avoid Problem for Constant-Rate Multi-Mode Systems · cs.LO · 2017 · author #2

Mentions

2606.05597 #5 · arxiv_oai · confidence 0.70 · Aviral Kumar
2410.08146 #9 · arxiv_oai · confidence 0.70 · Aviral Kumar
2505.23678 #6 · arxiv_oai · confidence 0.70 · Aviral Kumar
2409.12917 #1 · arxiv_oai · confidence 0.70 · Aviral Kumar
2310.10639 #6 · arxiv_oai · confidence 0.70 · Aviral Kumar

Frequent Coauthors

George Tucker 6 shared papers
Sergey Levine 5 shared papers
Colton Bishop 4 shared papers
Cosmin Paduraru 4 shared papers
Disha Shrivastava 4 shared papers
Justin Fu 4 shared papers
Kate Baumli 4 shared papers
Kelvin Xu 4 shared papers
Rishabh Agarwal 4 shared papers
Shariq Iqbal 4 shared papers
Aaron Cohen 3 shared papers
Aaron Parisi 3 shared papers
Abe Ittycheriah 3 shared papers
Abhanshu Sharma 3 shared papers
Abhijit Karmarkar 3 shared papers
Abhimanyu Goyal 3 shared papers
Abhishek Chakladar 3 shared papers
Achintya Singhal 3 shared papers
Ada Ma 3 shared papers
Adam Bloniarz 3 shared papers