Aviral Kumar
Identifiers
- name variant Aviral Kumar 0.60 · backfill
Papers (21)
- AsyncWebRL: Efficient Multi-Step RL for Visual Web Agents cs.LG · 2026 · author #5
- Recursive Agent Optimization cs.LG · 2026 · author #4
- QED-Nano: Teaching a Tiny Model to Prove Hard Theorems cs.AI · 2026 · author #9
- What Does Flow Matching Bring To TD Learning? cs.LG · 2026 · author #3
- TRIM: Hybrid Inference via Targeted Stepwise Routing in Multi-Step Reasoning Tasks cs.AI · 2026 · author #6
- WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks cs.LG · 2026 · author #4
- Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025 · author #1615
- Grounded Reinforcement Learning for Visual Reasoning cs.CV · 2025 · author #6
- Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning cs.LG · 2024 · author #9
- Training Language Models to Self-Correct via Reinforcement Learning cs.LG · 2024 · author #1
- Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters cs.LG · 2024 · author #4
- Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context cs.CL · 2024 · author #282
- Gemini: A Family of Highly Capable Multimodal Models cs.CL · 2023 · author #842
- Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models cs.RO · 2023 · author #6
- Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems cs.LG · 2020 · author #2
- D4RL: Datasets for Deep Data-Driven Reinforcement Learning cs.LG · 2020 · author #2
- Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning cs.LG · 2019 · author #2
- Graph Normalizing Flows cs.LG · 2019 · author #2
- Calibration of Encoder Decoder Models for Neural Machine Translation cs.LG · 2019 · author #1
- Diagnosing Bottlenecks in Deep Q-learning Algorithms cs.LG · 2019 · author #2
- The Reach-Avoid Problem for Constant-Rate Multi-Mode Systems cs.LO · 2017 · author #2
Mentions
- 2606.05597 #5 · arxiv_oai · confidence 0.70 Aviral Kumar
- 2410.08146 #9 · arxiv_oai · confidence 0.70 Aviral Kumar
- 2505.23678 #6 · arxiv_oai · confidence 0.70 Aviral Kumar
- 2409.12917 #1 · arxiv_oai · confidence 0.70 Aviral Kumar
- 2310.10639 #6 · arxiv_oai · confidence 0.70 Aviral Kumar
Frequent Coauthors
- George Tucker 6 shared papers
- Sergey Levine 5 shared papers
- Colton Bishop 4 shared papers
- Cosmin Paduraru 4 shared papers
- Disha Shrivastava 4 shared papers
- Justin Fu 4 shared papers
- Kate Baumli 4 shared papers
- Kelvin Xu 4 shared papers
- Rishabh Agarwal 4 shared papers
- Shariq Iqbal 4 shared papers
- Aaron Cohen 3 shared papers
- Aaron Parisi 3 shared papers
- Abe Ittycheriah 3 shared papers
- Abhanshu Sharma 3 shared papers
- Abhijit Karmarkar 3 shared papers
- Abhimanyu Goyal 3 shared papers
- Abhishek Chakladar 3 shared papers
- Achintya Singhal 3 shared papers
- Ada Ma 3 shared papers
- Adam Bloniarz 3 shared papers