George Tucker
Identifiers
- name variant: George Tucker · 0.60 · backfill
Papers (27)
- Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025 · author #2263
- Training Language Models to Self-Correct via Reinforcement Learning cs.LG · 2024 · author #15
- Gemma: Open Models Based on Gemini Research and Technology cs.CL · 2024 · author #35
- Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context cs.CL · 2024 · author #524
- Gemini: A Family of Highly Capable Multimodal Models cs.CL · 2023 · author #79
- Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems cs.LG · 2020 · author #3
- D4RL: Datasets for Deep Data-Driven Reinforcement Learning cs.LG · 2020 · author #4
- Behavior Regularized Offline Reinforcement Learning cs.LG · 2019 · author #2
- Reinforcement Learning Driven Heuristic Optimization cs.LG · 2019 · author #4
- On Variational Bounds of Mutual Information cs.LG · 2019 · author #5
- Learning to Walk via Deep Reinforcement Learning cs.LG · 2018 · author #5
- Soft Actor-Critic Algorithms and Applications cs.LG · 2018 · author #4
- The Laplacian in RL: Learning Representations with Efficient Approximations cs.LG · 2018 · author #2
- Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives cs.LG · 2018 · author #1
- Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion cs.LG · 2018 · author #3
- Guided evolutionary strategies: Augmenting random search with surrogate gradients cs.NE · 2018 · author #3
- Smoothed Action Value Functions for Learning Gaussian Policies cs.LG · 2018 · author #3
- The Mirage of Action-Dependent Baselines in Reinforcement Learning cs.LG · 2018 · author #1
- Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling stat.ML · 2018 · author #2
- An online sequence-to-sequence model for noisy speech recognition cs.CL · 2017 · author #4
- Filtering Variational Objectives cs.LG · 2017 · author #3
- Learning Hard Alignments with Variational Inference cs.AI · 2017 · author #3
- Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting cs.CL · 2017 · author #3
- REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models cs.LG · 2017 · author #1
- Particle Value Functions cs.LG · 2017 · author #3
- Regularizing Neural Networks by Penalizing Confident Output Distributions cs.NE · 2017 · author #2
- Compacting Neural Network Classifiers via Dropout Training stat.ML · 2016 · author #2
Mentions
- 2409.12917 #15 · arxiv_oai · confidence 0.70 · George Tucker
Frequent Coauthors
- Aviral Kumar 6 shared papers
- Dieterich Lawson 6 shared papers
- Sergey Levine 5 shared papers
- Alek Andreev 4 shared papers
- Alex Castro-Ros 4 shared papers
- Ambrose Slone 4 shared papers
- Andrea Tacchetti 4 shared papers
- Anna Bulanova 4 shared papers
- Charline Le Lan 4 shared papers
- Chris J. Maddison 4 shared papers
- Christopher A. Choquette-Choo 4 shared papers
- Chung-Cheng Chiu 4 shared papers
- Colton Bishop 4 shared papers
- Cosmin Paduraru 4 shared papers
- David Reid 4 shared papers
- Demis Hassabis 4 shared papers
- Disha Shrivastava 4 shared papers
- Elena Buchatskaya 4 shared papers
- Eli Collins 4 shared papers
- Eric Ni 4 shared papers