George Tucker — Pith Author Registry

Identifiers

name variant George Tucker 0.60 · backfill

Papers (27)

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025 · author #2263
Training Language Models to Self-Correct via Reinforcement Learning cs.LG · 2024 · author #15
Gemma: Open Models Based on Gemini Research and Technology cs.CL · 2024 · author #35
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context cs.CL · 2024 · author #524
Gemini: A Family of Highly Capable Multimodal Models cs.CL · 2023 · author #79
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems cs.LG · 2020 · author #3
D4RL: Datasets for Deep Data-Driven Reinforcement Learning cs.LG · 2020 · author #4
Behavior Regularized Offline Reinforcement Learning cs.LG · 2019 · author #2
Reinforcement Learning Driven Heuristic Optimization cs.LG · 2019 · author #4
On Variational Bounds of Mutual Information cs.LG · 2019 · author #5
Learning to Walk via Deep Reinforcement Learning cs.LG · 2018 · author #5
Soft Actor-Critic Algorithms and Applications cs.LG · 2018 · author #4
The Laplacian in RL: Learning Representations with Efficient Approximations cs.LG · 2018 · author #2
Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives cs.LG · 2018 · author #1
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion cs.LG · 2018 · author #3
Guided evolutionary strategies: Augmenting random search with surrogate gradients cs.NE · 2018 · author #3
Smoothed Action Value Functions for Learning Gaussian Policies cs.LG · 2018 · author #3
The Mirage of Action-Dependent Baselines in Reinforcement Learning cs.LG · 2018 · author #1
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling stat.ML · 2018 · author #2
An online sequence-to-sequence model for noisy speech recognition cs.CL · 2017 · author #4
Filtering Variational Objectives cs.LG · 2017 · author #3
Learning Hard Alignments with Variational Inference cs.AI · 2017 · author #3
Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting cs.CL · 2017 · author #3
REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models cs.LG · 2017 · author #1
Particle Value Functions cs.LG · 2017 · author #3
Regularizing Neural Networks by Penalizing Confident Output Distributions cs.NE · 2017 · author #2
Compacting Neural Network Classifiers via Dropout Training stat.ML · 2016 · author #2

Mentions

2409.12917 #15 · arxiv_oai · confidence 0.70 George Tucker

Frequent Coauthors

Aviral Kumar 6 shared papers
Dieterich Lawson 6 shared papers
Sergey Levine 5 shared papers
Alek Andreev 4 shared papers
Alex Castro-Ros 4 shared papers
Ambrose Slone 4 shared papers
Andrea Tacchetti 4 shared papers
Anna Bulanova 4 shared papers
Charline Le Lan 4 shared papers
Chris J. Maddison 4 shared papers
Christopher A. Choquette-Choo 4 shared papers
Chung-Cheng Chiu 4 shared papers
Colton Bishop 4 shared papers
Cosmin Paduraru 4 shared papers
David Reid 4 shared papers
Demis Hassabis 4 shared papers
Disha Shrivastava 4 shared papers
Elena Buchatskaya 4 shared papers
Eli Collins 4 shared papers
Eric Ni 4 shared papers