pith. sign in

George Tucker

Identifiers

  • name variant George Tucker 0.60 · backfill

Papers (27)

  1. Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025 · author #2263
  2. Training Language Models to Self-Correct via Reinforcement Learning cs.LG · 2024 · author #15
  3. Gemma: Open Models Based on Gemini Research and Technology cs.CL · 2024 · author #35
  4. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context cs.CL · 2024 · author #524
  5. Gemini: A Family of Highly Capable Multimodal Models cs.CL · 2023 · author #79
  6. Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems cs.LG · 2020 · author #3
  7. D4RL: Datasets for Deep Data-Driven Reinforcement Learning cs.LG · 2020 · author #4
  8. Behavior Regularized Offline Reinforcement Learning cs.LG · 2019 · author #2
  9. Reinforcement Learning Driven Heuristic Optimization cs.LG · 2019 · author #4
  10. On Variational Bounds of Mutual Information cs.LG · 2019 · author #5
  11. Learning to Walk via Deep Reinforcement Learning cs.LG · 2018 · author #5
  12. Soft Actor-Critic Algorithms and Applications cs.LG · 2018 · author #4
  13. The Laplacian in RL: Learning Representations with Efficient Approximations cs.LG · 2018 · author #2
  14. Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives cs.LG · 2018 · author #1
  15. Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion cs.LG · 2018 · author #3
  16. Guided evolutionary strategies: Augmenting random search with surrogate gradients cs.NE · 2018 · author #3
  17. Smoothed Action Value Functions for Learning Gaussian Policies cs.LG · 2018 · author #3
  18. The Mirage of Action-Dependent Baselines in Reinforcement Learning cs.LG · 2018 · author #1
  19. Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling stat.ML · 2018 · author #2
  20. An online sequence-to-sequence model for noisy speech recognition cs.CL · 2017 · author #4
  21. Filtering Variational Objectives cs.LG · 2017 · author #3
  22. Learning Hard Alignments with Variational Inference cs.AI · 2017 · author #3
  23. Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting cs.CL · 2017 · author #3
  24. REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models cs.LG · 2017 · author #1
  25. Particle Value Functions cs.LG · 2017 · author #3
  26. Regularizing Neural Networks by Penalizing Confident Output Distributions cs.NE · 2017 · author #2
  27. Compacting Neural Network Classifiers via Dropout Training stat.ML · 2016 · author #2

Mentions

  • 2409.12917 #15 · arxiv_oai · confidence 0.70 George Tucker

Frequent Coauthors