Jonathan Uesato
Identifiers
No identifiers captured yet.
Papers (18)
- Reasoning Models Don't Always Say What They Think cs.CL · 2025 · author #4
- OpenAI o1 System Card cs.AI · 2024 · author #121
- Alignment faking in large language models cs.AI · 2024 · author #16
- GPT-4o System Card cs.CL · 2024 · author #196
- Gemini: A Family of Highly Capable Multimodal Models cs.CL · 2023 · author #816
- Solving math word problems with process- and outcome-based feedback cs.LG · 2022 · author #1
- Improving alignment of dialogue agents via targeted human judgements cs.LG · 2022 · author #12
- Scaling Language Models: Methods, Analysis & Insights from Training Gopher cs.CL · 2021 · author #24
- Ethical and social risks of harm from Language Models cs.CL · 2021 · author #5
- Verification of Non-Linear Specifications for Neural Networks cs.LG · 2019 · author #7
- Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures cs.LG · 2018 · author #1
- Robustness via curvature regularization, and vice versa cs.LG · 2018 · author #3
- Strength in Numbers: Trading-off Robustness and Computation via Adversarially-Trained Ensembles cs.NE · 2018 · author #4
- Training verified learners with learned verifiers cs.LG · 2018 · author #6
- Adversarial Risk and the Dangers of Evaluating Against Weak Attacks cs.LG · 2018 · author #1
- Semantic Code Repair using Neuro-Symbolic Transformation Networks cs.AI · 2017 · author #2
- RobustFill: Neural Program Learning under Noisy I/O cs.AI · 2017 · author #2
- Technical Report on the CleverHans v2.1.0 Adversarial Examples Library cs.LG · 2016 · author #19
Mentions
No mention provenance yet.
Frequent Coauthors
- Pushmeet Kohli 7 shared papers
- Geoffrey Irving 5 shared papers
- Brendan O'Donoghue 4 shared papers
- John Mellor 4 shared papers
- Laura Weidinger 4 shared papers
- Lisa Anne Hendricks 4 shared papers
- Maribeth Rauh 4 shared papers
- William Isaac 4 shared papers
- Aidan Clark 3 shared papers
- Amelia Glaese 3 shared papers
- Ananya Kumar 3 shared papers
- Demis Hassabis 3 shared papers
- Doug Fritz 3 shared papers
- Francis Song 3 shared papers
- Iason Gabriel 3 shared papers
- Jacob Devlin 3 shared papers
- Jiahui Yu 3 shared papers
- John Aslanides 3 shared papers
- Keren Gu-Lemberg 3 shared papers
- Koray Kavukcuoglu 3 shared papers