pith. machine review for the scientific record.

Jonathan Uesato

Identifiers

No identifiers captured yet.

Papers (18)

  1. Reasoning Models Don't Always Say What They Think · cs.CL · 2025 · author #4
  2. OpenAI o1 System Card · cs.AI · 2024 · author #121
  3. Alignment faking in large language models · cs.AI · 2024 · author #16
  4. GPT-4o System Card · cs.CL · 2024 · author #196
  5. Gemini: A Family of Highly Capable Multimodal Models · cs.CL · 2023 · author #816
  6. Solving math word problems with process- and outcome-based feedback · cs.LG · 2022 · author #1
  7. Improving alignment of dialogue agents via targeted human judgements · cs.LG · 2022 · author #12
  8. Scaling Language Models: Methods, Analysis & Insights from Training Gopher · cs.CL · 2021 · author #24
  9. Ethical and social risks of harm from Language Models · cs.CL · 2021 · author #5
  10. Verification of Non-Linear Specifications for Neural Networks · cs.LG · 2019 · author #7
  11. Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures · cs.LG · 2018 · author #1
  12. Robustness via curvature regularization, and vice versa · cs.LG · 2018 · author #3
  13. Strength in Numbers: Trading-off Robustness and Computation via Adversarially-Trained Ensembles · cs.NE · 2018 · author #4
  14. Training verified learners with learned verifiers · cs.LG · 2018 · author #6
  15. Adversarial Risk and the Dangers of Evaluating Against Weak Attacks · cs.LG · 2018 · author #1
  16. Semantic Code Repair using Neuro-Symbolic Transformation Networks · cs.AI · 2017 · author #2
  17. RobustFill: Neural Program Learning under Noisy I/O · cs.AI · 2017 · author #2
  18. Technical Report on the CleverHans v2.1.0 Adversarial Examples Library · cs.LG · 2016 · author #19

Mentions

No mention provenance yet.

Frequent Coauthors