pith. sign in

Peter Henderson

Identifiers

  • name variant Peter Henderson 0.60 · backfill

Papers (28)

  1. Temporally Extended Mixture-of-Experts Models cs.LG · 2026 · author #2
  2. Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism cs.CL · 2026 · author #5
  3. Legal Retrieval for Public Defenders cs.IR · 2026 · author #7
  4. Legal Alignment for Safe and Ethical AI cs.CY · 2026 · author #12
  5. How Can AI Augment Access to Justice? Public Defenders' Perspectives on AI Adoption cs.CY · 2025 · author #4
  6. Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! cs.CL · 2023 · author #7
  7. Holistic Evaluation of Language Models cs.CL · 2022 · author #36
  8. BLOOM: A 176B-Parameter Open-Access Multilingual Language Model cs.CL · 2022 · author #100
  9. On the Opportunities and Risks of Foundation Models cs.LG · 2021 · author #35
  10. Separating value functions across time-scales cs.LG · 2019 · author #2
  11. Distilling Information from a Flood: A Possibility for the Use of Meta-Analysis and Systematic Review in Machine Learning Research cs.DL · 2018 · author #1
  12. An Introduction to Deep Reinforcement Learning cs.LG · 2018 · author #2
  13. The RLLChatbot: a solution to the ConvAI challenge cs.CL · 2018 · author #3
  14. Adversarial Gain cs.LG · 2018 · author #1
  15. Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods cs.LG · 2018 · author #1
  16. Reward Estimation for Variance Reduction in Deep Reinforcement Learning cs.LG · 2018 · author #2
  17. Learning Robust Dialog Policies in Noisy Environments cs.CL · 2017 · author #5
  18. Bayesian Policy Gradients via Alpha Divergence Dropout Inference cs.LG · 2017 · author #1
  19. Ethical Challenges in Data-Driven Dialogue Systems cs.CL · 2017 · author #1
  20. Underwater Multi-Robot Convoying using Visual Tracking by Detection cs.RO · 2017 · author #3
  21. Cost Adaptation for Robust Decentralized Swarm Behaviour cs.AI · 2017 · author #1
  22. OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning cs.LG · 2017 · author #1
  23. Deep Reinforcement Learning that Matters cs.LG · 2017 · author #1
  24. Benchmark Environments for Multitask Learning in Continuous Domains cs.AI · 2017 · author #1
  25. Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control cs.LG · 2017 · author #2
  26. An Analysis of Parallelized Motion Masking Using Dual-Mode Single Gaussian Models cs.CV · 2017 · author #1
  27. Chaotic Memory Randomization for Securing Embedded Systems cs.CR · 2016 · author #1
  28. A Survey of Available Corpora for Building Data-Driven Dialogue Systems cs.CL · 2015 · author #3

Mentions

  • 2601.04175 #12 · arxiv_oai · confidence 0.70 Peter Henderson
  • 2601.14348 #7 · arxiv_oai · confidence 0.70 Peter Henderson

Frequent Coauthors