Anca Dragan

Identifiers

name variant Anca Dragan 0.60 · backfill

Papers (15)

Benchmarking and Improving Monitors for Out-Of-Distribution Alignment Failure in LLMs cs.AI · 2026 · author #3
Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety cs.AI · 2025 · author #10
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025 · author #463
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2 cs.LG · 2024 · author #8
Gemma 2: Improving Open Language Models at a Practical Size cs.CL · 2024 · author #184
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context cs.CL · 2024 · author #631
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback cs.AI · 2023 · author #29
Preferences Implicit in the State of the World cs.LG · 2019 · author #5
The Assistive Multi-Armed Bandit cs.LG · 2019 · author #4
Should Robots be Obedient? cs.AI · 2017 · author #3
Translating Neuralese cs.CL · 2017 · author #2
DART: Noise Injection for Robust Imitation Learning cs.LG · 2017 · author #4
The Off-Switch Game cs.AI · 2016 · author #2
Comparing Human-Centric and Robot-Centric Sampling for Robot Deep Learning from Demonstrations cs.RO · 2016 · author #7
Functional Gradient Motion Planning in Reproducing Kernel Hilbert Spaces cs.RO · 2016 · author #2

Mentions

2605.21602 #3 · arxiv_oai · confidence 0.70 Anca Dragan
2507.11473 #10 · arxiv_oai · confidence 0.70 Anca Dragan

Frequent Coauthors

Rohin Shah 5 shared papers
Dylan Hadfield-Menell 4 shared papers
Alanna Walton 3 shared papers
Alek Andreev 3 shared papers
Aliaksei Severyn 3 shared papers
Alicia Parrish 3 shared papers
Allan Dafoe 3 shared papers
Anton Tsitsulin 3 shared papers
Behnam Neyshabur 3 shared papers
Charlie Chen 3 shared papers
Charline Le Lan 3 shared papers
Christopher A. Choquette-Choo 3 shared papers
Chris Welty 3 shared papers
Clement Farabet 3 shared papers
Dave Orr 3 shared papers
David Lindner 3 shared papers
Demis Hassabis 3 shared papers
Elena Buchatskaya 3 shared papers
Eli Collins 3 shared papers
Emma Wang 3 shared papers