Mrinank Sharma
Identifiers
- name variant Mrinank Sharma 0.60 · backfill
Papers (4)
- Chain-of-Thought Hijacking cs.AI · 2025 · author #4
- Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming cs.CL · 2025 · author #1
- Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training cs.CR · 2024 · author #22
- Towards Understanding Sycophancy in Language Models cs.CL · 2023 · author #1
Mentions
- 2510.26418 #4 · arxiv_oai · confidence 0.70 Mrinank Sharma
- 2501.18837 #1 · arxiv_oai · confidence 0.70 Mrinank Sharma
Frequent Coauthors
- Amanda Askell 3 shared papers
- Ethan Perez 3 shared papers
- Meg Tong 3 shared papers
- Samuel R. Bowman 3 shared papers
- Cem Anil 2 shared papers
- David Duvenaud 2 shared papers
- Fazl Barez 2 shared papers
- Jared Kaplan 2 shared papers
- Jesse Mu 2 shared papers
- Kamal Ndousse 2 shared papers
- Logan Graham 2 shared papers
- Newton Cheng 2 shared papers
- Nicholas Schiefer 2 shared papers
- Shauna Kravec 2 shared papers
- Adam Jermyn 1 shared papers
- Alex Silverstein 1 shared papers
- Alwin Peng 1 shared papers
- Andy Dau 1 shared papers
- Anjali Gopal 1 shared papers
- Ansh Radhakrishnan 1 shared papers