Tomasz Korbak
Identifiers
No identifiers captured yet.
Papers (4)
- Towards Understanding Sycophancy in Language Models cs.CL · 2023 · author #3
- Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback cs.AI · 2023 · author #8
- Exploiting Unsupervised Pre-training and Automated Feature Engineering for Low-resource Hate Speech Detection in Polish cs.CL · 2019 · author #4
- Fine-tuning Tree-LSTM for phrase-level sentiment classification on a Polish dependency treebank. Submission to PolEval task 2 cs.CL · 2017 · author #1
Mentions
No mention provenance yet.
Frequent Coauthors
- Amanda Askell 1 shared papers
- Anand Siththaranjan 1 shared papers
- Anca Dragan 1 shared papers
- Andi Peng 1 shared papers
- Charbel-Rapha\"el Segerie 1 shared papers
- Claudia Shi 1 shared papers
- David Duvenaud 1 shared papers
- David Krueger 1 shared papers
- David Lindner 1 shared papers
- Da Yan 1 shared papers
- Dmitrii Krasheninnikov 1 shared papers
- Dorsa Sadigh 1 shared papers
- Dylan Hadfield-Menell 1 shared papers
- Erdem B{\i}y{\i}k 1 shared papers
- Eric J. Michaud 1 shared papers
- Esin Durmus 1 shared papers
- Ethan Perez 1 shared papers
- Jacob Pfau 1 shared papers
- Javier Rando 1 shared papers
- J\'er\'emy Scheurer 1 shared papers