pith. sign in

Samuel R. Bowman

Identifiers

  • name variant Samuel R. Bowman 0.60 · backfill

Papers (45)

  1. Reasoning Models Don't Always Say What They Think cs.CL · 2025 · author #12
  2. Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming cs.CL · 2025 · author #15
  3. Alignment faking in large language models cs.AI · 2024 · author #19
  4. Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models cs.AI · 2024 · author #12
  5. LLM Evaluators Recognize and Favor Their Own Generations cs.CL · 2024 · author #2
  6. Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training cs.CR · 2024 · author #32
  7. GPQA: A Graduate-Level Google-Proof Q&A Benchmark cs.AI · 2023 · author #8
  8. Towards Understanding Sycophancy in Language Models cs.CL · 2023 · author #6
  9. Measuring Faithfulness in Chain-of-Thought Reasoning cs.AI · 2023 · author #29
  10. Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting cs.CL · 2023 · author #4
  11. Discovering Language Model Behaviors with Model-Written Evaluations cs.CL · 2022 · author #56
  12. Constitutional AI: Harmlessness from AI Feedback cs.CL · 2022 · author #44
  13. Measuring Progress on Scalable Oversight for Large Language Models cs.HC · 2022 · author #1
  14. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models cs.CL · 2022 · author #353
  15. BBQ: A Hand-Built Bias Benchmark for Question Answering cs.CL · 2021 · author #8
  16. Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark cs.CL · 2019 · author #2
  17. What do you learn from context? Probing for sentence structure in contextualized word representations cs.CL · 2019 · author #9
  18. SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems cs.CL · 2019 · author #8
  19. Identifying and Reducing Gender Bias in Word-Level Language Models cs.CL · 2019 · author #2
  20. On Measuring Social Biases in Sentence Encoders cs.CL · 2019 · author #4
  21. Can You Tell Me How to Get Past Sesame Street? Sentence-Level Pretraining Beyond Language Modeling cs.CL · 2018 · author #16
  22. Verb Argument Structure Alternations in Word and Sentence Embeddings cs.CL · 2018 · author #4
  23. Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks cs.CL · 2018 · author #3
  24. Language Modeling Teaches You More Syntax than Translation Does: Lessons Learned Through Auxiliary Task Analysis cs.CL · 2018 · author #2
  25. XNLI: Evaluating Cross-lingual Sentence Representations cs.CL · 2018 · author #5
  26. Grammar Induction with Neural Language Models: An Unusual Replication cs.CL · 2018 · author #3
  27. A Stable and Effective Learning Strategy for Trainable Greedy Decoding cs.CL · 2018 · author #4
  28. GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding cs.CL · 2018 · author #6
  29. ListOps: A Diagnostic Dataset for Latent Tree Learning cs.CL · 2018 · author #2
  30. Training a Ranking Function for Open-Domain Question Answering cs.CL · 2018 · author #2
  31. Annotation Artifacts in Natural Language Inference Data cs.CL · 2018 · author #5
  32. The Lifted Matrix-Space Model for Semantic Composition cs.CL · 2017 · author #3
  33. Do latent tree learning models identify meaningful structure in sentences? cs.CL · 2017 · author #3
  34. The RepEval 2017 Shared Task: Multi-Genre Natural Language Inference with Sentence Representations cs.CL · 2017 · author #4
  35. Sequential Attention: A Context-Aware Alignment Function for Machine Reading cs.CL · 2017 · author #3
  36. Ruminating Reader: Reasoning with Gated Multi-Hop Attention cs.CL · 2017 · author #2
  37. Discourse-Based Objectives for Fast Unsupervised Sentence Representation Learning cs.CL · 2017 · author #2
  38. A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference cs.CL · 2017 · author #3
  39. A Fast Unified Model for Parsing and Sentence Understanding cs.CL · 2016 · author #1
  40. Generating Sentences from a Continuous Space cs.LG · 2015 · author #1
  41. A large annotated corpus for learning natural language inference cs.CL · 2015 · author #1
  42. Tree-structured composition in neural networks without tree-structured architectures cs.CL · 2015 · author #1
  43. Learning Distributed Word Representations for Natural Logic Reasoning cs.CL · 2014 · author #1
  44. Recursive Neural Networks Can Learn Logical Semantics cs.CL · 2014 · author #1
  45. Can recursive neural tensor networks learn logical reasoning? cs.CL · 2013 · author #1

Mentions

  • 1410.4176 #1 · backfill · confidence 0.70 Samuel R. Bowman
  • 1406.1827 #1 · backfill · confidence 0.70 Samuel R. Bowman
  • 1312.6192 #1 · backfill · confidence 0.70 Samuel R. Bowman
  • 2404.13076 #2 · arxiv_oai · confidence 0.70 Samuel R. Bowman
  • 2501.18837 #15 · arxiv_oai · confidence 0.70 Samuel R. Bowman
  • 2211.03540 #1 · arxiv_oai · confidence 0.70 Samuel R. Bowman
  • 2406.10162 #12 · arxiv_oai · confidence 0.70 Samuel R. Bowman
  • 2110.08193 #8 · arxiv_oai · confidence 0.70 Samuel R. Bowman

Frequent Coauthors