pith. sign in

arxiv: 2601.19926 · v2 · pith:NT7ZDF6Lnew · submitted 2026-01-09 · 💻 cs.CL · cs.AI

The Grammar of Transformers: A Systematic Review of Interpretability Research on Syntactic Knowledge in Language Models

classification 💻 cs.CL cs.AI
keywords syntacticmodelsknowledgeperformancephenomenaremainstlmslanguage
0
0 comments X
read the original abstract

We present a systematic review of 337 articles evaluating the syntactic abilities of Transformer-based language models (TLMs), reporting on over 3,000 datapoints spanning a wide range of syntactic phenomena, languages, models, and methods. We take the data to collectively show that TLMs encode a non-trivial amount of syntactic knowledge. Behavioral evidence shows strong performance on formal syntactic phenomena, but weaker and more variable performance on phenomena at the syntax-semantics interface. Performance is also consistently lower for languages with less digital support. Probing and mechanistic studies further support the presence of syntactic knowledge in TLMs. Yet, because most work remains observational and current approaches are methodologically heterogeneous, insight into the detailed computational mechanisms underlying syntactic processing remains limited. At the same time, the literature remains heavily concentrated on English and BERT-like models. We discuss the implications of our results and provide recommendations for future research.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.