pith. sign in

arxiv: 1903.03260 · v1 · pith:KY6NSQPMnew · submitted 2019-03-08 · 💻 cs.CL

Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State

classification 💻 cs.CL
keywords trainedmodelslargelstmrnngsmallstatesyntactic
0
0 comments X
read the original abstract

We deploy the methods of controlled psycholinguistic experimentation to shed light on the extent to which the behavior of neural network language models reflects incremental representations of syntactic state. To do so, we examine model behavior on artificial sentences containing a variety of syntactically complex structures. We test four models: two publicly available LSTM sequence models of English (Jozefowicz et al., 2016; Gulordava et al., 2018) trained on large datasets; an RNNG (Dyer et al., 2016) trained on a small, parsed dataset; and an LSTM trained on the same small corpus as the RNNG. We find evidence that the LSTMs trained on large datasets represent syntactic state over large spans of text in a way that is comparable to the RNNG, while the LSTM trained on the small dataset does not or does so only weakly.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Readers make targeted regressions to plausible errors in reanalysis of "noisy-channel garden-path" sentences

    cs.CL 2026-05 unverdicted novelty 5.0

    Readers direct regressions to plausible error sites in noisy-channel garden-path sentences, consistent with Bayesian reanalysis under a noisy-channel model.