A Data-Oriented Model of Literary Language

Andreas van Cranenburgh; Rens Bod

arxiv: 1701.03329 · v2 · pith:IAHF73WVnew · submitted 2017-01-12 · 💻 cs.CL

A Data-Oriented Model of Literary Language

Andreas van Cranenburgh , Rens Bod This is my paper

classification 💻 cs.CL

keywords literaryfeaturesmodelratingsstandardsyntacticapplyaside

0 comments

read the original abstract

We consider the task of predicting how literary a text is, with a gold standard from human ratings. Aside from a standard bigram baseline, we apply rich syntactic tree fragments, mined from the training set, and a series of hand-picked features. Our model is the first to distinguish degrees of highly and less literary novels using a variety of lexical and syntactic features, and explains 76.0 % of the variation in literary ratings.

This paper has not been read by Pith yet.

A Data-Oriented Model of Literary Language

discussion (0)