GraphLit: Learning Text-Enriched Dynamic Character Network Representations for Literary Study

Christophe Cerisara; Elena V. Epure; Gaspard Michel; Mirella Lapata; Romain Hennequin

arxiv: 2605.28643 · v1 · pith:KOGN6E7Bnew · submitted 2026-05-27 · 💻 cs.CL


classification 💻 cs.CL

keywords graphlit, literary, character, dhcns, dynamic, graphs, characters, heterogeneous


Methods to represent literary texts as graphs or sequences of graphs mainly focus on representing character interactions, and often overlook another crucial aspect: the textual context in which characters interact. We introduce Dynamic Heterogeneous Character Networks (DHCNs), which organize long novels into temporally localized heterogeneous graphs that align characters with their textual contexts. We extract around 20,000 DHCNs from Project Gutenberg, and propose GraphLit, a self-supervised learning framework that learns rich literary representations through a masked graph autoencoder objective. Across a wide range of 12 character-related tasks, GraphLit improves over text-only and graph-only baselines, particularly on tasks requiring contextual understanding. Finally, we demonstrate the applicability of DHCNs and GraphLit for literary analysis by studying the link between narrative non-linearity and dynamic social features.
