pith. sign in

Syntactically Informed Text Compression with Recurrent Neural Networks

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

We present a self-contained system for constructing natural language models for use in text compression. Our system improves upon previous neural network based models by utilizing recent advances in syntactic parsing -- Google's SyntaxNet -- to augment character-level recurrent neural networks. RNNs have proven exceptional in modeling sequence data such as text, as their architecture allows for modeling of long-term contextual information.

fields

cs.LG 1

years

2023 1

verdicts

ACCEPT 1

representative citing papers

Language Modeling Is Compression

cs.LG · 2023-09-19 · accept · novelty 6.0

Large language models serve as strong general-purpose lossless compressors for text, images, and audio, outperforming domain-specific methods and revealing insights into scaling, tokenization, and in-context learning.

citing papers explorer

Showing 1 of 1 citing paper.

  • Language Modeling Is Compression cs.LG · 2023-09-19 · accept · none · ref 5 · internal anchor

    Large language models serve as strong general-purpose lossless compressors for text, images, and audio, outperforming domain-specific methods and revealing insights into scaling, tokenization, and in-context learning.