An Empirical Study of Smoothing Techniques for Language Modeling

arxiv: cmp-lg/9606011 · v1 · submitted 1996-06-11 · cmp-lg · cs.CL

Stanley F. Chen, Joshua T. Goodman (Harvard University)

classification cmp-lg cs.CL

keywords smoothing, techniques, data, empirical, language, methods, modeling, versus

We present an extensive empirical comparison of several smoothing techniques in the domain of language modeling, including those described by Jelinek and Mercer (1980), Katz (1987), and Church and Gale (1991). We investigate for the first time how factors such as training data size, corpus (e.g., Brown versus Wall Street Journal), and n-gram order (bigram versus trigram) affect the relative performance of these methods, which we measure through the cross-entropy of test data. In addition, we introduce two novel smoothing techniques, one a variation of Jelinek-Mercer smoothing and one a very simple linear interpolation technique, both of which outperform existing methods.
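As a rough illustration of the evaluation setup the abstract describes, the sketch below smooths a bigram model by linear interpolation with a unigram model and scores it by the cross-entropy of test data, in bits per word. It is a minimal sketch under stated assumptions, not the paper's methods: the single fixed weight `lam`, the add-one unigram estimate, and the names `train`, `prob`, and `cross_entropy` are all hypothetical choices made here. Jelinek-Mercer smoothing proper estimates its interpolation weights from held-out data rather than fixing them by hand.

```python
import math
from collections import Counter


def train(tokens):
    """Collect unigram, context, and bigram counts from a token list."""
    unigrams = Counter(tokens)
    contexts = Counter(tokens[:-1])  # tokens that have a successor
    bigrams = Counter(zip(tokens, tokens[1:]))
    return unigrams, contexts, bigrams


def prob(word, prev, model, vocab, lam=0.7):
    """Interpolated estimate of P(word | prev).

    Assumption: `lam` is one fixed weight, and the unigram estimate uses
    add-one smoothing over `vocab` so unseen words get nonzero mass.
    """
    unigrams, contexts, bigrams = model
    total = sum(unigrams.values())
    p_uni = (unigrams[word] + 1) / (total + len(vocab))
    if contexts[prev] == 0:
        # Unseen context: fall back to the unigram estimate alone.
        return p_uni
    p_ml = bigrams[(prev, word)] / contexts[prev]
    return lam * p_ml + (1 - lam) * p_uni


def cross_entropy(test_tokens, model, vocab, lam=0.7):
    """Average negative log2 probability per predicted word."""
    pairs = list(zip(test_tokens, test_tokens[1:]))
    log_sum = sum(math.log2(prob(w, prev, model, vocab, lam)) for prev, w in pairs)
    return -log_sum / len(pairs)


if __name__ == "__main__":
    train_tokens = "the cat sat on the mat".split()
    vocab = set(train_tokens) | {"dog"}  # closed vocabulary, assumed known
    model = train(train_tokens)
    print(cross_entropy("the dog sat".split(), model, vocab))
```

Lower cross-entropy means the model assigns higher probability to the test data, so comparing this number across smoothing methods, training sizes, or n-gram orders mirrors the kind of comparison the abstract reports.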

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Beyond Monolingual Assumptions: A Survey of Code-Switched NLP in the Era of Large Language Models across Modalities
cs.CL 2025-10 accept novelty 7.0

A comprehensive survey of code-switched NLP research with LLMs across modalities, covering 327 studies, 15+ tasks, 30+ datasets, and 80+ languages while outlining challenges and a future roadmap.