BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model

Alex Wang; Kyunghyun Cho

arXiv: 1902.04094 · v2 · pith:OY5ZGURZnew · submitted 2019-02-11 · cs.CL · cs.LG

keywords: bert, language, model, field, generations, markov, random, sentences

We show that BERT (Devlin et al., 2018) is a Markov random field language model. This formulation gives rise to a natural procedure to sample sentences from BERT. We generate from BERT and find that it can produce high-quality, fluent generations. Compared to the generations of a traditional left-to-right language model, BERT generates sentences that are more diverse but of slightly worse quality.
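
The abstract does not spell out the sampling procedure. One natural reading of "sampling from a Markov random field" is a Gibbs-style sampler: start from an all-mask sequence, repeatedly pick a position, mask it, and resample it from the model's conditional distribution given the other tokens. The sketch below illustrates that loop under stated assumptions. The names `toy_conditional` and `gibbs_sample` are hypothetical, and the toy conditional is a stand-in for BERT's masked-token distribution, not the authors' implementation.

```python
import random

MASK = "[MASK]"


def toy_conditional(tokens, position, vocab):
    """Hypothetical stand-in for a masked language model.

    Returns unnormalized weights over `vocab` for the masked `position`,
    given the remaining tokens. This toy rule only down-weights words
    equal to an immediate neighbor; a real model would score the whole
    context.
    """
    neighbors = set()
    if position > 0:
        neighbors.add(tokens[position - 1])
    if position < len(tokens) - 1:
        neighbors.add(tokens[position + 1])
    return [0.1 if word in neighbors else 1.0 for word in vocab]


def gibbs_sample(conditional, vocab, length, sweeps, seed=0):
    """Gibbs-style sampling over a fixed-length token sequence.

    Starts from an all-mask sequence. Each sweep visits every position
    once in random order, masks it, and resamples it from the
    conditional distribution given the other tokens.
    """
    rng = random.Random(seed)
    tokens = [MASK] * length
    for _ in range(sweeps):
        order = list(range(length))
        rng.shuffle(order)
        for i in order:
            tokens[i] = MASK  # hide the current token before resampling
            weights = conditional(tokens, i, vocab)
            tokens[i] = rng.choices(vocab, weights=weights, k=1)[0]
    return tokens


if __name__ == "__main__":
    vocab = ["the", "cat", "sat", "on", "mat"]
    print(" ".join(gibbs_sample(toy_conditional, vocab, length=6, sweeps=3)))
```

Swapping `toy_conditional` for a function that queries a real masked language model would keep the loop unchanged; only the source of the per-position weights differs.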

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

CellxPert: Inference-Time MCMC Steering of a Multi-Omics Single-Cell Foundation Model for In-Silico Perturbation
q-bio.GN 2026-04 unverdicted novelty 7.0

CellxPert uses inference-time MCMC steering on a multi-omics single-cell foundation model to predict genome-wide transcriptomic responses to gene perturbations and outperforms baselines on cell-type annotation, pertur...

Discrete Stochastic Localization for Non-autoregressive Generation
cs.LG 2026-02 unverdicted novelty 7.0

Discrete Stochastic Localization lets a single trained network support an entire family of per-token SNR paths for discrete sequence generation, with masked diffusion as a special case, and improves MAUVE scores when ...

Interpolating Discrete Diffusion Models with Controllable Resampling
cs.LG 2026-04 unverdicted novelty 6.0

IDDM interpolates diffusion transitions with a resampling mechanism to lessen dependence on intermediate latents and improve sample quality over masked and uniform discrete diffusion models.

Patent Claim Generation by Fine-Tuning OpenAI GPT-2
cs.CL 2019-07 unverdicted novelty 5.0

Fine-tunes GPT-2 on patent claims, probes training steps, analyzes conditional and unconditional sampling outputs, proposes a new sampling method, and releases an email bot for exploration.