pith. machine review for the scientific record. sign in

arxiv: 1906.04043 · v1 · submitted 2019-06-10 · 💻 cs.CL · cs.AI· cs.HC· cs.LG

Recognition: unknown

GLTR: Statistical Detection and Visualization of Generated Text

Authors on Pith no claims yet
classification 💻 cs.CL cs.AIcs.HCcs.LG
keywords gltrtextgenerateddetectdetectinggenerationmethodsstatistical
0
0 comments X
read the original abstract

The rapid improvement of language models has raised the specter of abuse of text generation systems. This progress motivates the development of simple methods for detecting generated text that can be used by and explained to non-experts. We develop GLTR, a tool to support humans in detecting whether a text was generated by a model. GLTR applies a suite of baseline statistical methods that can detect generation artifacts across common sampling schemes. In a human-subjects study, we show that the annotation scheme provided by GLTR improves the human detection-rate of fake text from 54% to 72% without any prior training. GLTR is open-source and publicly deployed, and has already been widely used to detect generated outputs

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Language Models are Few-Shot Learners

    cs.CL 2020-05 accept novelty 8.0

    GPT-3 shows that scaling an autoregressive language model to 175 billion parameters enables strong few-shot performance across diverse NLP tasks via in-context prompting without fine-tuning.