GLTR: Statistical Detection and Visualization of Generated Text

Alexander M. Rush; Hendrik Strobelt; Sebastian Gehrmann

arxiv: 1906.04043 · v1 · pith:MYHIOW6Cnew · submitted 2019-06-10 · 💻 cs.CL · cs.AI· cs.HC· cs.LG

GLTR: Statistical Detection and Visualization of Generated Text

Sebastian Gehrmann , Hendrik Strobelt , Alexander M. Rush This is my paper

classification 💻 cs.CL cs.AIcs.HCcs.LG

keywords gltrtextgenerateddetectdetectinggenerationmethodsstatistical

0 comments

read the original abstract

The rapid improvement of language models has raised the specter of abuse of text generation systems. This progress motivates the development of simple methods for detecting generated text that can be used by and explained to non-experts. We develop GLTR, a tool to support humans in detecting whether a text was generated by a model. GLTR applies a suite of baseline statistical methods that can detect generation artifacts across common sampling schemes. In a human-subjects study, we show that the annotation scheme provided by GLTR improves the human detection-rate of fake text from 54% to 72% without any prior training. GLTR is open-source and publicly deployed, and has already been widely used to detect generated outputs

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 8 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Language Models are Few-Shot Learners
cs.CL 2020-05 accept novelty 8.0

GPT-3 shows that scaling an autoregressive language model to 175 billion parameters enables strong few-shot performance across diverse NLP tasks via in-context prompting without fine-tuning.
SoK: Exposing the Generation and Detection Gaps in LLM-Generated Phishing
cs.CR 2025-08 unverdicted novelty 7.0

This SoK paper introduces a nine-stage taxonomy for LLM guardrail breaches in phishing, characterizes evasion and manipulation tactics, and identifies a dynamic-offense versus static-defense asymmetry.
ExaGPT: Example-Based Machine-Generated Text Detection for Human Interpretability
cs.CL 2025-02 unverdicted novelty 7.0

ExaGPT uses span-level similarity retrieval from human and LLM datastores to detect machine-generated text while supplying the matching spans as human-interpretable evidence, achieving up to 37-point accuracy gains ov...
GigaCheck: Detecting LLM-generated Content via Object-Centric Span Localization
cs.CL 2024-10 unverdicted novelty 6.0

GigaCheck detects LLM-generated text at both document and span levels by combining fine-tuned language-model embeddings with a DETR-like architecture that treats generated intervals as detectable objects.
Can AI-Generated Text be Reliably Detected?
cs.CL 2023-03 unverdicted novelty 6.0

Recursive paraphrasing attacks substantially lower detection rates for multiple AI text detectors with only minor quality loss, while a theoretical analysis ties best-case AUROC to total variation distance between hum...
Multi-Level Contextual Token Relation Modeling for Machine-Generated Text Detection
cs.CL 2026-05 unverdicted novelty 5.0

A multi-level framework that models local and global relations among token detection scores to improve machine-generated text detection with low overhead.
Detecting LLM-Assisted Academic Dishonesty using Keystroke Dynamics
cs.HC 2025-11 unverdicted novelty 5.0

Keystroke dynamics models outperform text-only detectors for spotting LLM-assisted academic dishonesty in practical scenarios, though performance drops under adversarial conditions.
Findings of the Counter Turing Test: AI-Generated Text Detection
cs.CL 2026-05 unverdicted novelty 2.0

Shared task findings show F1=1.0000 for binary AI text detection and 0.9531 for model attribution using fine-tuned DeBERTa and BART transformers with ensembles.