Rare Event Analysis of Large Language Models

Dominic C. Rose; Edward Gillman; Jake McAllister Dorman; Jamie F. Mair; Juan P. Garrahan

arxiv: 2602.06791 · v2 · pith:WI3A5CYInew · submitted 2026-02-06 · 💻 cs.LG · cond-mat.dis-nn· cond-mat.stat-mech

Rare Event Analysis of Large Language Models

Jake McAllister Dorman , Edward Gillman , Dominic C. Rose , Jamie F. Mair , Juan P. Garrahan This is my paper

classification 💻 cs.LG cond-mat.dis-nncond-mat.stat-mech

keywords eventsmodelsrareanalysisduringherelanguagelarge

0 comments

read the original abstract

Being probabilistic models, during inference large language models (LLMs) display rare events: behaviour that is far from typical but highly significant. By definition all rare events are hard to see, but the enormous scale of LLM usage means that events completely unobserved during development are likely to become prominent in deployment. Here we present an end-to-end framework for the systematic analysis of rare events in LLMs. We provide a practical implementation spanning theory, efficient generation strategies, probability estimation and error analysis, which we illustrate with concrete examples. We outline extensions and applications to other models and contexts, highlighting the generality of the concepts and techniques presented here.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Training ML Models with Predictable Failures
cs.LG 2026-05 unverdicted novelty 5.0

A finite-k decomposition reveals a bias toward over-prediction in failure rate extrapolation from evaluation data, addressed by a new forecastability loss that improves held-out forecast accuracy in language-model and...